Login or register to see your saved jobs and receive scout emails
Login or register to find a job
Job ID : 1593422 Date Updated : May 26th, 2026

AI Evaluation - Growing Gen AI Firm (No Japanese Needed)

Hiring Company Growing Gen AI Firm
Location Tokyo - 23 Wards
Job Type Permanent Full-time
Salary Negotiable, based on experience

Work Style

Remote Work and WFH Flex Time

Job Description

About the company: 

Join a rapidly growing Gen AI Firm / enterprise AI SaaS Company building LLM-based tools for businesses such as chat, content generation, workflow automation etc. 
 

About the role: 

They are currently hiring an AI Evaluation Scientist to lead the design, development, and operation of infrastructure for evaluating the quality of AI agents. This role involves researching and developing evaluation metrics, designing and building automated evaluation pipelines, driving product quality improvements through statistical experimental design, and ensuring the reliability and overall quality of production systems. 

You will also be collaborating with various members of the development organization including Research Engineers, Software Engineers (AI Platform) and Product Managers. 

General Requirements

Minimum Experience Level Over 3 years
Career Level Mid Career
Minimum English Level Business Level
Minimum Japanese Level None
Minimum Education Level Post Grad Degree (PHD/MBA etc)
Visa Status No permission to work in Japan required

Required Skills

They are looking for candidates who: 

  • Have a Master's degree in Computer Science, Statistics, Machine Learning or similar fields 
  • Have more than 3 years of experience as a ML Engineer/ Data Scientist/ Research Engineer or in any ML/AI evaluation related roles 
  • Are an expert in LLM/ generative AI evaluation methods 
  • Have knowledge in statistics and experimental design
  • Have experience building ML and evaluation pipelines in Python
  • Can design custom evaluation metrics 

Job Location

  • Tokyo - 23 Wards

Work Conditions

Job Type Permanent Full-time
Salary Negotiable, based on experience
Industry Software

Job Category