Member of Technical Staff, Model Evaluation

Inception

Palo Alto, CA

Category: Research

Job Description

We are seeking experienced engineers and scientists to develop evaluation metrics and systems for advanced AI models. The ideal candidate will design and maintain robust evaluation frameworks, collaborate with research and training teams, and partner with product and customer-facing teams to advance frontier LLM performance.

Requirements

BS/MS/PhD in Computer Science, Machine Learning, Statistics, or a related field (or equivalent experience)
At least 2 years of experience in ML evaluation, applied ML research, or a related engineering role
Strong understanding of LLM fundamentals
Proficiency in Python and ML frameworks such as PyTorch
Experience designing and implementing evaluation metrics and benchmarks for generative models
Solid foundation in statistics, experimental design, and hypothesis testing

Benefits

Competitive salary and equity in a rapidly growing startup
Access to the latest GPU hardware and cloud resources
Flexible vacation and paid time off (PTO)
Health, dental, and vision insurance
A collaborative and inclusive culture
