Member of Technical Staff, Model Evaluation

Inception
Palo Alto, CA
Category Research
Job Description
We are seeking experienced engineers and scientists to develop evaluation metrics and systems for advanced AI models. The ideal candidate will design and maintain robust evaluation frameworks, collaborate with research and training teams, and partner with product and customer-facing teams to advance frontier LLM performance.

Requirements

  • BS/MS/PhD in Computer Science, Machine Learning, Statistics, or a related field (or equivalent experience)
  • At least 2 years of experience in ML evaluation, applied ML research, or a related engineering role
  • Strong understanding of LLM fundamentals
  • Proficiency in Python and ML frameworks such as PyTorch
  • Experience designing and implementing evaluation metrics and benchmarks for generative models
  • Solid foundation in statistics, experimental design, and hypothesis testing

Benefits

  • Competitive salary and equity in a rapidly growing startup
  • Access to the latest GPU hardware and cloud resources
  • Flexible vacation and paid time off (PTO)
  • Health, dental, and vision insurance
  • A collaborative and inclusive culture
]]>