Member of Technical Staff - ML Training Systems

Modal Labs
San Francisco, NY
Job Description
We are looking for strong engineers with experience training production machine learning models to contribute to open-source projects and evolve Modal's infrastructure to train the next generation of language models.

Requirements

  • 5+ years of experience writing high-quality, high-performance code
  • Experience working with torch and high-level training frameworks (Huggingface, verl, slime)
  • Experience with ML training optimization (tell us a story about eliminating data loading bottlenecks, overlapping communications with compute, rewriting a trainer to handle off-policy rollouts, etc.)
  • Ability to work in-person, in our NYC or San Francisco office

Benefits

  • Generous Paid Time Off
  • 401k Matching
  • Retirement Plan
  • Visa Sponsorship
  • Four Day Work Week
  • Generous Parental Leave
  • Tuition Reimbursement
  • Relocation Assistance
]]>