Research Engineer, Distributed Training

Harmonic
Palo Alto, CA
Category Research
Job Description
Harmonic is a startup building the world's most advanced mathematical reasoning engine. We are seeking a Research Engineer with expertise in model training to focus on reasoning in formal and informal settings.

Requirements

  • Conduct research and implement solutions in areas such as model architecture, algorithms, data processing, and optimization.
  • Optimize and scale our training infrastructure to improve efficiency and reliability in a reinforcement learning setting.
  • Expertise in Python and PyTorch, experience with distributed training, parallel computing, and GPU acceleration, and strong experience with large-scale GPU clusters, HPC environments, and job scheduling/orchestration tools (e.g., SLURM or Kubernetes).
  • 2+ years of experience focused on training large language models, knowledge of cutting-edge models, and experience building evaluations for model capability.

Benefits

  • Unlimited PTO
  • 401(k) matching
  • 100% employer-paid health, vision, and dental benefits for employees and 50% coverage for dependents
  • Health Savings Account (HSA) available for qualifying health plans
]]>