Research Scientist/Engineer, Efficient ML Systems

Cox Exponential
San Francisco, CA
Job Description
Role Overview

As an AI Research Scientist (Efficient ML Systems) at Goaly, you will research and build the systems that make frontier-scale models practical. This role sits at the intersection of algorithms, systems, and hardware efficiency.

What You Will Do

Design and evaluate new training and inference techniques, prototype them in real systems, and push them to production-scale workloads. Write real systems code, run large-scale experiments, and directly shape how modern LLMs and RL systems are trained and deployed.

Why It Might Be a Fit

This is not a paper-only role. Your work will ship directly into our core platform or lead to publications at top venues such as NeurIPS, ICML, ICLR, or CVPR.

Requirements

  • Ph.D. or Master's degree in CS, AI, Systems, or related fields
  • Strong foundation in LLM or large-scale ML training, including Transformers, attention mechanisms, distributed training, and optimization methods
  • Experience or strong interest in agentic RL or large-scale reinforcement learning systems, including stability, scalability, or long-horizon training challenges
  • Demonstrated interest in efficiency-focused research, such as training acceleration, memory optimization, parallelism, kernels, or RL system robustness
  • Proficiency in PyTorch or JAX, with a clean coding style and strong command of Python
  • Adaptability: A fast learner with a strong sense of responsibility, capable of wearing multiple hats and handling cross-stack challenges