Member of Technical Staff, Kernels

Inception
Palo Alto, CA
Job Description
Inception creates the world’s fastest, most efficient AI models. We are looking for engineers and scientists to design, optimize, and maintain the compute foundations that power large-scale language model training. Your work will make inference faster, more cost-effective, and more reliable.

Responsibilities

  • Design and implement custom ML kernels (e.g., CUDA, CuTe, Triton) for core LLM operations
  • Design compute primitives that reduce memory bandwidth bottlenecks and improve kernel compute efficiency
  • Contribute to infrastructure stability and scalability, ensuring reproducibility, consistency across precision formats, and high utilization of compute resources

Benefits

  • Competitive salary
  • Equity in a rapidly growing startup
  • Access to the latest GPU hardware and cloud resources
  • Flexible vacation and paid time off (PTO)
  • Health, dental, and vision insurance
  • A collaborative and inclusive culture