Member of Technical Staff, Kernels

Inception

Palo Alto, CA

Category Information Technology

Job Description

Inception creates the world’s fastest, most efficient AI models. We are looking for engineers and scientists to design, optimize, and maintain the compute foundations that power large-scale language model training. Your work will make inference faster, more cost-effective, and more reliable.

Requirements

Design and implement custom ML kernels (e.g., CUDA, CuTe, Triton) for core LLM operations
Design and think through compute primitives to reduce memory bandwidth bottlenecks and improve kernel compute efficiency
Contribute to infrastructure stability and scalability, ensuring reproducibility, consistency across precision formats, and high utilization of compute resources

Benefits

Competitive salary
Equity in a rapidly growing startup
Access to the latest GPU hardware and cloud resources
Flexible vacation and paid time off (PTO)
Health, dental, and vision insurance
A collaborative and inclusive culture

]]>