Product Manager - AI Inference & Model Serving

Mirantis
Austin, TX
Remote
Job Description
Mirantis is looking for a commercially driven, deeply technical Product Manager to own AI inference and model serving for k0rdent AI, our control plane for GPU infrastructure and distributed AI workloads.

Requirements

  • Own product strategy and solution development for inference products across on-premises, cloud, and edge environments.
  • Define positioning grounded in measurable outcomes: latency distributions, throughput per GPU, utilization, tail reliability, and cost per tokens.
  • Drive go-to-market execution: pricing and packaging, reference architectures, sizing guides, PoC playbooks, and direct engagement with customers, analysts, and ecosystem partners.
  • 7+ years in product management, technical product management, or a senior technical role owning AI/ML and inference product(s).
  • Strong understanding of production AI inference, including model serving, serverless execution, dedicated endpoints, autoscaling, routing, workload placement, observability, and reliability.
  • Proven capability to reason about performance trade-offs across GPU, network, storage, orchestration, and runtime layers, and to translate low-level technical capability into business value such as TTFT, throughput per GPU, and TCO.

Benefits

  • Professional development and training.
  • Attend conferences and working groups.
  • Customized workstation (macOS, Windows).
  • Competitive compensation package with strong benefits plan and stock options.
]]>