Gen AI Engineer

Inizio Partners Corp
Houston, TX
Job Description
Translate business requirements into robust, scalable AI solutions using RAG, embeddings, vector search, and fine-tuning. Build and maintain APIs, services, and reusable components in Python to support AI applications. Deploy and monitor AI models in cloud-native environments (GCP, Azure) leveraging Kubernetes, serverless, and MLOps pipelines.

Requirements

  • Bachelors or Masters degree in Computer Science, AI/ML, or related technical field.
  • 5+ years of software development experience, with strong proficiency in Python.
  • 3 - 5+ years hands-on experience building GenAI/LLM-based applications, with proven success from PoC to production deployment.
  • Proficiency in designing retrieval pipelines (document loaders, chunking strategies, embeddings, vector databases like FAISS, Pinecone, ChromaDB).
  • Expertise in LLM APIs (OpenAI, Claude, Gemini, etc.), prompt engineering, and fine-tuning.
  • Experience with cloud platforms (GCP, Azure), containerization (Docker, Kubernetes), and MLOps (CI/CD, monitoring).
  • Strong understanding of API design, microservices, and enterprise integration patterns.
  • Familiarity with version control systems (e.g., Git, Azure DevOps).
  • Demonstrated ability to build and scale AI solutions in production.
]]>