LLM / GenAI Engineer — Enterprise RAG

A fast-growing US AI infrastructure startup is hiring an LLM / GenAI Engineer to build and scale Retrieval-Augmented Generation (RAG) systems for enterprise clients across financial services, healthcare and legal sectors. You will work at the intersection of LLM engineering, data infrastructure and product delivery.

Role & Responsibilities:

  • Design and implement production RAG pipelines using Azure AI Foundry, LangChain and vector databases (Pinecone, Azure AI Search, ChromaDB)
  • Fine-tune and evaluate open-source LLMs (LLaMA, Mistral) and integrate Azure OpenAI APIs for enterprise use cases
  • Build data ingestion pipelines that chunk, embed and index large document corpora on Databricks
  • Implement evaluation frameworks to measure retrieval quality, hallucination rates and answer relevance
  • Optimise inference latency and cost across hosted and self-hosted model deployments
  • Collaborate with product and solutions teams to translate enterprise requirements into LLM system designs
  • Stay current with rapidly evolving LLM research and apply relevant advances to production systems

Required Skills & Experience:

  • 3+ years of ML or data engineering experience, with at least 1 year focused on LLM applications
  • Strong Python skills with experience using LangChain, LlamaIndex or equivalent RAG frameworks
  • Hands-on experience with vector databases and semantic search
  • Familiarity with Azure AI Foundry, Azure OpenAI Service or Databricks Model Serving
  • Understanding of prompt engineering, context window management and chunking strategies
  • Experience evaluating LLM outputs using frameworks like RAGAS or DeepEval
  • Exposure to MLflow or similar for experiment tracking preferred

What We Offer:

  • Fully remote role, open worldwide (US/EU/India time zones preferred)
  • Salary $120,000–$160,000 USD depending on experience and location
  • Equity participation in a Series A company with strong investor backing
  • Cutting-edge AI infrastructure work with direct impact on enterprise products

This is a foundational engineering role for someone who wants to build the RAG systems that will power the next generation of enterprise AI, and not just prototype them but ship them to production.

Remote · Worldwide | $120,000–$160,000

  • LLM
  • RAG
  • Azure AI Foundry
  • Databricks
  • Python