Senior AI Operations (AI Ops) Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Navan · 1 day ago

Senior AI Operations (AI Ops) Engineer

Navan is focused on creating a Composable AI Microservice Architecture that powers their AI support engine, Ava. The Senior AI Operations Engineer will architect the platform for managing a fleet of specialized AI services, ensuring quality and reliability while optimizing performance and deployment of language models.

TravelFinancePaymentsSoftwareBusiness TravelFinancial Services
check
H1B Sponsor Likelynote

Responsibilities

Orchestrate the AI Fleet: Build and own the runtime environment for 100+ specialized AI services. Manage model routing, context versioning, and standardized memory/history stores
High-Density Inference Optimization: Design and implement SageMaker Multi-Model Endpoints (MME) and Inference Components to serve multiple tuned SLMs per GPU, maximizing hardware utilization while minimizing latency
Deterministic Service Excellence: Treat reliability as a layered engineering problem. Build deterministic "shells" around probabilistic LM outputs, prioritizing data-layer validation and strict serialization
Automated Evaluation & Observability: Implement "LLM-as-a-judge" patterns and automated benchmarking to detect semantic drift and hallucinations across the fleet before they impact the user
Standardize the Workflow: Obsess over building reusable patterns and Terraform-based infrastructure that eliminate "snowflake" configurations, allowing us to deploy new specialized AI tasks in minutes
Agency Strategy: Partner with AI Researchers to find the "Goldilocks zone" for agentic autonomy—balancing the flexibility of LLM tool-use with the precision required for production stability

Qualification

AWS SageMakerSmall Language ModelsPythonTerraformCI/CD AutomationDockerKubernetesSRE ExperienceData FamiliarityAI Ops Mindset

Required

5+ years in SRE, Platform Engineering, or MLOps, with at least 2 years focused on deploying LLMs/SLMs in production environments
Deep hands-on expertise with AWS SageMaker, specifically configuring Multi-Model Endpoints (MME), Inference Components, and GPU-backed instances (G5/P4)
Proven experience with Small Language Models (e.g., Mistral, Llama 3, Phi) and parameter-efficient fine-tuning (PEFT) deployment strategies like LoRA/QLoRA
Strong proficiency in Python and Terraform
Experience with Docker, Kubernetes (EKS), or AWS ECS/Fargate
Familiarity with Snowflake and Vector Databases
You understand that AI at scale is a statistical challenge. You are comfortable debugging issues at the data/serialization layer rather than defaulting to prompt tweaks
Experience building robust pipelines (Jenkins, GitHub Actions) for non-deterministic software, including automated 'eval' stages
BS or MS in Computer Science, Engineering, Mathematics, or a related technical field

Company

Navan provides travel, expense, and corporate card management to automate manual processes and drive spend visibility.

H1B Sponsorship

Navan has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (45)
2024 (22)
2023 (38)

Funding

Current Stage
Public Company
Total Funding
$2.25B
Key Investors
Goldman Sachs Bank USACoatueGreenoaks
2025-10-30IPO
2025-04-07Convertible Note
2022-12-08Debt Financing· $400M

Leadership Team

leader-logo
Ariel Cohen
CEO and Co-Founder
linkedin
leader-logo
Carlos Avelar
Account Executive
Company data provided by crunchbase