Staff AI Ops Engineer jobs in United States

Calix · 1 day ago

Staff AI Ops Engineer

Calix provides cloud and software solutions for communications service providers. They are seeking a highly skilled Staff AI Ops Engineer to build and maintain infrastructure for machine learning and generative AI applications, collaborating closely with data scientists and software developers to ensure system robustness and efficiency.

Analytics · Information Technology · Infrastructure · Internet · Software · Telecommunications · VoIP
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Design, implement, and maintain scalable infrastructure for ML and GenAI applications
Deploy, operate, and troubleshoot production ML/GenAI pipelines/services
Build and optimize CI/CD pipelines for ML model deployment and serving
Scale compute resources across CPU/GPU architectures to meet performance requirements
Implement container orchestration with Kubernetes
Architect and optimize cloud resources on GCP for ML training and inference
Set up and maintain runtime frameworks and job management systems (Airflow, Kubeflow, MLflow, etc.)
Establish monitoring, logging, and alerting for system observability
Optimize system performance and resource utilization for cost efficiency
Develop and enforce AIOps best practices across the organization

Qualifications

GCP · DevOps/AIOps · Container orchestration · CI/CD expertise · Python · Terraform · Airflow · Kubernetes · ML frameworks · Problem-solving · Communication

Required

Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience)
8+ years of overall software engineering experience
3+ years of focused experience in DevOps/AIOps or similar ML infrastructure roles
Proficiency in infrastructure as code (IaC) using Terraform
Strong experience with containerization and orchestration using Docker and Kubernetes
Demonstrated expertise in cloud infrastructure management on GCP
Proficiency with workflow management tools such as Airflow and Kubeflow
Strong CI/CD expertise with experience implementing automated testing and deployment pipelines
Experience scaling distributed compute architectures across CPU and GPU hardware
Solid understanding of system performance optimization techniques
Experience implementing comprehensive observability solutions for complex systems
Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack)
Strong proficiency in Python
Familiarity with ML frameworks such as PyTorch and ML platforms like Vertex AI
Excellent problem-solving skills and ability to work independently
Strong communication skills and ability to work effectively in cross-functional teams

Benefits

This role may be eligible for a bonus

Company

Calix provides the cloud, software, systems, and services for service providers to simplify business, excite subscribers, and grow value.

H1B Sponsorship

Calix has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data from the US Department of Labor)
Trends of Total Sponsorships
2025 (37)
2024 (22)
2023 (25)
2022 (31)
2021 (19)
2020 (7)

Funding

Current Stage: Public Company
Total Funding: $100M
2010-03-24 · IPO
2009-08-31 · Series Unknown · $50M
2003-02-07 · Series E · $50M

Leadership Team

Michael Weening
President and CEO, Member of the Board
Cory Sindelar
Chief Financial Officer
Company data provided by Crunchbase