Harvard University · 1 month ago
DevOps Engineer, Generative AI Applications, HBS Foundry
Harvard University is a prestigious institution dedicated to fostering innovation and collaboration. The DevOps Engineer for Generative AI Applications will lead the development and management of a GenAI application platform, ensuring the infrastructure is optimized for AI and ML workloads while collaborating with various teams to promote ethical AI practices.
EducationHigher EducationUniversities
Responsibilities
Build trust and collaboration by being present on-site and engaging directly with colleagues and various constituents
Infrastructure Management: Design and manage cloud infrastructure (AWS) optimized for the intensive computational needs of AI and ML workloads
CI/CD Pipelines: Build and maintain continuous integration and continuous deployment (CI/CD) pipelines tailored for AI/ML models
Automation: Automating model training, testing, deployment, and monitoring processes
Scalability & Reliability: Ensuring that AI applications are scalable and highly available in production environments
Monitoring & Observability: Implementing systems to monitor model performance, data drift, logs, and uptime
Collaboration: Working closely with data scientists, ML engineers, and backend developers to ensure smooth and secure deployment of AI services
This role is responsible for other duties as assigned
Qualification
Required
Minimum of seven years' post-secondary education or relevant work experience
Bachelor's degree in mathematics, physics, computer science, engineering, statistics, or an equivalent technical discipline desired
Minimum of five years Dev Ops experience with at least a year of ML Ops and Software Engineering Development background
Proficiency in cloud services - Amazon Web Services (AWS)
Expertise with Docker and Kubernetes for managing application environments
Experience with tools like Terraform or Ansible for automating infrastructure provisioning
Strong scripting skills, typically in Python or Go, for automation and building operational tools
A focus on site reliability engineering principles to ensure robust production systems
Tech Skills: Terraform, GitHub actions, Quadrant, Vector Database, CI/CD, Python, Shell Scripting
Strong Communication Skills
Good Team Player
Ability to wear multiple hats
Fast learner
Problem solving and go getter attitude
Preferred
Nice to have - Google Cloud Platform (GCP), or Microsoft Azure
Familiarity with MLOps platforms and tools (e.g., MLflow, Kubeflow, Data Version Control (DVC)) is often a plus
Benefits
Generous paid time off including parental leave
Medical, dental, and vision health insurance coverage starting on day one
Retirement plans with university contributions
Wellbeing and mental health resources
Support for families and caregivers
Professional development opportunities including tuition assistance and reimbursement
Commuter benefits, discounts and campus perks
Company
Harvard University
Harvard University is a private research university and a member of the Ivy League.
Funding
Current Stage
Late StageTotal Funding
$136.43MKey Investors
National Endowment for the Humanities (NEH)Alfred P. Sloan FoundationMassachusetts Clean Energy Center
2023-01-10Grant· $0.35M
2022-01-01Grant· $1.5M
2021-08-24Grant· $0.07M
Recent News
The Indian Express
2026-01-07
2025-12-25
Company data provided by crunchbase