SIGN IN
Site Reliability Engineer (SRE) jobs in United States
info-icon
This job has closed.
company-logo

Baseten · 5 hours ago

Site Reliability Engineer (SRE)

Baseten is a company that powers mission-critical inference for leading AI companies by providing robust infrastructure and developer tooling. As a Site Reliability Engineer, you will build and maintain scalable infrastructure, automate processes, and collaborate with cross-functional teams to enhance system reliability and performance.
Artificial Intelligence (AI)SoftwareAI InfrastructureDeveloper ToolsMachine LearningSoftware Engineering
check
H1B Sponsor Likelynote

Responsibilities

Build and maintain scalable infrastructure to support the deployment and operation of machine learning models
Establish standards and best practices for reliability and performance across the infrastructure
Automate processes when relevant, particularly for managing CI/CD pipelines
Own products and projects end-to-end, functioning as both an engineer and a project manager, with a focus on user empathy, project specification, and end-to-end execution
Collaborate with cross-functional teams to understand project requirements and translate them into technical solutions
Mentor junior team members and contribute to knowledge sharing within the organization
Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity
Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates

Qualification

KubernetesInfrastructure-as-codeCI/CD toolingScalable infrastructureObservability toolsMachine learning knowledgeProject management

Required

Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field
5+ years of professional work experience in a fast-paced, high-growth environment
Extensive experience with Kubernetes
Experience in building and maintaining scalable infrastructure
Experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation, Pulumi) and CI/CD tooling (e.g., GitHub Actions, GitLab CI, Circle CI, Jenkins)
Ability to own projects end-to-end, from project specification to execution

Preferred

Relevant OSS observability experience (Prometheus, ELK stack, Grafana stack, Opentelemetry) is a plus
No prior machine learning experience required, but should be open to learning about it

Benefits

Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Company

Baseten

twittertwittertwitter
company-logo
Baseten is an AI infrastructure company that integrates machine learning into business operations, production, and processes.

H1B Sponsorship

Baseten has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (6)
2024 (8)
2023 (1)
2020 (1)

Funding

Current Stage
Late Stage
Total Funding
$585M
Key Investors
CapitalG,IVP,NVIDIABondIVP,Spark Capital
2026-01-20Series Unknown· $300M
2025-09-05Series D· $150M
2025-02-19Series C· $75M

Leadership Team

leader-logo
Aaron Relph
Design
linkedin
Company data provided by crunchbase