
Capital One · 17 hours ago

Lead Site Reliability Engineer (SRE)

Capital One is a technology-driven company that focuses on solving real problems and meeting customer needs. As a Lead Site Reliability Engineer (SRE), you will be responsible for improving the reliability and performance of services, focusing on automation, monitoring, and problem management.

Financial Services
Comp. & Benefits
No H1B

Responsibilities

Guide site reliability automation to eliminate manual toil and build self-healing capabilities
Foster a culture of excellence and continuous learning within the chapter; establish and track appropriate OKRs to ensure outcomes are met
Create solutions that address high-impact technology and business priorities
Be competent in multiple contexts, such as programming languages, security, automation, testing, infrastructure, and performance, and serve as the go-to person for many people inside and outside the team
Proactively identify and mitigate issues based on intuition and experience in multiple domains

Qualifications

AWS, Site Reliability Engineering, DevOps, Python, Docker, Kubernetes, Application Monitoring, Agile Practices, Incident Management, Team Leadership

Required

High School Diploma, GED, or equivalent certification
At least 6 years of experience using build and deployment tools (Jenkins, GitHub, or Artifactory)
At least 4 years of experience with AWS
At least 2 years of team leadership experience

Preferred

5+ years of experience with AWS
2+ years of experience in Agile practices
Experience with SRE design to address reliability and resiliency, targeting five-nines (99.999%) availability
Experience working in a cloud environment (OCP and AWS EMR)
Experience with application monitoring tools, observability, and performance assessments
Experience with DevOps (CI/CD pipelines with Jenkins or similar; Git/GitHub)
Experience developing automation solutions in Python (or other similar languages)
Comfortable with production environments, firewalls, and networking
Experience with networking such as routing, load balancers, and VPC
Experience with Docker and Kubernetes
Experience deploying, observing, alerting on, logging, and monitoring systems (Splunk, Datadog, New Relic) with a mindset geared toward predictive analysis
Working knowledge of the Incident Management process

Benefits

Comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being

Company

Capital One

Capital One is a financial services company that provides banking, credit card, auto loan, savings, and commercial banking services.

Funding

Current Stage
Public Company
Total Funding
$5.45B
Key Investors
Berkshire Hathaway
2025-09-11 · Post-IPO Debt · $2.75B
2025-01-30 · Post-IPO Debt · $1.75B
2023-05-15 · Post-IPO Equity · $954M

Leadership Team

Daniel Arellano
Senior Vice President, Business Cards and Payments
Justin Burch
Senior Vice President
Company data provided by Crunchbase