MetLife · 1 month ago
Sr. Site Reliability Engineer
MetLife is one of the world’s leading financial services companies, recognized for its commitment to helping create a more confident future. As a Senior Site Reliability Engineer, you will serve as the technical authority for the enterprise’s experience protection layer, focusing on designing and implementing safeguards for applications and infrastructure while collaborating with various teams to enhance operational resilience.
Asset Management · Financial Services · Insurance · Life Insurance · Risk Management
Responsibilities
Architect end-to-end protection patterns across app, data, infra, and security layers
Design early-detection logic for drift, latency creep, SLO/SLA degradation, data freshness issues, schema changes, batch/reporting failures, authentication friction, and infrastructure saturation
Define control-as-code patterns embedded within ServiceNow workflows for Change, Incident, Problem, Release, and Access Management
Create stabilization logic such as autoscale, route shifts, job controls, controlled rollbacks, and safety limits
Partner with domain architects and L3 SMEs to align safe automation boundaries and system-level design constraints
Develop diagnostic correlation logic to accelerate triage and root-cause identification for SHIELD Engineers
Govern the quality of production findings, ensuring high-fidelity evidence, cross-domain correlation, severity scoring, and actionable recommendations
Own SHIELD’s engineering standards, domain integration patterns, and protection roadmap
Collaborate with cross-functional teams to uplift operational maturity and resilience
Qualifications
Required
7-10+ years in engineering across Application, Data, Infrastructure, or Security domains
Deep familiarity with observability (logs, metrics, traces), SLO design, automation patterns, and distributed systems
Experience with cloud platforms, Kubernetes, CI/CD, messaging systems, and ServiceNow or equivalent tooling
Strong diagnostic, analytical, and pattern-recognition capabilities
Ability to operate calmly under pressure in critical moments
Preferred
Advanced degree in Computer Science, Engineering, or related discipline
Prior experience in leading protection or automation programs within large, regulated, global enterprises
Exposure to SRE, MLOps, or AI-driven operational analytics
Certifications in relevant infrastructure domains (e.g., AWS/Azure Architect, ITIL)
Strategic thinker with the ability to translate complex technical initiatives into measurable business outcomes
Benefits
Comprehensive health plan that includes medical, prescription drug, and vision coverage
Dental insurance
No-cost short- and long-term disability
Company-paid life insurance
Legal services
Retirement pension funded entirely by MetLife
401(k) with employer matching
Group discounts on voluntary insurance products including auto and home, pet, critical illness, hospital indemnity, and accident insurance
Employee Assistance Program (EAP)
Digital mental health programs
Parental leave
Volunteer time off
Tuition assistance
Company
MetLife
MetLife is a provider of insurance, employee benefits, and financial services.
H1B Sponsorship
MetLife has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (164)
2024 (108)
2023 (113)
2022 (155)
2021 (75)
2020 (81)
Funding
Current Stage: Public Company
Total Funding: $500M
2024-06-20: Post-IPO Debt · $500M
2000-04-14: IPO
Company data provided by Crunchbase