
Bolton · 15 hours ago

Senior Site Reliability Engineer

Bolt On Technology is seeking a Senior Site Reliability Engineer responsible for ensuring the reliability, scalability, performance, and security of production systems. This role blends software engineering and systems engineering to build resilient infrastructure and improve automation while reducing operational risk.

Consumer Goods · Manufacturing · Seafood · Wellness · Wholesale
H1B Sponsor Likely

Responsibilities

Design, build, and maintain highly available, scalable, and fault-tolerant systems
Lead reliability improvements across production and non-production environments
Own and improve monitoring, alerting, and observability platforms
Drive incident response, root cause analysis, and post-incident reviews
Implement automation to reduce manual operational work
Partner with Engineering, Security, and Product to support platform needs
Establish and track SLIs, SLOs, and error budgets
Lead capacity planning and performance tuning efforts
Improve deployment, CI/CD, and infrastructure-as-code practices
Identify and mitigate reliability and scalability risks before they impact customers
Mentor and guide junior engineers and contribute to team technical standards
Participate in on-call rotation and help mature on-call processes

Qualifications

Site Reliability Engineering · Cloud platforms (AWS, Azure, GCP) · Infrastructure as code · Containerization · Orchestration · Linux systems administration · CI/CD pipelines · Monitoring · Observability tools · Scripting · Automation · Troubleshooting skills · Incident management · Accountability · Mentorship · Communication skills · Ownership · Calm under pressure

Required

6+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or related roles
Strong experience with cloud platforms (AWS, Azure, or GCP)
Proficiency with infrastructure as code (Terraform, CloudFormation, Pulumi, etc.)
Experience with containerization and orchestration (Docker, Kubernetes)
Strong Linux systems administration and networking fundamentals
Experience building and maintaining CI/CD pipelines
Hands-on experience with monitoring and observability tools (Datadog, Prometheus, Grafana, New Relic, etc.)
Strong troubleshooting and incident management skills
Experience with scripting and automation (Python, Bash, Go, or similar)

Preferred

Experience designing multi-region or highly distributed systems
Experience with security best practices and compliance in production environments
Experience supporting high-availability SaaS platforms
Experience in a fast-growing or PE-backed environment
Experience influencing reliability culture across engineering teams

Benefits

Medical, dental, and vision benefits
Company-paid life insurance
Flexible schedules
Unlimited PTO
Volunteer Time Off
Sick leave
Parental leave
9 company-paid holidays

Company

Bolton

We are a family-owned multinational that makes a difference for families every day by producing and distributing a diverse offering of more than 60 quality brands.

H1B Sponsorship

Bolton has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Distribution of job fields receiving sponsorship (chart not shown)
Total sponsorships by year: 2021 (1)

Funding

Current Stage: Late Stage
Company data provided by Crunchbase