Site Reliability Engineering (SRE) Architect jobs in United States
cer-icon
Apply on Employer Site
company-logo

STAFFWORXS · 5 hours ago

Site Reliability Engineering (SRE) Architect

STAFFWORXS is seeking a highly experienced Site Reliability Engineering (SRE) Architect to lead the strategic design, development, and maturity of their reliability engineering practices. This role focuses on defining architectural blueprints and standards to guide development and SRE operations teams in building resilient and scalable systems.

AppsConsultingInformation TechnologyProject Management
check
H1B Sponsor Likelynote
Hiring Manager
VENKAT R.
linkedin

Responsibilities

Architect scalable, highly available, secure, and cost-effective solutions on AWS
Define and promote SRE standards, best practices, and architectural blueprints across engineering teams
Evaluate and enhance current observability systems, identifying gaps and driving next-level maturity to improve system insights
Lead the definition and implementation of SLIs, SLOs, and error budgets for critical services
Design solutions to eliminate operational toil through automation and improved system architecture
Assess existing SRE tools, CI/CD pipelines, IaC modules, and automated remediation frameworks, proposing improvements
Evaluate and recommend new tools, technologies, and practices to strengthen reliability, productivity, and operational excellence
Serve as a senior advisor on reliability, scalability, and performance across development and platform teams
Offer architectural guidance for new services to ensure reliability principles are integrated from the start
Mentor SREs and engineers, promoting strong engineering discipline and adherence to SRE principles
Lead architecture reviews and production readiness assessments for critical systems
Lead blameless postmortems for major incidents and drive systemic architectural improvements
Advocate and architect resilience patterns including circuit breakers, rate limiting, graceful degradation, and chaos engineering

Qualification

AWSSRE principlesContainerizationOrchestrationObservability solutionsProgramming/scriptingAnalytical skillsCommunication skillsCollaboration skillsLeadership abilities

Required

Proven experience in architectural roles focused on reliability, scalability, and performance
Deep hands-on expertise with SRE principles (SLIs/SLOs, error budgets, automation, incident management)
Strong AWS experience across infrastructure, networking, and security
Expertise with containerization and orchestration (Kubernetes, Docker, serverless)
Experience building observability solutions (Dynatrace, Prometheus, Grafana, ELK/EFK, Jaeger, OpenTelemetry)
Strong programming/scripting abilities (Python, Go, Bash)
Excellent analytical and strategic problem-solving skills
Strong communication, collaboration, and leadership abilities

Preferred

Experience implementing and maturing chaos engineering practices and platforms

Company

STAFFWORXS

twittertwitter
company-logo
StaffWorxs is an IT consulting and services firm that provides application development, project management, and data services.

H1B Sponsorship

STAFFWORXS has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (25)
2024 (17)
2023 (12)
2022 (8)
2021 (8)
2020 (1)

Funding

Current Stage
Growth Stage
Company data provided by crunchbase