REMOTE Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Insight Global · 7 hours ago

REMOTE Site Reliability Engineer

Insight Global is seeking a REMOTE Site Reliability Engineer with a strong software engineering background. The role focuses on enhancing the reliability of production systems, driving reliability outcomes, and automating processes within cross-functional teams.

EmploymentHuman ResourcesRecruitingSales
check
H1B Sponsor Likelynote
Hiring Manager
Emma Parks
linkedin

Responsibilities

Embed with product and platform teams to own reliability for key services; come in and “run with” active projects
Define and drive SLOs/SLAs/SLIs; implement actionable alerting and dashboards (primary: Datadog)
Automate reliability work (deployment, scaling, failover, incident workflows) using code-first approaches
Author infrastructure as code (primarily Terraform) and collaborate on Docker/Kubernetes workflows
Instrument services (.NET primary stack; Python/Rust for tooling; Java is a plus) for observability and performance
Own incidents end-to-end: triage, root cause, postmortems, and preventative engineering
Apply systems thinking to reduce complexity, improve resilience, and increase change velocity safely
Partner with security and cloud teams on guardrails, least-privilege, and cross-cloud considerations
Write stories and technical docs that clarify problems, solutions, and acceptance criteria
Continuously improve reliability patterns, runbooks, and automation pipelines

Qualification

SRE experience.NET expertiseDatadog experienceInfrastructure as CodeScripting languagesAWS foundational knowledgeDockerKubernetesSQL/Postgres familiarityAgile/Scrum experienceAtlassian software suiteRust experienceAWS GlueAWS Neptune

Required

Proven SRE experience (3+ years minimum at mid-staff level) owning reliability for production systems
Software engineering background with strong procedural thinking; you've shipped production code
Proficient in scripting languages such as Python, Bash, or similar
.NET expertise as the primary skillset (services, APIs, performance, instrumentation)
Datadog hands-on experience (dashboards, monitors, logs, APM, alerting)
AWS foundational knowledge (you don't need a pro cert; you can reason about core services and IAM)
Infrastructure as Code with Terraform (modules, state, environments)
Practical knowledge of Docker and Kubernetes (how it works, how to debug and operate)
Familiarity with SQL/Postgres (querying, performance basics)

Preferred

Continued education and/or advanced degree(s) in Computer Science, Information Technology, or a related field
AWS certifications (such as AWS Certified Solutions Architect, AWS Certified Database - Specialty, or AWS Certified Security - Specialty)
Ability to understand and refactor complex legacy software
Experience in environments subject to HIPAA and/or PCI regulations
Professional experience with project lifecycle planning such as Agile/Scrum
Comfortable with Atlassian software suite (Jira, Confluence, and OpsGenie)
Experience with Rust
AWS Glue
AWS Neptune or other AWS purpose-built databases

Company

Insight Global

company-logo
Insight Global provides top talent and staffing solutions that help job seekers find careers in healthcare, finance, IT, and government.

H1B Sponsorship

Insight Global has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (281)
2024 (164)
2023 (75)
2022 (17)
2021 (3)
2020 (2)

Funding

Current Stage
Late Stage
Total Funding
unknown
2010-07-01Acquired

Leadership Team

leader-logo
Jared Streppa
President Of Company’s Technology Division
linkedin
Company data provided by crunchbase