Senior Site Reliability Engineer jobs in United States

Juul Labs · 22 hours ago

Senior Site Reliability Engineer

Juul Labs is committed to transitioning adult smokers away from combustible cigarettes and is seeking a Senior Site Reliability Engineer to own the operational stability and performance of their hybrid cloud infrastructure. The role involves leading automation efforts, architecting for reliability, and managing critical incidents to ensure a scalable and efficient platform.

B2C · Consumer Electronics · Consumer Goods
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

A Senior Site Reliability Engineer (SRE) is expected to own the operational stability and performance of Juul’s hybrid cloud infrastructure (Nutanix, AWS/GCP). This involves leading automation efforts, architecting for reliability, and acting as the final escalation point for critical incidents to ensure the platform is scalable and efficient.
Design, deploy, and maintain enterprise-scale Nutanix AHV clusters and Prism Central for multi-cluster management
Expert-level proficiency with Nutanix CLI (nCLI and acli) for advanced operations, troubleshooting, and automation
Develop automation scripts using Nutanix REST APIs, Python SDK, PowerShell, and Terraform for infrastructure-as-code
Create and manage VM templates, golden images, and standardized deployment catalogs for consistent provisioning
Design disaster recovery solutions using Leap, Protection Domains, cross-cluster replication, and metro clustering
Implement network micro-segmentation using Nutanix Flow and configure RBAC, encryption, and security hardening
Lead L3 troubleshooting using advanced diagnostics, log analysis (CVM, Genesis), NCC health checks, and cluster service resolution
Configure high availability, VM affinity rules, QoS policies, and optimize performance for mission-critical workloads
Manage AHV networking with OVS bridges, VLANs, bonds, and LACP, and implement resource reservations and workload balancing
Design, deploy, and maintain hybrid cloud infrastructure across Nutanix HCI, AWS, and GCP platforms
Architect and implement multi-cloud solutions ensuring high availability, scalability, and disaster recovery
Architect and deploy enterprise-scale, highly available multi-cloud solutions across AWS and GCP with multi-region/multi-account strategies
Expert-level proficiency with AWS CLI, GCP CLI, SDK, boto3, and Python for advanced automation and infrastructure orchestration
Design AWS Organizations and GCP Organization hierarchies with consolidated billing, IAM policies, and centralized governance
Configure and manage AWS Systems Manager (SSM) including Session Manager, Run Command, State Manager, and Automation for centralized fleet operations
Implement centralized logging using CloudWatch/CloudTrail and GCP Cloud Logging with S3/Cloud Storage aggregation
Integrate AWS and GCP with Splunk using HEC, CloudWatch subscriptions, Pub/Sub, Dataflow, and cloud-specific add-ons for SIEM correlation
Design and deploy advanced load balancing solutions with AWS ALB/NLB/ELB and GCP Cloud Load Balancing including SSL termination and auto-scaling
Develop infrastructure-as-code using Terraform, CloudFormation, and CDK for repeatable multi-cloud deployments and CI/CD pipelines
Configure AWS SSO, cross-account IAM roles, GCP Workload Identity, and federated access for centralized identity management
Design VPC architectures with AWS Transit Gateway/PrivateLink and GCP Shared VPC/VPC peering for hybrid connectivity
Manage containerized workloads using EKS, GKE, ECS, and Cloud Run with service mesh, observability, and security best practices
Implement disaster recovery using AWS Backup, Cross-Region Replication, GCP snapshots, and multi-region failover strategies
Lead L3 troubleshooting using CloudWatch Insights, GCP Cloud Trace, VPC Flow Logs, X-Ray, and vendor support escalation
Perform cost optimization through Reserved Instances, Committed Use Discounts, rightsizing, and automated resource lifecycle management
Administer and support Windows Server and Unix/Linux environments in production and non-production settings
Perform OS-level hardening, patch management, and security compliance across heterogeneous systems
Automate routine administrative tasks using PowerShell, Bash, Python, or similar scripting languages
Manage GitHub organization settings, user permissions, repository access controls, and monitor GitHub Actions workflows and repository health across multiple teams
Configure Splunk forwarders, heavy forwarders and other integrations for data ingestion from cloud and on-premises sources

Qualifications

Nutanix HCI · AWS · GCP · Python · Terraform · PowerShell · Bash scripting · Kubernetes · Disaster recovery · Networking knowledge · Analytical skills · Customer service orientation · Communication skills · Attention to detail · Continuous learner · Documentation skills · Calm under pressure

Required

8-12+ years of infrastructure experience, including 8+ years with Nutanix HCI and enterprise cloud (AWS/GCP)
Expert-level skills in Python, PowerShell, Bash scripting, infrastructure-as-code (Terraform/CloudFormation), and container orchestration (Kubernetes, EKS/GKE)
Proven experience managing enterprise-scale environments, hybrid cloud migrations, disaster recovery, and L3 critical incident management
Strong networking knowledge (TCP/IP, VLANs, routing, VPN), security hardening, and compliance frameworks (ITIL)
Strategic thinker with exceptional analytical and troubleshooting abilities for complex multi-layer infrastructure issues
Excellent communication skills to translate technical concepts to executives and non-technical stakeholders
Calm under pressure during critical outages with meticulous attention to security, compliance, and configuration management
Self-motivated continuous learner committed to staying current with evolving cloud technologies and automation opportunities
Available for on-call rotations with strong documentation skills and customer service orientation
Bachelor's or master's degree in computer science/IT

Preferred

Certifications (plus): Nutanix NCP/NCAP, AWS Solutions Architect Professional, AWS DevOps Professional, GCP Professional Cloud Architect, Terraform

Benefits

People. Work with talented, committed and supportive teammates
Equity and performance bonuses. Every employee is a stakeholder in our success
Cell phone subsidy, commuter benefits and discounts on JUUL products
Excellent medical, dental and vision, disability, and life insurance, plus family support, wellness, legal, and employee assistance program benefits
401(k) plan with company matching
Plus biannual discretionary performance bonuses

Company

Juul Labs

Juul Labs is a thriving team of scientists, engineers, designers and professionals who are committed to offering adult smokers alternatives to combustible cigarettes, while combating underage use of our products.

H1B Sponsorship

Juul Labs has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (7)
2024 (7)
2023 (20)
2022 (37)
2021 (30)
2020 (31)

Funding

Current Stage: Late Stage
Total Funding: $16.45B
Key Investors: Altria, Tiger Global Management
2023-11-09 · Series Unknown · $1.28B
2022-11-10 · Series Unknown
2020-02-06 · Debt Financing · $721.56M

Leadership Team

K.C. Crosthwaite, Chairman & CEO
Robert Terra, Vice President Corporate Communications
Company data provided by Crunchbase