Thrive Market · 14 hours ago
Principal Software Engineer, DevOps
Thrive Market is an online, membership-based market dedicated to making healthy and sustainable living accessible to everyone. They are seeking a Principal DevOps Engineer to shape their technology strategy, drive architectural decisions, and mentor engineering teams as they scale their infrastructure and improve automation, reliability, and monitoring.
Beauty · E-Commerce · Grocery · Health Care · Retail · Shopping
Responsibilities
Define and drive the organization's DevOps strategy, establishing best practices, standards, and tooling across all engineering teams
Lead the ongoing Trellis migration, ensuring a smooth transition with minimal disruption to business operations
Architect and oversee our Kubernetes-based container orchestration platform, optimizing for reliability, performance, and cost efficiency
Evaluate and lead the potential platform migration from Magento to Shopify (or comparable platforms), building the technical roadmap and stakeholder alignment needed to execute at scale
Design and implement automated system installation, configuration, and deployment procedures that enable rapid, error-free releases
Build and maintain comprehensive monitoring, alerting, and observability systems to ensure high availability across all production environments
Develop and own disaster recovery plans, capacity expansion strategies, and system hardening initiatives
Identify systemic problems and inefficiencies across the engineering organization and make strategic recommendations for improvement
Mentor and guide junior and mid-level DevOps engineers, fostering a culture of continuous learning and operational excellence
Collaborate closely with development teams to help them scale their infrastructure in AWS and adopt modern deployment practices
Create and maintain technical documentation covering architecture decisions, runbooks, incident response procedures, and operational playbooks
Troubleshoot complex performance issues across production environments and lead incident response efforts
Participate in weekly on-call rotations and serve as a senior escalation point for critical infrastructure issues
Drive ad hoc strategic projects based on the evolving needs of the Engineering organization
Qualifications
Required
B.S. in Computer Science or equivalent professional experience
7+ years of hands-on experience in DevOps, SRE, or Infrastructure Engineering, with a proven track record of scaling infrastructure at rapidly growing companies
Deep expertise in Kubernetes (K8s), including cluster management, Helm charts, service meshes, and production-grade container orchestration
Strong systems engineering background with advanced proficiency in Linux administration
Advanced scripting and automation skills in Bash, Python, Golang, Ruby, or similar languages
Extensive experience with core AWS services including EC2, ECS/EKS, S3, VPC, IAM, CloudWatch, Route 53, RDS, and Lambda
Strong experience with Infrastructure as Code tools (Terraform, CloudFormation, Pulumi, or similar)
Strong experience with configuration management tools (Ansible, Chef, Puppet, or similar)
Deep understanding of CI/CD pipelines and deployment strategies (blue-green, canary, rolling deployments)
Expertise in monitoring and observability platforms (Datadog, Prometheus, Grafana, New Relic, or similar)
Strong knowledge of web application infrastructure, networking, load balancing, and security best practices
Demonstrated ability to influence technology direction at an organizational level and communicate effectively with both technical and non-technical stakeholders
Preferred
Experience with e-commerce platforms (Magento, Shopify, or comparable) and the unique infrastructure challenges they present
Experience with Trellis or similar WordPress/WooCommerce deployment frameworks
Experience leading platform migrations or large-scale re-architecture initiatives
Familiarity with GitOps workflows (ArgoCD, Flux) and service mesh technologies (Istio, Linkerd)
Experience building and managing cost-optimization strategies for cloud infrastructure
Background in SRE practices including SLIs, SLOs, error budgets, and blameless postmortems
Benefits
Comprehensive health benefits (medical, dental, vision, life and disability)
Competitive salary (DOE) + equity
401k plan
9 Observed Holidays
Flexible Paid Time Off
Subsidized ClassPass Membership with access to fitness classes and wellness and beauty experiences
Ability to work in our beautiful office in Playa Vista
Free Thrive Market membership with exclusive employee discount
Coverage for Life Coaching & Therapy Sessions on our holistic mental health and well-being platform
Company
Thrive Market
Thrive Market is a membership-based online company that offers natural and organic food products.
H1B Sponsorship
Thrive Market has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (10)
2024 (3)
2023 (5)
2022 (14)
2021 (8)
2020 (6)
Funding
Current Stage: Late Stage
Total Funding: $241.4M
Key Investors: Invus, Greycroft
2019-10-24 · Convertible Note · $20M
2018-07-02 · Series B · $42.4M
2017-05-30 · Series B · $20M
Company data provided by Crunchbase