Ascendion · 3 hours ago
Principal Site Reliability Engineering Manager
Ascendion is a full-service digital engineering solutions company that provides software platforms and products for enterprise clients. They are seeking a Principal Site Reliability Engineering Manager to lead reliability engineering efforts across enterprise-grade services, requiring deep technical expertise in cloud-native architecture and operational excellence.
Responsibilities
Architect and implement scalable, highly available infrastructure across Azure and hybrid environments
Lead reliability initiatives for services like Active Directory, Dynamics 365, MS Teams, and Azure Identity
Develop and maintain automation for provisioning, deployment, monitoring, and incident resolution using tools like Terraform, Helm, and GitHub Actions
Build observability frameworks using Grafana, Prometheus, Azure Monitor, Application Insights, and Splunk
Apply chaos engineering principles to validate system resilience and disaster recovery capabilities
Collaborate with development and operations teams to eliminate toil and improve service performance
Ensure compliance with security and regulatory standards across infrastructure and application layers
Create and maintain documentation for architecture, operational procedures, and best practices
Qualification
Required
15+ years of experience in infrastructure architecture, DevOps, or SRE roles
Proven experience supporting large-scale live services (e.g., Active Directory, Dynamics 365, Azure AD, MS Teams)
Strong command of Azure services including AKS, Azure Functions, App Gateway, Traffic Manager, and Azure Front Door
Expertise in CI/CD pipelines using Azure DevOps, GitHub, Jenkins, or similar tools
Proficiency in scripting languages (PowerShell, Python, Bash) and Infrastructure as Code (Terraform, ARM, Bicep)
Experience with containerization and orchestration (Docker, Kubernetes)
Familiarity with security practices, identity management, and compliance frameworks
Preferred
MS Certified: Azure Solutions Architect Expert or DevOps Engineer Expert
Experience with service-oriented architecture and three-tier web applications
Background in customer-facing roles with strong communication and stakeholder management skills
Prior experience in chaos engineering and incident response automation
Benefits
Medical insurance
Dental insurance
Vision insurance
401(k) retirement plan
Long-term disability insurance
Short-term disability insurance
5 personal days accrued each calendar year. The Paid time off benefits meet the paid sick and safe time laws that pertains to the City/ State
10-15 days of paid vacation time
6 paid holidays and 1 floating holiday per calendar year
Ascendion Learning Management System
Company
Ascendion
Ascendion is a trusted ally for enterprise business and technology leaders engineering the digital future.
H1B Sponsorship
Ascendion has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (172)
2024 (193)
2023 (93)
Funding
Current Stage
Late StageLeadership Team
Recent News
The Guardian Nigeria News - Nigeria and World News
2025-08-19
2025-08-09
Company data provided by crunchbase