Disaster Recovery Principal Systems Engineer (Remote) jobs in United States
cer-icon
Apply on Employer Site
company-logo

CareFirst BlueCross BlueShield · 3 hours ago

Disaster Recovery Principal Systems Engineer (Remote)

CareFirst BlueCross BlueShield is responsible for the overall management, strategy, and execution of its Disaster Recovery program. The Disaster Recovery Principal will work closely with internal stakeholders and external partners to ensure the resilience and recoverability of critical business applications and infrastructure.

Health CareNon ProfitService Industry
badNo H1Bnote

Responsibilities

Establish and maintain core DR program policies, standards, and procedures
Set program strategies and publish program metrics, leveraging AI-driven analytics for predictive risk assessment and continuous improvement
Complete program assessment responses for internal and external audits
Document new standards for program, plans, and exercises, incorporating emerging technologies and best practices
Update and maintain current policies, procedures, strategy, and work instruction documents
Publish monthly DR metrics and respond to program assessments and audits, utilizing automated dashboarding and reporting tools
Evaluate and implement AI-powered solutions for automated failover, incident response, and predictive modeling
Provide consultation and direction to application and infrastructure owners for DR strategy and planning, with a focus on cloud-native and hybrid environments
Maintain oversight of critical and non-critical enterprise business application DR plans
Conduct annual review and update of enterprise environment DR plan documents, integrating lessons learned from automated and scenario-based exercises
Facilitate plan reviews and feedback sessions with owners
Collaborate and coordinate with core teams (e.g., Enterprise Architecture, Change Advisory Board, Enterprise Risk, IT Quality, IT Service Readiness, Technology Operations Center)
Support real-time dashboarding of environmental changes affecting DR strategies
Align DR program with industry standards such as NIST SP 800-34, ISO 22301, and FFIEC guidelines
Coordinate, plan, execute, and close large-scale DR exercises, including automated and chaos engineering-based tests
Collaborate and consult with application owners for single system and business application DR exercises
Drive automation and orchestration to conduct complex exercises with multiple interdependencies
Provide exercise consulting and assistance as required
Ensure continuous validation of DR plans through frequent, automated testing and scenario-based exercises
Develop and maintain advanced dashboards for real-time DR metrics using business intelligence and AI tools
Define and track key performance indicators (KPIs) for DR readiness, recovery time objectives (RTO), and recovery point objectives (RPO)
Analyze DR program data to identify trends, gaps, and opportunities for improvement
Collaborate with cybersecurity, cloud architecture, and DevOps teams to ensure DR plans are integrated with broader enterprise resilience strategies
Manage relationships with third-party DR service providers, including contract negotiation and performance monitoring

Qualification

Disaster Recovery ManagementCloud-based DR StrategiesAI-driven AnalyticsRegulatory ComplianceITIL Foundations v3Automation ToolsDisaster Recovery StandardsProject ManagementCommunication SkillsCollaboration SkillsContinuous Learning Mindset

Required

Bachelor's Degree in Information Technology, Computer Science or related field OR in lieu of a Bachelor's degree, an additional 4 years of relevant work experience is required in addition to the required work experience
ITIL Foundations v3 within 180 Days Preferred
10 years of relevant IT systems engineering experience

Preferred

Demonstrated experience managing DR programs and exercises in large enterprise environments
Strong understanding of disaster recovery planning, policy development, and regulatory compliance
Experience working with managed services providers and cross-functional teams
Excellent communication, collaboration, and project management skills
Familiarity with automation tools, AI-driven analytics, and dashboarding for DR metrics
Experience with cloud-based DR strategies and hybrid environments
In-depth knowledge of disaster recovery industry standards and frameworks, including NIST SP 800-34, ISO 22301, and FFIEC guidelines
Proven ability to align DR programs with current industry best practices and regulatory requirements
Professional certifications such as CBCP, DRII, ABCP, CFCP, AWS Certified Solutions Architect, Azure Solutions Architect, or similar
Experience with AI/ML technologies in DR planning and execution
Knowledge of chaos engineering, automated DR testing, and advanced cloud security practices
Continuous learning mindset with a focus on emerging DR technologies and industry trends

Benefits

Comprehensive benefits package
Various incentive programs/plans
401k contribution programs/plans

Company

CareFirst BlueCross BlueShield

company-logo
CareFirst. It’s not just our name. It’s our promise.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Ja'Ron Bridges
Interim President and Chief Executive Officer
linkedin
leader-logo
Doba Parushev
Vice President, Healthworx Ventures
linkedin
Company data provided by crunchbase