Director, Site Reliability Engineering jobs in United States
cer-icon
Apply on Employer Site
company-logo

NBCUniversal · 6 hours ago

Director, Site Reliability Engineering

NBCUniversal is one of the world's leading media and entertainment companies, and they are seeking a Director of Site Reliability Engineering. The role involves leading and performing architectural design, implementation, and maintenance for production application environments, while also managing a team of architects and engineers.

BroadcastingMedia and EntertainmentNews
check
H1B Sponsor Likelynote

Responsibilities

As a member of NBCUniversal’s Production Software Engineering team, responsible for leading and performing custom architectural design, implementation, monitoring, and maintenance for a portfolio of production application environments
Responsible for hands-on configuration and support as well as managing the work of other architects and engineers
Work closely with our Principal Software Engineer on technical architecture and design based on customer product requirements, translating product requirements to technical designs and implementations
Collaborate with cross-functional team members such as Scrum Leads, Software Engineers, QA Engineers, UX Designers, Product Managers, other Architects & Site Reliability Engineers (Contractors and/or Staff), and third-party vendors
Effectively delegate responsibilities to team members, mentoring and providing them with repeatable processes, and verifying the quality of their work
Utilize metrics to measure accomplishments and monitors progress, ensuring milestones and projects are completed on-time
Communicate progress and the impact of solutions in technical terms to technology partners and in business terms to business partners
Establish a reputation as the subject matter expert for every tech stack used in Production Software Engineering applications and how they all fit together while keeping current with new technologies, developing innovative technical ideas, and generating proposals
Work with product teams to learn business objectives, development teams to plan platform needs, QA to understand test strategy, and SRE on environments and deployments
Participate in Scrums, demos, and other Agile ceremonies and ensure accurate and timely status updates to the team
Serve as primary interface with the NBCU Cyber Security team for all security-related initiatives, patching, remediations, etc
Hands-on commissioning, configuration, administration, documentation, and support for all on-prem & cloud (AWS) environments (Servers, Storage, Databases, Networking, Security, etc.)
Technical impact analysis, implementation, and monitoring of all cyber, technology audit, enterprise engineering, & IT (Databases, Monitoring, etc.) activities related to Production Software Engineering applications and platforms
Create and manage CI/CD pipelines using tool likes Cloud Formation, Foreman, Jenkins, Nexus, Rundeck, Ansible, and Puppet
Lead implementation of monitoring and reporting framework using tools like Grafana, Influx, Graylog/Splunk, Selenium, New Relic, and Icinga
Recognize and identify potential technical impacts of enterprise change controls which could affect our applications and customers
Help improve performance, scalability, and reliability
Build and maintain distributed infrastructure and automation
Solve problems quickly and automates processes for the future
Direct management of other engineers and architects (Contractors and/or Staff). 24x7x365 availability for production outages, emergencies, and deployments

Qualification

Linux/Unix systems engineeringContinuous DeliveryDevOps principlesAWS Cloud experienceTechnical leadershipNoSQL data storesLarge scale applicationsAgile toolsPeople managementOperational toolsReal-time systemsDistributed infrastructure

Required

Bachelor's degree in Computer Science, Information Technology, or related field (or foreign degree equivalent), plus 10 years of experience as a Software Architect, in the job offered, or in a related occupation
Hands-on systems engineering experience on Linux/Unix platforms
Experience with technical leadership and people management
Experience with Continuous Delivery and SDLC practices
DevOps principles, experience with operational tools (Ansible or Puppet or Chef, Terraform) and best practices for infrastructure (on-prem or cloud) and software deployment
Operational experience with large scale applications
Experience with NoSQL data stores (MarkLogic, MongoDB, Cassandra, DynamoDB, Couchbase, PostgreSQL, etc.)
Experience with a broad range of enterprise technologies
Experience building real-time, large-scale, low-latency distributed systems
Experience with Agile tools like Jira, GitHub or similar
Experience using AWS Cloud in a production environment
Experience with AWS IAM, EC2, RDS, S3, Lambda, batch and step functions

Benefits

Medical, dental and vision insurance
401(k)
Paid leave
Tuition reimbursement
A variety of other discounts and perks

Company

NBCUniversal

company-logo
NBCUniversal is a media company that provides entertainment and news development, production, distribution, and marketing services. It is a sub-organization of Comcast.

H1B Sponsorship

NBCUniversal has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2020 (1)

Funding

Current Stage
Late Stage
Total Funding
unknown
2011-01-29Acquired

Leadership Team

leader-logo
Jeff Shell
CEO
leader-logo
Stephen Burke
Chief executive officer
Company data provided by crunchbase