Amazon · 5 hours ago
Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality
Amazon Data Services, Inc. is seeking a Sr. Hardware Reliability Engineer to drive reliability risk identification and mitigation for datacenter infrastructure equipment. The role involves conducting root cause analysis of equipment failures and implementing continuous improvements to enhance datacenter availability for AWS customers.
Artificial Intelligence (AI) · Delivery · E-Commerce · Foundational AI · Retail
Responsibilities
Drive DFR (Design for Reliability) methodology to proactively design reliability into new product designs
Drive reliability/quality qualification of third-party critical infrastructure equipment for use in AWS data centers
Oversee factory and site testing of third-party equipment in all LLE categories (liquid cooling, generator, chiller, air handler, etc.)
Guide and support Root Cause Analysis of field failures performed by internal teams, the OEM, and external laboratories. Validate conclusions and ensure the highest standards are applied in testing and remediation
Make recommendations about AWS infrastructure maintenance and equipment replacement based on reliability data
Provide feedback to sourcing/procurement teams for evaluation of vendor performance
Analyze internal reliability data and create metrics to drive the highest reliability at the lowest cost
Support DFMEAs on an as-needed basis
Develop end-of-life strategy for critical infrastructure equipment
Qualifications
Required
Experience in industrial or commercial engineering in mission-critical facilities, including but not limited to data centers, power generation, or oil and gas facilities
3+ years of root cause analysis and troubleshooting or problem solving experience
Bachelor's or Master's degree in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering or related field
8+ years of Reliability Engineering work experience in a high-reliability industry
3+ years of experience with accelerated life testing, stress analysis, and finite element analysis
Preferred
Experience influencing internal and external stakeholders
Experience prioritizing and handling multiple assignments at any given time while maintaining commitment to deadlines, or experience handling confidential information and maintaining professionalism in dealing with senior executives
Ph.D. in mechanical engineering, electrical engineering, materials science, physics, or equivalent
Experience in Data Center Engineering Operations, with a deep understanding of electrical and mechanical data center infrastructure
10+ years of work experience in reliability risk identification and assessment from component to system level applying analytical, experimental and statistical approaches to evaluate product design and manufacture quality/reliability levels
Experience applying proactive, cost-effective reliability approaches throughout product design, manufacture, and deployment stages
Proven experience in working with external design and manufacturing supply chain partners
Excellent verbal and written communication skills
Benefits
Equity
Sign-on payments
Medical
Financial
Other benefits
Company
Amazon
Amazon is a tech firm with a focus on e-commerce, cloud computing, digital streaming, and artificial intelligence.
H1B Sponsorship
Amazon has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)
Funding
Current Stage: Public Company
Total Funding: $8.11B
Key Investors: Amazon, Kleiner Perkins
2023-01-03 · Post-IPO Debt · $8B
2001-07-24 · Post-IPO Equity · $100M
1997-05-15 · IPO
Company data provided by Crunchbase