AI Cluster & Data Center Design Engineer jobs in United States

AMD · 8 hours ago

AI Cluster & Data Center Design Engineer

AMD is dedicated to transforming lives with its technology and building products that accelerate next-generation computing experiences. The AI Cluster & Data Center Design Engineer role involves architecting and designing scalable AI/HPC clusters, focusing on power delivery and collaborating with cross-functional teams to optimize performance and reliability for global deployments.

AI Infrastructure, Artificial Intelligence (AI), Cloud Computing, Computer, Embedded Systems, GPU, Hardware, Semiconductor
Growth Opportunities
No H1B

Responsibilities

Design scalable AI/HPC clusters, including compute, storage, and networking, with a specific focus on power delivery
Evaluate and select CPUs, GPUs, accelerators, interconnects, and memory configurations for optimal cluster performance
Design leading-edge power delivery solutions for high-density AI/GPU deployments
Understand differences in power delivery and regulatory requirements across global locations, e.g., the U.S., EMEA, Asia, and other regions
Define power budgets, redundancy schemes, and fault tolerance mechanisms
Design network topologies to maximize overall cluster performance
Understand the network performance needs of different types of workloads
Understand advantages and performance trade-offs of network topologies for AI/HPC clusters
Design and optimize storage solutions to maximize AI/HPC cluster performance
Understand advantages and performance trade-offs of cluster storage solutions such as Lustre and Ceph
Work across multiple organizations with subject matter experts from hardware, software, network, data center, and operations teams to deliver scalable, efficient, and reliable compute infrastructure

Qualifications

HPC infrastructure, AI infrastructure, Data center engineering, Power delivery solutions, Networking components, Storage solutions, Problem-solving, Communication skills, Documentation skills

Required

Experience in HPC, AI infrastructure, or data center systems engineering
Strong understanding of rack and data center power delivery
Knowledge of GPU/CPU architectures, PCIe, UALink, InfiniBand, and Ethernet networking
Familiarity with AI/ML frameworks and workload characteristics
Excellent problem-solving, communication, and documentation skills
Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, Computer Science or related field

Preferred

Experience in HPC, AI infrastructure, or data center systems engineering
Experience designing power delivery solutions for racks and data centers
Contributions to open-source HPC or AI infrastructure projects

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

Funding

Current Stage: Public Company
Total Funding: unknown
Key Investors: OpenAI, Daniel Loeb
2025-10-06: Post-IPO Equity
2023-03-02: Post-IPO Equity
2021-06-29: Post-IPO Equity

Leadership Team

Lisa Su, Chair & CEO
Mark Papermaster, CTO and EVP
Company data provided by Crunchbase