RDMA Engineer - Supercomputing jobs in United States
cer-icon
Apply on Employer Site
company-logo

xAI · 2 weeks ago

RDMA Engineer - Supercomputing

xAI is on a mission to create AI systems that enhance humanity's understanding of the universe. They are seeking an RDMA Engineer to design and optimize networking solutions for GPU supercomputing clusters, focusing on low-latency and high-bandwidth communication systems.

Artificial Intelligence (AI)Information TechnologyFoundational AIGenerative AIMachine Learning
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Develop and tune RDMA-based communication systems leveraging NVIDIA GPUs and Mellanox NICs (InfiniBand, RoCE) for ultra-fast data transfer between nodes
Implement and optimize GPUDirect RDMA to enable direct memory access between GPUs and network interfaces, minimizing CPU overhead
Integrate RDMA solutions with Kubernetes-based workloads, ensuring seamless operation across distributed compute and storage systems
Collaborate with AI researchers and infrastructure teams to accelerate data pipelines and collective communications using NCCL and MPI
Troubleshoot and resolve performance bottlenecks in high-throughput, low-latency networking environments

Qualification

NVIDIA RDMA technologiesRust programmingC/C++ programmingMPINCCLKubernetes networkingDistributed systems optimizationPrioritization skillsCommunication skillsWork ethic

Required

Hands-on experience with NVIDIA RDMA technologies (e.g., GPUDirect RDMA, RoCE, InfiniBand) in HPC or AI supercomputing environments
Proficiency in programming with Rust, C, or C++ for low-level networking and system optimization
Familiarity with NVIDIA's networking stack, including Mellanox drivers, libraries (e.g., libibverbs), and tools (e.g., NVPeerMemory)
Experience optimizing distributed systems with MPI, NCCL, or similar frameworks for GPU-accelerated workloads
Knowledge of Kubernetes networking and integrating RDMA into containerized environments

Preferred

Background in AI/ML training workflows and their networking demands (e.g., large-scale parameter synchronization)

Company

xAI

twittertwittertwitter
company-logo
XAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.

H1B Sponsorship

xAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Late Stage
Total Funding
$42.73B
Key Investors
Neptune Digital AssetsSpaceXMorgan Stanley
2026-02-02Acquired
2026-01-06Series E· $20B
2025-12-11Secondary Market· $0.3M

Leadership Team

leader-logo
Greg Yang
Co-Founder
linkedin
leader-logo
Yuhuai Wu
Co-Founder
linkedin
Company data provided by crunchbase