Nebius · 12 hours ago
System Engineer
Nebius is leading a new era in cloud computing to serve the global AI economy. They are seeking a System Engineer to support their expanding North American operations, focusing on the design, deployment, and maintenance of high-performance cloud systems optimized for AI workloads.
AI InfrastructureCloud InfrastructureGPUIaaSPaaS
Responsibilities
Participate in the design, deployment, and maintenance of high-performance cloud systems optimized for AI workloads
Arrange and perform hardware R&D tests and experiments on-site in data center environments
Troubleshoot and resolve complex system issues related to GPUs, networking (InfiniBand, NVLink), PCIe, and server infrastructure
Conduct deep investigations into hardware, software, and networking issues to ensure optimal system performance and reliability
Develop and execute test plans and methodologies for advanced GPU, InfiniBand, and compute systems to benchmark and validate performance
Collaborate closely with cross-functional engineering and operations teams to improve system performance and reliability
Monitor system performance and continuously fine-tune configurations for maximum efficiency
Qualification
Required
Strong knowledge of modern server architecture, particularly in high-performance, GPU-based environments
Hands-on experience with GPUs, networking, NVLink, and PCIe technologies
Proficiency in Linux systems, with experience using Python and Bash for automation and tooling
Demonstrated ability to troubleshoot complex hardware, software, and networking issues
Experience with deep problem investigation, root cause analysis, and performance optimization in cloud or high-performance computing environments
Strong analytical and problem-solving skills with a performance-first mindset
Basic electronics modification skills, including soldering and wiring
Preferred
Knowledge of the Linux kernel and experience with kernel-level debugging or troubleshooting
Familiarity with electronic measurement equipment such as oscilloscopes and multimeters
Benefits
Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
401(k) plan: up to 4% company match with immediate vesting.
Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
Remote work reimbursement: up to $85/month for mobile and internet.
Disability & life insurance: company-paid short-term, long-term and life insurance coverage.
Company
Nebius
The Nebius AI Cloud brings powerful full-stack infrastructure for AI developers and practitioners across startups, enterprises and science institutes to build and deploy generative AI applications and rapidly deliver scientific breakthroughs by training and running ML models within a secure, high-performance, and cost-optimized cloud environment.
Funding
Current Stage
Late StageTotal Funding
$1.04B2025-06-04Debt Financing· $1B
2025-05-15Grant· $45M
2024-12-02Seed
Recent News
2025-10-25
Company data provided by crunchbase