Sr. Network Engineer/Rack Solution jobs in United States

Supermicro · 2 days ago

Sr. Network Engineer/Rack Solution

Supermicro is a top-tier provider of advanced server, storage, and networking solutions for various sectors. The Sr. Network Engineer will roll out and maintain business-critical applications, resolve escalated service issues, and engineer complex projects, while demonstrating leadership and strong communication skills.

Artificial Intelligence (AI) · Cloud Computing · Cloud Infrastructure · Embedded Systems · Manufacturing · Software
H1B Sponsor Likely

Responsibilities

Execute comprehensive system-level rack tests on the latest NVIDIA and AMD GPUs and on ARM-based, Intel Xeon, and AMD EPYC processors, encompassing functionality, compatibility, performance, stress, and reliability testing, leveraging proprietary in-house tools
Establish expertise in HPC/AI applications and benchmarks, delivering impactful training sessions to customers and partners, while addressing complex customer support issues, demonstrating innovative problem-solving skills and building robust processes and procedures for HPC/AI solutions
Conduct proof of concept design and testing, providing optimized benchmarks for HPC/AI applications in a timely manner. Fine-tune BIOS settings, optimize OS/network configurations, and develop diverse simulation configurations to enhance efficiency across various workloads
Deliver on-site deployment services, ensuring customer acceptance verification and providing post-deployment Level 1 and Level 2 support. Create and maintain technical documentation, including technical notes, blogs, and diagrams, to facilitate knowledge dissemination
Identify and document hardware and software quality issues and collaborate with Product Management and other Engineering teams to integrate customer feedback into future product enhancements
Proactively engage in HPC roadmap development, planning software and hardware upgrades to sustain exceptional HPC infrastructure performance
Document and analyze test plans, reports, and logs, and actively contribute to the development of test utilities and automation scripts to streamline testing processes

Qualifications

Deep Learning · Machine Learning · Linux debugging/testing · AI/ML frameworks · DevOps · Docker/Containers · Kubernetes · Shell scripting · HPC/AI applications · Server/network hardware debugging · CCNA · OpenStack · OpenShift · Azure · AWS · Teamwork · Communication skills

Required

BS/MS in Electrical Engineering, Computer Engineering or Computer Science
8+ years of work-related experience in Deep Learning and Machine Learning
Experience with leading AI/ML frameworks such as PyTorch, TensorFlow, and ONNX
Experience with DevOps or in cloud environments, including but not limited to Docker/Containers and Kubernetes
Hands-on experience with workload managers/schedulers (Slurm) for racks/clusters
Familiarity with MLPerf Training/Inference benchmarks, LLMs, HPL-AI, or RCCL/NCCL
Programming experience with Windows and Linux shell scripting
Strong sense of teamwork and strong communication skills

Preferred

8+ years of Linux/networking debugging/testing or relevant experience
Familiarity with Intel/AMD/NVIDIA development toolkits such as CUDA, oneAPI, and ROCm is a plus
Experience with server/network hardware debugging and troubleshooting is a plus
CCNA, OpenStack, OpenShift, Azure or AWS is a plus

Benefits

Comprehensive benefits package
Participation in bonus and equity award programs

Company

Supermicro

Supermicro is a global leader in high-performance, high-efficiency server technology and innovation.

H1B Sponsorship

Supermicro has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship; the highlighted segment represents job fields similar to this job]
Trends of Total Sponsorships
2025 (35)
2024 (33)
2023 (27)
2022 (29)
2021 (30)
2020 (42)

Funding

Current Stage
Public Company
Total Funding
$4.5B
2025-06-24 · Post-IPO Debt · $2.3B
2025-02-11 · Post-IPO Debt · $700M
2024-02-23 · Post-IPO Debt · $1.5B

Leadership Team

Matt Thauberger
Senior Vice President Strategy Business Development
Somik Behera
General Manager, Cloud, Cluster Mgmt, Datacenter & AI Software Products
Company data provided by Crunchbase