NVIDIA · 3 days ago
Senior Deep Learning Software Engineer, Inference
NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. They are seeking a Senior Software Engineer specializing in Deep Learning Inference to design, build, and optimize GPU-accelerated software for AI applications.
AI Infrastructure, Artificial Intelligence (AI), Consumer Electronics, Foundational AI, GPU, Hardware, Software, Virtual Reality
Responsibilities
Performance optimization, analysis, and tuning of DL models in domains such as LLM, multimodal, and generative AI
Scale performance of DL models across different architectures and types of NVIDIA accelerators
Contribute features and code to NVIDIA’s inference libraries, vLLM, SGLang, FlashInfer, and LLM software solutions
Collaborate with cross-functional teams across frameworks, NVIDIA libraries, and innovative inference optimization solutions
Qualifications
Required
Master's or PhD, or equivalent experience, in a relevant field (Computer Engineering, Computer Science, EECS, AI)
5+ years of relevant software development experience
Excellent C/C++ programming and software design skills
Agile software development skills are helpful, and Python experience is a plus
Prior experience with training, deploying or optimizing the inference of DL models in production is a plus
Prior background in performance modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs, is a plus
Preferred
Contributions to deep learning software projects, such as PyTorch, vLLM, and SGLang, that drive advancements in the field
Experience with Multi-GPU Communications (NCCL, NVSHMEM)
Experience building and shipping products to enterprise customers
GPU programming experience (CUDA, OpenAI Triton, or CUTLASS)
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-E, ARK Investment Management, SoftBank Vision Fund
2023-05-09 · Grant · $5M
2022-08-09 · Post-IPO Equity · $65M
2021-02-18 · Post-IPO Equity
Company data provided by crunchbase