NVIDIA · 6 hours ago
Machine Learning Engineer, GeForce G-Assist
NVIDIA is a leading technology company focused on building innovative AI solutions. They are seeking a Machine Learning Engineer to work on GeForce G-Assist, an on-device AI assistant, where the main responsibilities include evaluating and improving Small Language Models, optimizing local inference, and designing retrieval-augmented generation systems.
AI Infrastructure, Artificial Intelligence (AI), Consumer Electronics, Foundational AI, GPU, Hardware, Software, Virtual Reality
Responsibilities
Together, we focus on how models behave in production, not just on benchmarks.
Evaluate and improve Small Language Models used in GeForce G-Assist, with an emphasis on accuracy, robustness, and conversational reliability
Identify and mitigate conversation and context contamination, including state drift, prompt leakage, and retrieval cross-talk
Work with SLM and VLM architectures to support text and multimodal interactions
Collaborate on hybrid architectures that combine local SLMs with cloud-based models
Optimize local inference using llama.cpp, including quantization, memory usage, and performance tuning
Read, write, and optimize C/C++ code in performance-critical paths
Design and integrate retrieval-augmented generation (RAG) systems that ground responses in system and user context
Support agentic AI workflows, enabling planning, tool use, and multi-step execution
We value engineers who enjoy thinking across the full system, from model behavior to runtime performance.
Qualifications
Required
8+ years of validated experience in system software or a related field, with an M.S. or higher degree in Computer Science, Data Science, Engineering, or a related field (or equivalent experience)
Strong ability to read and write C/C++ code in systems-level or performance-sensitive environments, along with proficiency in Python
Hands-on experience with llama.cpp or similar local inference frameworks
Hands-on experience evaluating Small Language Models, including task-based and conversational testing, with an understanding of conversation dynamics, long-context behavior, and contamination challenges
Knowledge of SLM and VLM architectures and their trade-offs, experience with retrieval technologies and language-model integration, and familiarity with agentic AI patterns such as tool use and planning
Preferred
Experience contributing to language or multimodal models that power user-facing products, features, or workflows
A track record of collaborating with product, platform, or systems teams to balance model capability, performance, and user experience
Demonstrated ability to translate user needs or feedback into measurable improvements in model behavior or system reliability
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-E, ARK Investment Management, SoftBank Vision Fund
2023-05-09 · Grant · $5M
2022-08-09 · Post-IPO Equity · $65M
2021-02-18 · Post-IPO Equity
Recent News
2026-01-25
Unified Communications fuel big enterprise success | CIO
Company data provided by Crunchbase