Machine Learning Engineer, GeForce G-Assist jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 6 hours ago

Machine Learning Engineer, GeForce G-Assist

NVIDIA is a leading technology company focused on building innovative AI solutions. They are seeking a Machine Learning Engineer to work on GeForce G-Assist, an on-device AI assistant, where the main responsibilities include evaluating and improving Small Language Models, optimizing local inference, and designing retrieval-augmented generation systems.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Together, we focus on how models behave in production, not just on benchmarks. Evaluate and improve Small Language Models used in GeForce G-Assist, with an emphasis on accuracy, robustness, and conversational reliability. Identify and mitigate conversation and context contamination, including state drift, prompt leakage, and retrieval cross-talk
Work with SLM and VLM architectures to support text and multimodal interactions. Collaborate on hybrid architectures that combine local SLMs with cloud-based models. We value engineers who enjoy thinking across the full system—from model behavior to runtime performance
Optimize local inference using llama.cpp, including quantization, memory usage, and performance tuning. Read, write, and optimize C/C++ code in performance-critical paths
Design and integrate retrieval-augmented generation (RAG) systems that ground responses in system and user context. Support agentic AI workflows, enabling planning, tool use, and multi-step execution

Qualification

C/C++ programmingPythonSmall Language ModelsLlama.cppRetrieval technologiesUser feedback translationCollaborationProblem-solving

Required

8+ years of validated experience in system software or a related field, with an M.S. or higher degree in Computer Science, Data Science, Engineering, or a related field (or equivalent experience)
Strong ability to read and write C/C++ code in systems-level or performance-sensitive environments, along with proficiency in Python
Hands-on experience with llama.cpp or similar local inference frameworks
Hands-on experience evaluating Small Language Models, including task-based and conversational testing, with an understanding of conversation dynamics, long-context behavior, and contamination challenges
Knowledge of SLM and VLM architectures and their trade-offs, experience with retrieval technologies and language-model integration, and familiarity with agentic AI patterns such as tool use and planning

Preferred

Experience contributing to language or multimodal models that power user-facing products, features, or workflows
A track record of collaborating with product, platform, or systems teams to balance model capability, performance, and user experience
Demonstrated ability to translate user needs or feedback into measurable improvements in model behavior or system reliability

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase