Apply on Employer Site

NVIDIA · 6 hours ago

Machine Learning Engineer, GeForce G-Assist

US, CA, Santa Clara

Full-time

Onsite

Senior Level, Lead/Staff

$184K/yr - $356K/yr

8+ years exp

NVIDIA is a leading technology company focused on building innovative AI solutions. They are seeking a Machine Learning Engineer to work on GeForce G-Assist, an on-device AI assistant, where the main responsibilities include evaluating and improving Small Language Models, optimizing local inference, and designing retrieval-augmented generation systems.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality

Growth Opportunities

H1B Sponsor Likely

Responsibilities

Together, we focus on how models behave in production, not just on benchmarks. Evaluate and improve Small Language Models used in GeForce G-Assist, with an emphasis on accuracy, robustness, and conversational reliability. Identify and mitigate conversation and context contamination, including state drift, prompt leakage, and retrieval cross-talk

Work with SLM and VLM architectures to support text and multimodal interactions. Collaborate on hybrid architectures that combine local SLMs with cloud-based models. We value engineers who enjoy thinking across the full system—from model behavior to runtime performance

Optimize local inference using llama.cpp, including quantization, memory usage, and performance tuning. Read, write, and optimize C/C++ code in performance-critical paths

Design and integrate retrieval-augmented generation (RAG) systems that ground responses in system and user context. Support agentic AI workflows, enabling planning, tool use, and multi-step execution

Qualification

C/C++ programmingPythonSmall Language ModelsLlama.cppRetrieval technologiesUser feedback translationCollaborationProblem-solving

Required

8+ years of validated experience in system software or a related field, with an M.S. or higher degree in Computer Science, Data Science, Engineering, or a related field (or equivalent experience)

Strong ability to read and write C/C++ code in systems-level or performance-sensitive environments, along with proficiency in Python

Hands-on experience with llama.cpp or similar local inference frameworks

Hands-on experience evaluating Small Language Models, including task-based and conversational testing, with an understanding of conversation dynamics, long-context behavior, and contamination challenges

Knowledge of SLM and VLM architectures and their trade-offs, experience with retrieval technologies and language-model integration, and familiarity with agentic AI patterns such as tool use and planning

Preferred

Experience contributing to language or multimodal models that power user-facing products, features, or workflows

A track record of collaborating with product, platform, or systems teams to balance model capability, performance, and user experience

Demonstrated ability to translate user needs or feedback into measurable improvements in model behavior or system reliability

Benefits

Equity

Benefits

Company

NVIDIA

Glassdoor4.6

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

Founded in 1993

Santa Clara, California, USA

10001+ employees

https://www.nvidia.com

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (1877)

2024 (1355)

2023 (976)

2022 (835)

2021 (601)

2020 (529)

Funding

Current Stage

Public Company

Total Funding

$4.09B

Key Investors

ARPA-EARK Investment ManagementSoftBank Vision Fund

2023-05-09Grant· $5M

2022-08-09Post Ipo Equity· $65M

2021-02-18Post Ipo Equity

Leadership Team

Jensen Huang

Founder and CEO

Michael Kagan

Chief Technology Officer

Recent News

IndiaTimes

Nvidia board member Persis Drell after over a decade: What SEC filing said

2026-01-25

Unified Communications fuel big enterprise success | CIO

There’s (industrial) strength in numbers when building AI

2026-01-25

pv magazine USA

Enphase on how distributed energy assets could boost data centers

2026-01-25

Company data provided by crunchbase