Staff ML Engineer, Inference Platform jobs in United States
cer-icon
Apply on Employer Site
company-logo

General Motors · 19 hours ago

Staff ML Engineer, Inference Platform

General Motors is a leader in automotive innovation and AI infrastructure. They are seeking a Staff ML Engineer to build and scale robust compute platforms for machine learning workflows, ensuring efficient model serving and inference in production.

AutomotiveElectric VehicleInformation ServicesManufacturingTransportation
check
H1B Sponsor Likelynote

Responsibilities

Design and implement core platform backend software components
Collaborate with ML engineers and researchers to understand critical workflows, parse them to platform requirements, and deliver incremental value
Lead technical decision-making on model serving strategies, orchestration, caching, model versioning, and auto-scaling mechanisms
Drive the development of monitoring, observability, and metrics to ensure reliability, performance, and resource optimization of inference services
Proactively research and integrate state-of-the-art model serving frameworks, hardware accelerators, and distributed computing techniques
Lead large-scale technical initiatives across GM’s ML ecosystem
Raise the engineering bar through technical leadership, establishing best practices
Contribute to open source projects; represent GM in relevant communities

Qualification

ML inferenceModel serving frameworksDistributed systemsCloud platformsGoPythonC++Product mindsetMulti-taskingCommunicationProblem-solving skills

Required

8+ years of industry experience, with focus on machine learning systems or high performance backend services
Expertise in either Go, Python, C++ or other relevant coding languages
Expertise in ML inference, model serving frameworks (triton, rayserve, vLLM etc)
Strong communication skills and a proven ability to drive cross-functional initiatives
Experience working with cloud platforms such as GCP, Azure, or AWS
Ability to thrive in a dynamic, multi-tasking environment with ever-evolving priorities

Preferred

Hands-on experience building ML infrastructure platforms for model serving/inference
Experience working with or designing interfaces, apis and clients for ML workflows
Experience with Ray framework, and/or vLLM
Experience with distributed systems, and handling large-scale data processing
Familiarity with telemetry, and other feedback loops to inform product improvements
Familiarity with hardware acceleration (GPUs) and optimizations for inference workloads
Contributions to open-source ML serving frameworks

Benefits

Medical
Dental
Vision
Health Savings Account
Flexible Spending Accounts
Retirement savings plan
Sickness and accident benefits
Life insurance
Paid vacation & holidays
Tuition assistance programs
Employee assistance program
GM vehicle discounts
Company vehicle evaluation program

Company

General Motors

company-logo
General Motors is an automotive company that designs, produces, markets, and distributes vehicles and vehicle parts.

H1B Sponsorship

General Motors has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (599)
2024 (740)
2023 (450)
2022 (795)
2021 (748)
2020 (452)

Funding

Current Stage
Public Company
Total Funding
$8.51B
Key Investors
US Department of Energy
2025-05-05Post Ipo Debt· $2B
2024-10-31Grant· $8M
2024-07-11Grant· $500M

Leadership Team

leader-logo
Mary Barra
Chair and Chief Executive Officer
linkedin
leader-logo
Tony Cervone
Senior Vice President, Global Communications
linkedin
Company data provided by crunchbase