AI Inference Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Quadric · 2 months ago

AI Inference Engineer

Quadric is a company that has developed a unique general purpose neural processing unit architecture aimed at running neural network inference workloads. The AI Inference Engineer will serve as a crucial link between AI models and Quadric's platforms, focusing on porting, optimizing, and benchmarking AI models for efficient deployment.

ComputerHardwareSemiconductor
check
H1B Sponsor Likelynote

Responsibilities

Quantize, prune and convert models for deployment
Port models to Quadric platform using Quadric toolchain
Optimize inference deployment for latency, speed
Benchmark and profile model performance and accuracy
Collaborate across related areas of the AI inference stack to support team and business priorities
Develop tools to scale and speed up the deployment
Make Improvement to SDK and runtime
Provide technical support and documents to customers and developer community

Qualification

AI model inferenceModel quantizationC/C++ proficiencyPython proficiencyModel performance profilingAI frameworks experienceDebuggingProblem solvingCommunication

Required

Bachelor's or Master's in Computer Science and/or Electric Engineering
5+ years of experience in AI/LLM model inference and deployment frameworks/tools
experience with model quantization (PTQ, QAT) and tools
experience with model accuracy measures
experience with model inference performance profiling
experience with at least one of the following frameworks: onnxruntime, Pytorch, vLLM, huggingface-transformer, neural-compressor, llamacpp
Proficiency in C/C++ and Python
Demonstrate good capability in problem solving, debug and communication

Benefits

Health Care Plan (Medical, Dental & Vision)
Retirement Plan (401k, IRA)
Life Insurance (Basic, Voluntary & AD&D)
Paid Time Off (Vacation, Sick & Public Holidays)
Family Leave (Maternity, Paternity)
Short Term & Long Term Disability
Training & Development
Work From Home
Free Food & Snacks
Stock Option Plan

Company

Quadric

twittertwittertwitter
company-logo
Quadric develops semiconductor intellectual property and tools for on-device artificial intelligence computing.

H1B Sponsorship

Quadric has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Growth Stage
Total Funding
$73.75M
Key Investors
BEENEXTDensoNSITEXE
2026-01-14Series C· $30M
2022-12-15Series B· $5.5M
2022-12-15Debt Financing

Leadership Team

leader-logo
Daniel Firu
Co-Founder & CPO
linkedin

Recent News

Company data provided by crunchbase