Director, AI Model Deployment and Optimization jobs in United States
cer-icon
Apply on Employer Site
company-logo

Lenovo · 6 hours ago

Director, AI Model Deployment and Optimization

Lenovo is a global technology powerhouse seeking a visionary technical leader to head their AI Model Deployment Optimization team. This role involves driving the development, optimization, and large-scale deployment of cutting-edge AI capabilities across Lenovo devices and platforms, ensuring they run seamlessly across various computing environments.

ComputerConsumer ElectronicsElectronicsHardwareMobileWearables
check
H1B Sponsor Likelynote

Responsibilities

Lead and scale Lenovo’s AI model deployment and optimization strategy across devices, laptops, and cloud environments
Adapt, fine-tune, and optimize open-source foundation models (e.g., OpenAI, Google Gemma) for Lenovo’s product portfolio
Drive initiatives in model compression, quantization, pruning, and distillation to achieve maximum efficiency on constrained devices while preserving model quality
Oversee performance evaluation, benchmarking, and iterative improvement cycles for large language models, vision models, and multimodal AI
Collaborate closely with hardware architecture teams to align AI model efficiency with device and accelerator capabilities
Develop hardware-aware optimization algorithms and integrate them into model deployment pipelines
Partner with global engineering, research, and product teams to bring optimized AI-powered features (e.g., “Catch Me Up”) to market
Establish and maintain reproducible workflows, automation pipelines, and release-readiness criteria for AI models
Represent Lenovo in AI model optimization research communities, technical working groups, and industry consortiums
Build, mentor, and inspire a high-performance applied AI engineering team

Qualification

AI/ML engineeringModel deploymentOptimization techniquesCloud deploymentPyTorchTensorFlowData telemetryLeadership experienceCommunication skillsCross-functional collaboration

Required

Experience: 10+ years in production software development, including AI/ML engineering, with 5+ years in leadership roles. Proven track record in model deployment, optimization, and benchmarking at scale. Demonstrated ability to deliver production-grade AI models optimized for both on-device and cloud environments
Optimization Techniques: Strong expertise in quantization, pruning, distillation, graph optimization (ONNX, TensorRT), mixed precision, and hardware-specific tuning (GPUs, TPUs, custom accelerators)
Inference Systems: Experience with low-latency serving, batching strategies, caching, and dynamic scaling across clusters
Cloud Edge Deployment: Deep knowledge of end-to-end deployment of ML/LLM models. Proven ability to deliver across environments — cloud (AWS/GCP/Azure), hybrid, and edge devices
Tooling Frameworks: Familiarity with PyTorch, TensorFlow, JAX, ONNX Runtime, TensorRT, TVM, and model compilation stacks
Data Telemetry: Building feedback loops from runtime telemetry to guide retraining, routing, and optimization
Excellent leadership, communication, and cross-functional collaboration skills

Preferred

Graduate degree (MS or PhD) in Computer Science, AI/ML, Computational Engineering, or related field
Experience delivering AI features in consumer electronics or embedded platforms
Familiarity with AI compilation stacks (e.g., TVM, MLX, Core ML Tools)
Track record of collaboration with research institutions and contributions to open-source AI optimization libraries
Security Compliance: Ensuring secure deployments, model integrity verification, and adherence to privacy regulations
Primary contributions to an AI optimization/compression framework or toolset

Company

Lenovo Group is a computer technology company that manufactures personal computers, smartphones, televisions, and wearable devices.

H1B Sponsorship

Lenovo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (76)
2024 (52)
2023 (75)
2022 (82)
2021 (58)
2020 (67)

Funding

Current Stage
Public Company
Total Funding
$3.35B
Key Investors
Alat
2025-01-08Post Ipo Debt· $2B
2024-04-01Post Ipo Debt· $500M
2017-10-03Post Ipo Equity· $500M

Leadership Team

leader-logo
Yang Yuanqing
Chairman & CEO
linkedin
leader-logo
Greg Huff
CTO, CSO, and SVP of Development, Quality, and Customer Care, Infrastructure Solutions Group
linkedin
Company data provided by crunchbase