Lenovo · 6 hours ago
Director, AI Model Deployment and Optimization
Lenovo is a global technology powerhouse seeking a visionary technical leader to head its AI Model Deployment and Optimization team. This role involves driving the development, optimization, and large-scale deployment of cutting-edge AI capabilities across Lenovo devices and platforms, ensuring they run seamlessly across various computing environments.
Computer · Consumer Electronics · Electronics · Hardware · Mobile · Wearables
Responsibilities
Lead and scale Lenovo’s AI model deployment and optimization strategy across devices, laptops, and cloud environments
Adapt, fine-tune, and optimize open-source foundation models (e.g., OpenAI, Google Gemma) for Lenovo’s product portfolio
Drive initiatives in model compression, quantization, pruning, and distillation to achieve maximum efficiency on constrained devices while preserving model quality
Oversee performance evaluation, benchmarking, and iterative improvement cycles for large language models, vision models, and multimodal AI
Collaborate closely with hardware architecture teams to align AI model efficiency with device and accelerator capabilities
Develop hardware-aware optimization algorithms and integrate them into model deployment pipelines
Partner with global engineering, research, and product teams to bring optimized AI-powered features (e.g., “Catch Me Up”) to market
Establish and maintain reproducible workflows, automation pipelines, and release-readiness criteria for AI models
Represent Lenovo in AI model optimization research communities, technical working groups, and industry consortiums
Build, mentor, and inspire a high-performance applied AI engineering team
Qualifications
Required
Experience: 10+ years in production software development, including AI/ML engineering, with 5+ years in leadership roles. Proven track record in model deployment, optimization, and benchmarking at scale. Demonstrated ability to deliver production-grade AI models optimized for both on-device and cloud environments
Optimization Techniques: Strong expertise in quantization, pruning, distillation, graph optimization (ONNX, TensorRT), mixed precision, and hardware-specific tuning (GPUs, TPUs, custom accelerators)
Inference Systems: Experience with low-latency serving, batching strategies, caching, and dynamic scaling across clusters
Cloud & Edge Deployment: Deep knowledge of end-to-end deployment of ML/LLM models. Proven ability to deliver across environments: cloud (AWS/GCP/Azure), hybrid, and edge devices
Tooling & Frameworks: Familiarity with PyTorch, TensorFlow, JAX, ONNX Runtime, TensorRT, TVM, and model compilation stacks
Data & Telemetry: Building feedback loops from runtime telemetry to guide retraining, routing, and optimization
Excellent leadership, communication, and cross-functional collaboration skills
Preferred
Graduate degree (MS or PhD) in Computer Science, AI/ML, Computational Engineering, or related field
Experience delivering AI features in consumer electronics or embedded platforms
Familiarity with AI compilation stacks (e.g., TVM, MLX, Core ML Tools)
Track record of collaboration with research institutions and contributions to open-source AI optimization libraries
Security & Compliance: Ensuring secure deployments, model integrity verification, and adherence to privacy regulations
Primary contributions to an AI optimization/compression framework or toolset
Company
Lenovo
Lenovo Group is a computer technology company that manufactures personal computers, smartphones, televisions, and wearable devices.
H1B Sponsorship
Lenovo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart; the highlighted segment represents job fields similar to this job)
Trends of Total Sponsorships
2025 (76)
2024 (52)
2023 (75)
2022 (82)
2021 (58)
2020 (67)
Funding
Current Stage: Public Company
Total Funding: $3.35B
Key Investors: Alat
2025-01-08 · Post-IPO Debt · $2B
2024-04-01 · Post-IPO Debt · $500M
2017-10-03 · Post-IPO Equity · $500M
Company data provided by Crunchbase