Lenovo · 13 hours ago
AI Systems Engineer
Lenovo is a global technology powerhouse focused on delivering Smarter Technology for All. They are seeking an AI Systems Performance Engineer to contribute to the design, development, and optimization of next-generation AI systems, particularly for large-scale AI workloads.
ComputerConsumer ElectronicsElectronicsHardwareMobileWearables
Responsibilities
End-to-end performance analysis Analyze performance of LLM and agentic workloads across the full stack: models, runtimes, compilers, kernels, memory, interconnect, and distributed deployment
Model- and context-aware tuning Characterize and optimize performance for models of varying size and context length, including tradeoffs around batch size, KV/cache management, quantization, and latency vs. throughput
Memory microarchitectural analysis Profile memory usage and access patterns across CPU, GPU, and accelerators; identify bottlenecks related to cache behavior, memory bandwidth, and compute utilization; propose and validate optimizations
Networking distributed systems Study and improve performance in heterogeneous distributed systems (multi-node, multi-accelerator), considering different networking conditions (latency, bandwidth, congestion); tune sharding, pipelining, and routing strategies
Benchmarking methodology Design, implement, and maintain benchmarks and load tests for LLM and agentic workloads under realistic traffic patterns and SLAs
Optimization experimentation Collaborate with ML, platform, and infrastructure teams to prototype and roll out optimizations (e.g., kernel-level improvements, scheduling changes, batching policies, caching strategies)
Observability capacity planning Build and refine dashboards, alerts, and reports that surface key performance and efficiency metrics; provide data-driven guidance for capacity planning and hardware selection
Cross-functional collaboration Work closely with model, runtime, and platform teams to translate performance findings into architectural improvements and product-impacting changes
Qualification
Required
2+ years of industry experience in systems performance engineering, ML infrastructure, HPC, or related fields
Master's degree or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related technical field
Strong understanding of computer architecture: CPU/GPU pipelines, caches, memory hierarchies, vector/SIMD, and accelerators
Experience profiling and optimizing performance of complex systems using tools such as perf, VTune, Nsight, rocprof, or similar
Strong coding skills in C++ and/or Python
Experience working with Linux-based systems, shell scripting, and standard tooling
Familiarity with containerized environments and orchestration (e.g., Docker, Kubernetes)
Experience working with ML workloads (preferably deep learning) in frameworks like PyTorch, TensorFlow, or JAX
Conceptual understanding of LLM inference, including batching, token generation, and context window behavior
Understanding of distributed systems concepts (RPC, load balancing, fault tolerance) and basic networking fundamentals (latency, bandwidth, throughput)
Strong data analysis skills; comfortable working with logs, traces, and metrics
Ability to clearly communicate findings and tradeoffs to both engineering and non-engineering stakeholders
Hands-on experience optimizing LLM inference or other large-scale deep learning workloads on GPUs or specialized accelerators
Experience with heterogeneous systems (e.g., mixtures of CPU, GPU, NPU/ASIC) and cluster-scale deployment
Familiarity with LLM-specific optimization techniques (KV cache strategies, quantization, tensor/sequence parallelism, speculative decoding, etc.)
Experience with large-scale observability stacks (Prometheus, Grafana, OpenTelemetry) for performance monitoring
Prior work on high-performance computing (HPC), networking-intensive systems, or real-time/low-latency services
Company
Lenovo
Lenovo Group is a computer technology company that manufactures personal computers, smartphones, televisions, and wearable devices.
H1B Sponsorship
Lenovo has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (76)
2024 (52)
2023 (75)
2022 (82)
2021 (58)
2020 (67)
Funding
Current Stage
Public CompanyTotal Funding
$3.35BKey Investors
Alat
2025-01-08Post Ipo Debt· $2B
2024-04-01Post Ipo Debt· $500M
2017-10-03Post Ipo Equity· $500M
Leadership Team
Recent News
2025-12-31
2025-12-31
2025-12-31
Company data provided by crunchbase