ByteDance · 4 months ago
Tech Lead, AML Orchestration
ByteDance is a technology company that inspires creativity and enriches life through its innovative products. They are seeking a Tech Lead for AML Orchestration to advance their distributed orchestration platforms and lead a team of Machine Learning Engineers, focusing on resource efficiency and distributed training systems.
Content, Data Mining, Foundational AI, Internet, Social Media
Responsibilities
Lead, mentor, and grow a team of orchestration-focused ML engineers; set technical vision and ensure engineering excellence
Design and optimize distributed orchestration and scheduling strategies across large-scale Kubernetes/Godel environments, ensuring efficiency, reliability, and scalability
Drive initiatives for autoscaling, resource multiplexing, and preemption across heterogeneous workloads and clusters, including multi-datacenter and multi-cloud setups
Partner with framework, platform and research teams to build next-generation distributed training and serving systems for ultra-large, high-dimensional recommendation models
Architect robust and elastic online orchestration frameworks for large-scale inference, supporting evolving recommendation and ads models
Stay ahead of trends in orchestration, scheduling, and distributed computing, incorporating best practices and emerging technologies
Qualifications
Required
Bachelor's degree or higher in Computer Science, Engineering, or a related field
5+ years of experience in large-scale distributed systems, with at least 5 years in a technical leadership role
Proficiency in one or more modern programming languages (Golang, Python, C++, or similar)
Deep understanding of orchestration frameworks (e.g., Kubernetes, Yarn) and distributed systems design principles
Proven experience optimizing system performance, resource utilization, and scheduling strategies
Strong analytical thinking, problem-solving, and communication skills
Preferred
Experience with orchestration or ML frameworks such as Ray, TFX, VeRL, vLLM, or equivalent
Familiarity with distributed computing systems (Spark, Flink) and ML pipelines
Contributions to open-source scheduling or ML infrastructure projects
Hands-on experience with multi-tenant environments and cloud-native architectures
Experience collaborating with and leading global, cross-functional teams across different time zones
Benefits
Medical, dental, and vision insurance
401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)
Company
ByteDance
ByteDance is a technology company that develops content creation platforms and services.
H1B Sponsorship
ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (1350)
2024 (1123)
2023 (775)
2022 (487)
2021 (417)
2020 (245)
Funding
Current Stage: Late Stage
Total Funding: $9.8B
Key Investors: Capital Today, G42, Tiger Global Management
2025-11-20: Secondary Market · $300M
2024-07-25: Secondary Market
2023-03-14: Secondary Market · $100M
Company data provided by Crunchbase