Tech Lead, AML Orchestration jobs in United States

ByteDance · 4 months ago

Tech Lead, AML Orchestration

ByteDance is a technology company that inspires creativity and enriches life through its innovative products. It is seeking a Tech Lead for AML Orchestration to advance its distributed orchestration platforms and lead a team of Machine Learning Engineers, focusing on resource efficiency and distributed training systems.

Content · Data Mining · Foundational AI · Internet · Social Media
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Lead, mentor, and grow a team of orchestration-focused ML engineers; set technical vision and ensure engineering excellence
Design and optimize distributed orchestration and scheduling strategies across large-scale Kubernetes/Godel environments, ensuring efficiency, reliability, and scalability
Drive initiatives for autoscaling, resource multiplexing, and preemption across heterogeneous workloads and clusters, including multi-datacenter and multi-cloud setups
Partner with framework, platform and research teams to build next-generation distributed training and serving systems for ultra-large, high-dimensional recommendation models
Architect robust and elastic online orchestration frameworks for large-scale inference, supporting evolving recommendation and ads models
Stay ahead of trends in orchestration, scheduling, and distributed computing, incorporating best practices and emerging technologies

Qualifications

Large-scale distributed systems · Orchestration frameworks · Technical leadership · Modern programming languages · System performance optimization · Distributed computing systems · Analytical thinking · Problem-solving · Communication skills · Collaboration

Required

Bachelor's degree or higher in Computer Science, Engineering, or a related field
5+ years of experience in large-scale distributed systems, with at least 5 years in a technical leadership role
Proficiency in one or more modern programming languages (Golang, Python, C++, or similar)
Deep understanding of orchestration frameworks (e.g., Kubernetes, YARN) and distributed systems design principles
Proven experience optimizing system performance, resource utilization, and scheduling strategies
Strong analytical thinking, problem-solving, and communication skills

Preferred

Experience with orchestration or ML frameworks such as Ray, TFX, VeRL, vLLM, or equivalent
Familiarity with distributed computing systems (Spark, Flink) and ML pipelines
Contributions to open-source scheduling or ML infrastructure projects
Hands-on experience with multi-tenant environments and cloud-native architectures
Experience collaborating with and leading global, cross-functional teams across different time zones

Benefits

Medical, dental, and vision insurance
401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Company

ByteDance

ByteDance is a technology company that develops content creation platforms and services.

H1B Sponsorship

ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (1350)
2024 (1123)
2023 (775)
2022 (487)
2021 (417)
2020 (245)

Funding

Current Stage
Late Stage
Total Funding
$9.8B
Key Investors
Capital Today · G42 · Tiger Global Management
2025-11-20 · Secondary Market · $300M
2024-07-25 · Secondary Market
2023-03-14 · Secondary Market · $100M

Leadership Team

Jochen Bischoff
Head of Global Business Solutions - Africa

Matty Lin
General Manager, Global Business Solutions, KR
Company data provided by Crunchbase