Software Engineer, ML System Scheduling jobs in United States
cer-icon
Apply on Employer Site
company-logo

ByteDance · 3 hours ago

Software Engineer, ML System Scheduling

ByteDance is a leading tech company focused on inspiring creativity and driving technological advancements. They are seeking a Software Engineer for their ML System Scheduling team to design and develop resource scheduling for machine learning models, optimizing computing resources across various scenarios.

ContentData MiningFoundational AIInternetSocial Media
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Responsible for the design and development of resource scheduling, including model training, model evaluation and model inference in various scenarios (LLM/AIGC/NLP/CV/Speech, etc.)
Responsible for the optimal orchestration of various computing resources (GPU, CPU, other heterogeneous hardware), realizing the rational use of stable resources, tidal resources, mixed resources, and multi-cloud resources
Responsible for the optimal combination of computing resources, RDMA high-speed network resources, and storage resources, and giving full play to the power of large-scale distributed clusters
Responsible for offline and online workload scheduling in global data centers integrating multi-cloud scenarios to achieve rational distributions

Qualification

GoPythonKubernetesMachine LearningTensorFlowPyTorchDistributed SystemsLogical AnalysisSelf-DriveCommunication SkillsLearning Ability

Required

Be proficient in 1 to 2 programming languages such as Go/Python/Shell in Linux environment
Be familiar with Kubernetes architecture and container technology such as Docker/Containerd/Kata/Podman, and have rich experience in Machine Learning system practice and development
Understand the principles of distributed systems and have experience in the design, development and maintenance of large-scale distributed systems
Have an excellent logical analysis ability, able to reasonably abstract and split business logic
Have a strong sense of responsibility, good learning ability, communication skills and self-drive, able to respond and act quickly

Preferred

Familiar with at least one major Machine Learning framework (TensorFlow/PyTorch)
Experience in one of the following fields: AI Infrastructure, HW/SW Co-Design, High Performance Computing, ML Hardware Architecture (GPU, Accelerators, Networking)

Benefits

Medical, dental, and vision insurance
401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Company

ByteDance

company-logo
ByteDance is a technology company that develops content creation platforms and services.

H1B Sponsorship

ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1350)
2024 (1123)
2023 (775)
2022 (487)
2021 (417)
2020 (245)

Funding

Current Stage
Late Stage
Total Funding
$9.8B
Key Investors
Capital TodayG42Tiger Global Management
2025-11-20Secondary Market· $300M
2024-07-25Secondary Market
2023-03-14Secondary Market· $100M

Leadership Team

leader-logo
Jochen Bischoff
Head of Global Business Solutions - Africa
linkedin
leader-logo
Matty Lin
General Manager, Global Business Solutions, KR
linkedin
Company data provided by crunchbase