Student Researcher [Seed Vision – Multimodal Interaction & World Model Pretraining] – 2026 Start (PhD) jobs in United States
cer-icon
Apply on Employer Site
company-logo

ByteDance · 7 hours ago

Student Researcher [Seed Vision – Multimodal Interaction & World Model Pretraining] – 2026 Start (PhD)

ByteDance is dedicated to pioneering advanced AI foundation models and is seeking a PhD intern for their Seed Multimodal Interaction and World Model team. The role involves contributing to research and engineering efforts to enhance multimodal understanding and developing models for interactive exploration.

ContentData MiningFoundational AIInternetSocial Media
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Contribute to research and engineering to advance world models and multimodal understanding, enhancing models' reasoning and generation capabilities
Design and prototype novel architectures that balance modeling performance, generalization, and efficiency
Help establish scaling laws and conduct systematic ablations to derive transferrable insights across model families and tasks

Qualification

Computer VisionMachine LearningPyTorchLarge-scale trainingMultimodal modelingTransformer architecturesFoundation model pretrainingResearch publicationsScaling behavior analysisData preparation pipelines

Required

Currently pursuing a PhD in Computer Vision, Machine Learning, or a related technical field
Familiarity with multimodal modeling, world models, or foundation model pretraining
Strong coding skills and hands-on experience with PyTorch or JAX
Experience with large-scale distributed training frameworks and GPU/TPU compute stacks
Demonstrated research ability, with publications in top-tier conferences such as CVPR, ICCV, ECCV, NeurIPS, ICML, or ICLR

Preferred

Experience working with transformer-based architectures, including dense and Mixture-of-Experts (MoE) models
Understanding of scaling behavior in foundation models and how to analyze them
Familiarity with data preparation pipelines for large-scale multimodal pretraining

Benefits

Health insurance
Life insurance
Wellbeing benefits
10 paid holidays per year
Paid sick time (56 hours if hired in first half of year, 40 if hired in second half of year)
Housing allowance

Company

ByteDance

company-logo
ByteDance is a technology company that develops content creation platforms and services.

H1B Sponsorship

ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1350)
2024 (1123)
2023 (775)
2022 (487)
2021 (417)
2020 (245)

Funding

Current Stage
Late Stage
Total Funding
$9.8B
Key Investors
Capital TodayG42Tiger Global Management
2025-11-20Secondary Market· $300M
2024-07-25Secondary Market
2023-03-14Secondary Market· $100M

Leadership Team

leader-logo
Jochen Bischoff
Head of Global Business Solutions - Africa
linkedin
leader-logo
Matty Lin
General Manager, Global Business Solutions, KR
linkedin
Company data provided by crunchbase