Liquid AI · 5 months ago
Member of Technical Staff - Training Infrastructure Engineer
Liquid AI is a company spun out of MIT CSAIL that builds AI systems designed for low latency and minimal memory usage. They are seeking a Training Infrastructure Engineer to design, implement, and optimize distributed systems for their next-generation Liquid Foundation Models, focusing on building critical infrastructure from the ground up.
Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
Responsibilities
Design and implement a scalable training infrastructure for our GPU clusters
Build data loading systems that eliminate I/O bottlenecks for multimodal datasets
Develop checkpointing mechanisms balancing memory constraints with recovery needs
Optimize communication patterns to minimize distributed training overhead
Create monitoring and debugging tools for training stability
Qualification
Required
Hands-on experience building distributed training infrastructure (PyTorch Distributed, DeepSpeed, or Megatron-LM)
Understanding of hardware accelerators and networking topologies
Experience optimizing data pipelines for ML workloads
Preferred
MoE (Mixture of Experts) training experience
Large-scale distributed training (100+ GPUs)
Open-source contributions to training infrastructure projects
Benefits
We pay 100% of medical, dental, and vision premiums for employees and dependents
401(k) matching up to 4% of base pay
Unlimited PTO plus company-wide Refill Days throughout the year
Company
Liquid AI
Build efficient general-purpose AI at every scale.
H1B Sponsorship
Liquid AI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
Funding
Current Stage
Growth StageTotal Funding
$293.1MKey Investors
AMD VenturesOSS Capital L.P.
2024-12-13Series A· $250M
2023-12-01Seed· $37.5M
2023-05-05Seed· $5.6M
Recent News
2025-12-06
Digital Commerce 360
2025-11-15
Company data provided by crunchbase