Black Forest Labs · 1 month ago
Member of Technical Staff - Large Scale Data Infrastructure
Black Forest Labs builds generative models for image and video used by millions of creators, developers, and businesses worldwide. They are seeking a Member of Technical Staff to develop scalable data infrastructure, focusing on building data loaders, designing storage systems, and optimizing performance across large datasets.
Computer Software
Responsibilities
Build scalable data loaders for training runs across thousands of GPUs
Design storage and retrieval systems for petabyte-scale image and video datasets
Develop abstractions over multi-cloud object storage to support flexible training workflows
Execute and validate large-scale data migrations across storage systems and providers
Identify and resolve performance bottlenecks in distributed data pipelines
Work closely with research and infrastructure teams as training requirements evolve
Qualification
Required
Experience building or operating data pipelines at meaningful scale
Strong intuition for optimizing data loading and I/O in distributed systems
Hands-on work with large image or video datasets, often spanning millions of files
Experience debugging performance issues across large fleets of machines
Comfort working in research-adjacent environments where requirements evolve alongside the models
Preferred
Familiarity with distributed job orchestration (e.g. Slurm, Kubernetes) and object storage performance tuning is a plus
Benefits
Equity depending on profile and experience
Company
Black Forest Labs
We’re the leading frontier AI research lab, continuously building the most advanced technology that shapes the visual understanding of the world.
Funding
Current Stage
Early StageCompany data provided by crunchbase