HeyGen · 6 hours ago
Software Engineer, Data
HeyGen is dedicated to making visual storytelling accessible to everyone. They are seeking a Software Engineer with data engineering responsibilities to develop data foundational layers for next-generation features, enabling AI models to function in real-time and enhancing user experiences.
E-LearningGenerative AISoftwareWeb Apps
Responsibilities
Build & Scale Data Pipelines: Design, develop, and maintain robust batch and real-time data pipelines (using Python, Go, Spark, Kafka) that ingest and transform massive multi-modal data—text, audio, and video—to train and run AI models
Power Intelligent Features: Collaborate with ML engineers to implement data structures and APIs for new, exciting features like PPT-to-video automation and interactive AI avatars that require low-latency data fetching
Data Lakehouse Infrastructure: Architect and manage data lakehouse solutions (e.g., Snowflake, Databricks, Apache Iceberg) to store and query unstructured media data efficiently, enhancing storage and computation efficiency
Data Reliability & Observability: Implement data quality checks, data contracts, and monitoring to ensure high reliability of data, preventing downtime in production video generation
Productize Data: Transform raw data into structured, actionable data products that can be easily consumed by front-end applications, API endpoints, and AI agents
Qualification
Required
Bachelor's/Master's degree in Computer Science, Engineering, or a related field
3-5+ years of experience as a Backend Software Engineer with heavy data processing responsibilities
Strong proficiency in Python (for ETL/scripting) and SQL (for data modeling)
Experience with cloud platforms (AWS/GCP) and data technologies like Kafka, Spark, and Snowflake/Databricks
Experience or interest in Computer Vision/Generative AI data processing
Proactive, 'owner' mindset; ability to operate in a fast-paced, startup environment
Benefits
Equity
Benefits
401k plan
Health benefits
Generous PTO
A parental leave program
Emotional health resources
Company
HeyGen
HeyGen is an AI video generation platform that specializes in video creation, AI avatars, and generative AI.
H1B Sponsorship
HeyGen has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (5)
Funding
Current Stage
Growth StageTotal Funding
$69MKey Investors
Benchmark
2024-03-25Series A· $60M
2022-11-08Seed· $9M
Recent News
Tech Funding News
2025-10-31
Company data provided by crunchbase