Aldea · 2 months ago
Senior Research Scientist (Speech)
Aldea is a multi-modal foundational AI company reimagining the scaling laws of intelligence. They are seeking a Foundational AI Research Scientist (Speech) to lead applied research in speech understanding and generation, focusing on developing novel architectures and training strategies for speech-to-text and text-to-speech systems.
Artificial Intelligence (AI), Software, Speech Recognition
Responsibilities
Research and prototype novel architectures for STT, TTS, and speech-to-speech modeling
Design and execute experiments validating new methods for scalability, performance, and quality
Collaborate cross-functionally with engineering teams to integrate research into real-world products
Stay current with foundational research in speech processing and generative modeling
Qualifications
Required
Ph.D. in Computer Science, Engineering, or a related field
3+ years of relevant industry experience
Demonstrated experience in training or researching TTS, STT, or speech-to-speech models
Deep understanding of modern sequence modeling architectures including State Space Models (SSMs), Sparse Attention mechanisms, Mixture of Experts (MoE), and Linear Attention variants
Proven experience with pre-training foundational models from scratch on large-scale datasets
Track record of working with massive multi-modal datasets (audio, text, and speech corpora at scale)
Deep expertise in PyTorch, Transformers, and modern deep-learning frameworks
Ability to translate complex research ideas into high-performance, maintainable code
Evidence of research excellence through impactful technical contributions
Preferred
Experience with voice-based AI applications or multi-speaker synthesis
Publication record in top-tier venues (ICML, NeurIPS, ICLR, ICASSP, Interspeech)
Background in cross-lingual or multilingual speech systems
Experience with data curation, filtering, and quality assessment pipelines for speech data
Benefits
Competitive base salary
Performance-based bonus aligned with research milestones
Equity participation
Comprehensive health, dental, and vision coverage
Flexible paid time off
Company
Aldea
Aldea builds AI voice and language technology with speech-to-text, text-to-speech, and conversational interfaces.
H1B Sponsorship
Aldea has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart not shown; the highlighted segment represents job fields similar to this job)
Trends of Total Sponsorships
2025 (1)
2024 (2)
2022 (1)
2021 (1)
2020 (1)
Funding
Current Stage
Early Stage
Company data provided by Crunchbase