Senior Research Scientist (Speech) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Aldea ยท 2 months ago

Senior Research Scientist (Speech)

Aldea is a multi-modal foundational AI company reimagining the scaling laws of intelligence. They are seeking a Foundational AI Research Scientist (Speech) to lead applied research in speech understanding and generation, focusing on developing novel architectures and training strategies for speech-to-text and text-to-speech systems.

Artificial Intelligence (AI)SoftwareSpeech Recognition
check
H1B Sponsor Likelynote

Responsibilities

Research and prototype novel architectures for STT, TTS, and speech-to-speech modeling
Design and execute experiments validating new methods for scalability, performance, and quality
Collaborate cross-functionally with engineering teams to integrate research into real-world products
Stay current with foundational research in speech processing and generative modeling

Qualification

Speech-to-text modelingText-to-speech modelingDeep learning frameworksModern sequence modelingPyTorchTransformersLarge-scale datasetsResearch excellenceCross-lingual systemsVoice-based AI applications

Required

Requires a Ph.D. in Computer Science, Engineering, or related field
3+ years of relevant industry experience
Demonstrated experience in training or researching TTS, STT, or speech-to-speech models
Deep understanding of modern sequence modeling architectures including State Space Models (SSMs), Sparse Attention mechanisms, Mixture of Experts (MoE), and Linear Attention variants
Proven experience with pre-training foundational models from scratch on large-scale datasets
Track record of working with massive multi-modal datasets (audio, text, and speech corpora at scale)
Deep expertise in PyTorch, Transformers, and modern deep-learning frameworks
Ability to translate complex research ideas into high-performance, maintainable code
Evidence of research excellence through impactful technical contributions

Preferred

Experience with voice-based AI applications or multi-speaker synthesis
Publication record in top-tier venues (ICML, NeurIPS, ICLR, ICASSP, Interspeech)
Background in cross-lingual or multilingual speech systems
Experience with data curation, filtering, and quality assessment pipelines for speech data

Benefits

Competitive base salary
Performance-based bonus aligned with research milestones
Equity participation
Comprehensive health, dental, and vision coverage
Flexible paid time off

Company

Aldea

twittertwitter
company-logo
Aldea builds AI voice and language technology with speech-to-text, text-to-speech, and conversational interfaces.

H1B Sponsorship

Aldea has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (2)
2022 (1)
2021 (1)
2020 (1)

Funding

Current Stage
Early Stage
Company data provided by crunchbase