Meta · 15 hours ago

Research Scientist Intern, Multimodal AI

Redmond, WA

Internship

Onsite

Intern

$7,650/mo - $12,134/mo

3+ years exp

Meta is a technology company focused on connecting people and building immersive experiences through augmented and virtual reality. It is seeking a Research Scientist Intern to conduct research in audio signal processing and machine learning for AR/VR applications, contributing to projects in multimodal representation learning and audio-visual scene analysis.

Computer Software

Comp. & Benefits

Responsibilities

Design, implement, and maintain comprehensive evaluation protocols for large language models, including both automated and human-in-the-loop assessments

Develop and curate high-quality datasets and benchmarks to measure model performance, safety, fairness, and robustness across a variety of tasks and modalities

Analyze model outputs to identify strengths, weaknesses, and failure modes, and provide actionable insights to research and engineering teams

Design and implement novel algorithms to solve audio research problems

Collaborate with teams building Meta’s language AI products, and with researchers, engineers, and cross-functional partners, to define evaluation goals, communicate findings, and drive improvements in model quality

Develop tools and infrastructure to streamline and scale evaluation processes, including dashboards, annotation platforms, and reporting systems

Stay up-to-date with the latest research in audio LLM evaluation, benchmarking, and responsible AI, and incorporate best practices into Meta’s workflows

Disseminate evaluation results through internal reports, presentations, and, when appropriate, external publications

Qualifications

Python, Machine learning, Audio computational models, PyTorch, TensorFlow, Audio processing, Speech quality assessment, Scene analysis, Cross-functional communication, Team collaboration

Required

3+ years of experience with Python, MATLAB, or similar

3+ years of experience with machine learning software platforms such as PyTorch or TensorFlow

Experience building novel audio computational models and LLMs

Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Preferred

Experience in advancing AI techniques, including core contributions to open source libraries and frameworks in computer vision or audio processing

Experience with audio and speech quality assessment

Experience with multichannel audio processing

Experience in visual and acoustic scene analysis

Experience manipulating and analyzing complex, large scale, high-dimensionality data from varying sources

Proven track record of achieving significant results, as demonstrated by grants, fellowships, patents, or first-authored publications at leading workshops or top computer vision and machine learning conferences such as NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, ICCV, ECCV, ICASSP, Interspeech, or similar

Experience in utilizing theoretical and empirical research to solve problems

Experience working and communicating cross functionally in a team environment

Company

Funding

Current Stage

Late Stage

Leadership Team

Kathryn Glickman

Director, CEO Communications

Christine Lu

CTO Business Engineering NA

Company data provided by Crunchbase