Meta · 15 hours ago
Research Scientist Intern, Multimodal AI
Meta is a technology company focused on connecting people and building immersive experiences through augmented and virtual reality. The company is seeking a Research Scientist Intern to conduct groundbreaking research in audio signal processing and machine learning for AR/VR applications, contributing to projects in multimodal representation learning and audio-visual scene analysis.
Computer Software
Responsibilities
Design, implement, and maintain comprehensive evaluation protocols for large language models, including both automated and human-in-the-loop assessments
Develop and curate high-quality datasets and benchmarks to measure model performance, safety, fairness, and robustness across a variety of tasks and modalities
Analyze model outputs to identify strengths, weaknesses, and failure modes, and provide actionable insights to research and engineering teams
Design and implement novel algorithms to solve audio research problems
Collaborate with teams building Meta’s language AI products
Collaborate with researchers, engineers, and cross-functional partners to define evaluation goals, communicate findings, and drive improvements in model quality
Develop tools and infrastructure to streamline and scale evaluation processes, including dashboards, annotation platforms, and reporting systems
Stay up-to-date with the latest research in audio LLM evaluation, benchmarking, and responsible AI, and incorporate best practices into Meta’s workflows
Disseminate evaluation results through internal reports, presentations, and, when appropriate, external publications
Qualifications
Required
3+ years of experience with Python, MATLAB, or similar
3+ years of experience with machine learning software platforms such as PyTorch or TensorFlow
Experience building novel audio computational models and LLMs
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Preferred
Experience in advancing AI techniques, including core contributions to open source libraries and frameworks in computer vision or audio processing
Experience with audio and speech quality assessment
Experience with multichannel audio processing
Experience in visual and acoustic scene analysis
Experience manipulating and analyzing complex, large scale, high-dimensionality data from varying sources
Proven track record of achieving significant results, as demonstrated by grants, fellowships, patents, or first-authored publications at leading workshops or top computer vision and machine learning conferences such as NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, ICCV, ECCV, ICASSP, Interspeech, or similar
Experience in utilizing theoretical and empirical research to solve problems
Experience working and communicating cross-functionally in a team environment
Benefits
Company
Meta
Meta's mission is to build the future of human connection and the technology that makes it possible.
Funding
Current Stage
Late Stage
Company data provided by crunchbase