Microsoft · 17 hours ago
Senior Research Engineer
Microsoft’s CoreAI group is seeking a Senior Research Engineer for the AI Data Platform team, which focuses on managing the lifecycle of AI training data. The role involves designing data quality frameworks, operationalizing quality signals, and collaborating with stakeholders to enhance the data lifecycle for AI model development.
Agentic AI, Application Performance Management, Artificial Intelligence (AI), Business Development, DevOps, Information Services, Information Technology, Management Information Systems, Network Security, Software
Responsibilities
Design and build a data quality evaluation framework for AI training datasets, including scalable metrics, testing methodologies, and automated reporting
Define and operationalize quality signals aligned to model outcomes (e.g., coverage, diversity, noise/duplication, labeling consistency, safety/toxicity, privacy/compliance risk indicators)
Develop task- and model-aware evaluation approaches that connect dataset properties to training performance, reliability, and safety
Create automated dataset validation gates and monitoring to support continuous dataset iteration (e.g., regression detection across dataset versions)
Design and implement synthetic data generation pipelines (LLM-driven and programmatic approaches) to improve long-tail representation, fill coverage gaps, and accelerate iteration cycles
Build guardrails for synthetic data: filtering, scoring, calibration, provenance tracking, and bias/safety checks to ensure quality and compliance
Partner with engineering to integrate evaluation and generation into the platform’s end-to-end data lifecycle
Collaborate with cross-functional stakeholders to run experiments, establish best practices, and deliver reusable tools that scale across multiple model and product teams
Qualifications
Required
Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research)
OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Preferred
3+ years of coding experience in Python and experience with ML frameworks such as PyTorch and Triton
2+ years of experience with large-scale model training for LLMs, SLMs, and agentic models
2+ years of proven ability to design and scale training infrastructure and pipelines in production environments
Experience with agent training frameworks
Hands-on experience with large-scale distributed training and/or serving with demonstrated ability to dive deep into complex systems, troubleshoot unconventional issues, and craft innovative solutions under real-world constraints
Extensive experience with large-scale training, model inference, reinforcement learning, and reasoning models
Demonstrated ability to work in cross-functional teams and collaborate effectively with researchers, product managers, and other engineers to deliver complex ML solutions
Startup-style mindset: agile, solution-oriented, and self-driven
Benefits
Certain roles may be eligible for benefits and other compensation.
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
H1B Sponsorship
Microsoft has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships
2025 (9192)
2024 (9343)
2023 (7677)
2022 (11403)
2021 (7210)
2020 (7852)
Funding
Current Stage: Public Company
Total Funding: $1M
Key Investors: Technology Venture Investors
2022-12-09: Post-IPO Equity
1986-03-13: IPO
1981-09-01: Series Unknown · $1M
Company data provided by Crunchbase