Apple · 1 month ago
Sr. Engineering Manager, AI Evaluation Platform
Apple is seeking a hands-on Engineering Manager to architect high-availability services and internal tools for AI evaluation systems. The role involves leading an engineering team, designing APIs and platforms for self-service evaluation, and ensuring the integration of research innovations into scalable infrastructure.
AppsArtificial Intelligence (AI)BroadcastingDigital EntertainmentFoundational AIMedia and EntertainmentMobile DevicesOperating SystemsTVWearables
Responsibilities
Team Building & Leadership: Hire, mentor, and grow a diverse, high-performing team of backend and platform engineers. Foster a culture of technical excellence and rapid delivery as you build this new team from the ground up
Technical Strategy & Roadmap: Own the engineering roadmap for the core evaluation engine. Architect the APIs, SDKs, and distributed services that power our internal platform, enabling product teams to measure Generative AI performance autonomously
Operationalizing Science: Partner closely with Applied Scientists to translate novel metrics, judge prompts, and scoring algorithms into scalable, production-grade services. Create frameworks to evaluate not just simple responses, but also multi-turn agent trajectories and tool usage
System Integration: Serve as a technical bridge between the research organization and the broader engineering ecosystem, ensuring our tools integrate seamlessly with existing ML infrastructure and developer workflows
Engineering Rigor: Establish the software development lifecycle (SDLC) for the team, defining standards for code quality, automated testing (CI/CD), and monitoring to ensure high availability and reliability
Qualification
Required
5+ years of direct engineering management experience, with a proven track record of hiring, mentoring, and retaining high-performing engineers. You have successfully managed teams that ship production-grade software
7+ years of hands-on software engineering experience with deep proficiency in the Python ecosystem (e.g., FastAPI, Pydantic, Pandas). You are capable of contributing to code reviews and architectural discussions on day one
Customer Obsession & Product Thinking: Experience acting as a technical partner to internal customers. You can translate vague requirements from other teams into concrete engineering specifications and are comfortable prioritizing the roadmap in the absence of a dedicated Product Manager
Demonstrated experience partnering with Data Scientists or Researchers: You have a history of taking experimental or 'messy' code and refactoring it into reliable, scalable production systems
Functional literacy in AI/ML concepts: You understand the fundamental lifecycle of machine learning (datasets, training vs. inference, evaluation metrics) and can discuss the engineering challenges involved in serving models
Strong expertise in API Design & Internal Tools: You have architected APIs that other developers rely on, with a focus on versioning, backward compatibility, and developer experience
Operational excellence background: You have practical experience establishing CI/CD pipelines, containerization (Docker/Kubernetes), and monitoring (Datadog/Prometheus)
Preferred
Experience building MLOps & Platform Infrastructure: You have architected or managed teams that built the foundational infrastructure for AI, such as model registries, inference services, or feature stores (using tools like Kubernetes, Ray, or Kubeflow)
Deep familiarity with AI Evaluation Frameworks: You have used or contributed to modern evaluation tools like DeepEval, Ragas, TruLens, or LangSmith. You understand how to implement and scale model-based evaluation workflows
Deep understanding of Generative AI & Agents: You understand the engineering challenges of relying on LLMs and Agents as software components—specifically managing token economics, handling rate limits, and evaluating non-deterministic, multi-step reasoning capabilities
Builder Experience: You have thrived in startup-like environments or incubated new teams within larger orgs, navigating high ambiguity to define roadmaps where none existed
Benefits
Comprehensive medical and dental coverage
Retirement benefits
A range of discounted products and free services
Reimbursement for certain educational expenses — including tuition
Discretionary bonuses or commission payments
Relocation
Company
Apple
Apple is a technology company that designs, manufactures, and markets consumer electronics, personal computers, and software.
H1B Sponsorship
Apple has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (6998)
2024 (3766)
2023 (3939)
2022 (4822)
2021 (4060)
2020 (3656)
Funding
Current Stage
Public CompanyTotal Funding
$5.67BKey Investors
Berkshire HathawayMicrosoftSequoia Capital
2025-05-05Post Ipo Debt· $4.5B
2025-01-16Post Ipo Debt· $0.31M
2021-04-30Post Ipo Equity
Leadership Team
Tim Cook
CEO
Craig Federighi
SVP, Software Engineering
Recent News
Venrock
2025-12-01
2025-09-25
Mac Daily News
2025-09-25
Company data provided by crunchbase