Senior/Staff Software Engineer, Inference jobs in United States
cer-icon
Apply on Employer Site
company-logo

Anthropic · 2 weeks ago

Senior/Staff Software Engineer, Inference

Anthropic is a public benefit corporation dedicated to creating reliable and beneficial AI systems. The Senior/Staff Software Engineer in Inference will be responsible for building and maintaining critical systems that serve AI models to millions of users, focusing on maximizing compute efficiency and enabling research breakthroughs.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
check
H1B Sponsorednote

Responsibilities

Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators
Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads
Building production-grade deployment pipelines for releasing new models to millions of users
Integrating new AI accelerator platforms to maintain our hardware-agnostic competitive advantage
Contributing to new inference features (e.g., structured sampling, prompt caching)
Supporting inference for new model architectures
Analyzing observability data to tune performance based on real-world production workloads
Managing multi-region deployments and geographic routing for global customers

Qualification

Distributed systemsMachine learning systemsKubernetesCloud infrastructurePythonRustLoad balancingRequest routingPair programmingTechnical excellenceCommunication skills

Required

Significant software engineering experience, particularly with distributed systems
Results-oriented, with a bias towards flexibility and impact
Ability to pick up slack, even if it goes outside your job description
Enjoy pair programming
Desire to learn more about machine learning systems and infrastructure
Thrive in environments where technical excellence directly drives both business results and research breakthroughs
Care about the societal impacts of your work
At least a Bachelor's degree in a related field or equivalent experience

Preferred

High-performance, large-scale distributed systems experience
Implementing and deploying machine learning systems at scale
Experience with load balancing, request routing, or traffic management systems
LLM inference optimization, batching, and caching strategies
Kubernetes and cloud infrastructure (AWS, GCP)
Experience with Python or Rust

Benefits

Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
A lovely office space in which to collaborate with colleagues

Company

Anthropic

twittertwittertwitter
company-logo
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

H1B Sponsorship

Anthropic has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (105)
2024 (13)
2023 (3)
2022 (4)
2021 (1)

Funding

Current Stage
Late Stage
Total Funding
$33.74B
Key Investors
Lightspeed Venture PartnersGoogleAmazon
2025-09-02Series F· $13B
2025-05-16Debt Financing· $2.5B
2025-03-03Series E· $3.5B

Leadership Team

leader-logo
Dario Amodei
Co-Founder and CEO
linkedin
leader-logo
Daniela Amodei
President and co-founder
linkedin
Company data provided by crunchbase