Senior Software Engineer II - AI Workload Orchestration jobs in United States

CoreWeave · 1 hour ago

Senior Software Engineer II - AI Workload Orchestration

CoreWeave is The Essential Cloud for AI™, delivering a platform of technology and tools for innovators to build and scale AI. The Senior Software Engineer II will design and operate Kubernetes-native services for AI workload orchestration, driving improvements in reliability and performance while mentoring junior engineers.

AI Infrastructure, Artificial Intelligence (AI), Cloud Computing, Cloud Infrastructure, Information Technology, Machine Learning
No H1B, U.S. Citizen Only

Responsibilities

Design, build, and operate Kubernetes-native services for AI workload orchestration and scheduling
Own one or more platform components end-to-end, including design, implementation, testing, and on-call support
Improve scheduling latency, cluster utilization, and workload reliability through metrics-driven engineering
Contribute to architectural discussions across services and influence design decisions within the platform
Work closely with adjacent teams (CKS, infrastructure, managed inference) to ensure clean interfaces and integrations
Mentor junior engineers and raise the quality bar for code, design, and operations

Qualifications

Kubernetes, Go, Distributed systems, AI workload orchestration, Scheduling frameworks, System reliability, Cloud infrastructure, Performance improvement, Data-driven engineering, Mentoring

Required

5–8 years of professional software engineering experience in distributed systems, cloud infrastructure, or platform engineering
Strong experience building production systems in Go (Python or C++ a plus)
Solid understanding of Kubernetes fundamentals, APIs, controllers, and operating services in production
Experience working with scheduling, resource management, or quota-based systems
Proven ability to improve system reliability and performance using data and operational metrics
Comfortable owning services in production and participating in on-call rotations

Preferred

Experience with Kubernetes-native orchestration frameworks such as Kueue, Volcano, Ray, Kubeflow, or Argo Workflows
Familiarity with GPU-based workloads, ML training, or inference pipelines
Knowledge of scheduling concepts such as quota enforcement, pre-emption, and backfilling
Experience with reliability practices including SLOs, alerting, and incident response
Exposure to AI infrastructure, HPC, or large-scale distributed compute environments

Benefits

Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short- and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company

CoreWeave

CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads.

Funding

Current Stage: Public Company
Total Funding: $24.87B
Key Investors: Jane Street Capital, Stack Capital, Coatue
2025-12-08 · Post-IPO Debt · $2.54B
2025-11-12 · Post-IPO Debt · $2.5B
2025-08-20 · Post-IPO Secondary

Leadership Team

Michael Intrator, Chief Executive Officer
Brannin McBee, Founder & CDO
Company data provided by Crunchbase