Principal Engineer, AI Model LifeCycle jobs in United States

Crusoe · 19 hours ago

Principal Engineer, AI Model LifeCycle

Crusoe is on a mission to accelerate the abundance of energy and intelligence, focusing on sustainable technology in AI. The Principal Software Engineer for the Model LifeCycle team will build a managed platform for the application development lifecycle, with a focus on applications that use Machine Learning models and Large Language Models (LLMs).

AI Infrastructure, Artificial Intelligence (AI), Data Center, Energy, Energy Management, Oil and Gas
H1B Sponsor Likely

Responsibilities

Manage fine-tuning systems for large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling
Implement and maintain end-to-end training pipelines for Large Language Models
Distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling)
Agent execution infrastructure
Dataset, model, and experiment management: versioning, lineage, evaluation, and reproducible fine-tuning at scale
Work closely with product, business, and platform teams to shape the core abstractions and APIs of the system
Influence long-term architectural decisions around training runtimes, scheduling, storage, and model lifecycle management
Contribute to and engage with the open-source LLM ecosystem
This role offers significant 0 → 1 ownership — you'll be designing and building core systems from first principles

Qualifications

Machine Learning, Large Language Models, AI infrastructure, Cloud-based services, Golang, Python, PyTorch, Performance optimization, Proactive approach, Communication

Required

Advanced degree in Computer Science, Engineering, or a related field
10-15+ years of industry experience driving impactful projects in the AI space
Proven track record of delivering early-stage projects under tight deadlines
Expertise in using cloud-based services such as elastic compute, object storage, virtual private networks, and managed databases
Experience in Generative AI (Large Language Models, Multimodal)
Deep experience with AI infrastructure, including training and inference

Preferred

Proficiency in Golang or Python for large-scale, production-level services
Contributions to open-source AI projects such as vLLM or similar frameworks
Performance optimizations on GPU systems and inference frameworks
Experience working with PyTorch
Experience with training and fine-tuning LLMs
Proactive and collaborative approach with the ability to work autonomously
Strong communication and interpersonal skills
Passion for building cutting-edge AI products and solving challenging technical problems

Benefits

Industry competitive pay
Restricted Stock Units in a fast-growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company-paid commuter benefit: $300/month

Company

Crusoe

Crusoe is a vertically integrated AI infrastructure company that builds and operates data centers powered by energy sources.

H1B Sponsorship

Crusoe has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (69)
2024 (14)
2023 (2)
2022 (1)
2021 (1)

Funding

Current Stage: Late Stage
Total Funding: $3.9B
Key Investors: Mubadala Capital, Valor Equity Partners, Victory Park Capital, Brookfield Asset Management
2025-12-19: Secondary Market
2025-10-23: Series E · $1.4B
2025-08-25: Debt Financing · $175M

Leadership Team

Chase Lochmiller
Co-Founder and Chief Executive Officer
Cully Cavness
Co-Founder, President and Chief Strategy Officer
Company data provided by Crunchbase