Crusoe · 19 hours ago
Principal Engineer, AI Model LifeCycle
Crusoe is on a mission to accelerate the abundance of energy and intelligence, focusing on sustainable technology in AI. The Principal Software Engineer for the Model LifeCycle team will build a managed platform for the application development lifecycle, particularly leveraging Machine Learning models and Large Language Models (LLMs).
AI InfrastructureArtificial Intelligence (AI)Data CenterEnergyEnergy ManagementOil and Gas
Responsibilities
Manage fine-tuning systems for large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling
Implement and maintain end-to-end training pipelines for Large Language Models
Distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling)
Agent execution infrastructure
Dataset, model, and experiment management: versioning, lineage, evaluation, and reproducible fine-tuning at scale
Work closely with product, business, and platform teams to shape the core abstractions and APIs of the system
Influence long-term architectural decisions around training runtimes, scheduling, storage, and model lifecycle management
Contribute to and engage with the open-source LLM ecosystem
This role offers significant 0 → 1 ownership — you'll be designing and building core systems from first principles
Qualification
Required
Advanced degree in Computer Science, Engineering, or a related field
10-15+ years of industry experience driving impactful projects in the AI Space
Proven track record of delivering early-stage projects under tight deadlines
Expertise in using cloud-based services, such as, elastic compute, object storage, virtual private networks, managed database, etc
Experience in Generative AI (Large Language Models, Multimodal)
Deep experience with AI infrastructure, including training, inference
Preferred
Proficiency in Golang or Python for large-scale, production-level services
Contributions to open-source AI projects such as vLLM or similar frameworks
Performance optimizations on GPU systems and inference frameworks
Experience working with PyTorch
Experience with training and fine-tuning LLMs
Proactive and collaborative approach with the ability to work autonomously
Strong communication and interpersonal skills
Passion for building cutting-edge AI products and solving challenging technical problems
Benefits
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300/month
Company
Crusoe
Crusoe is a vertically integrated AI infrastructure company that builds and operates data centers powered by energy sources.
H1B Sponsorship
Crusoe has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (69)
2024 (14)
2023 (2)
2022 (1)
2021 (1)
Funding
Current Stage
Late StageTotal Funding
$3.9BKey Investors
Mubadala Capital,Valor Equity PartnersVictory Park CapitalBrookfield Asset Management
2025-12-19Secondary Market
2025-10-23Series E· $1.4B
2025-08-25Debt Financing· $175M
Leadership Team
Recent News
2026-01-22
Bizjournals.com Feed (2025-11-12 15:43:17)
2026-01-16
Company data provided by crunchbase