Software Engineer, AI Inference Platform jobs in United States
cer-icon
Apply on Employer Site
company-logo

ElastixAI · 5 months ago

Software Engineer, AI Inference Platform

ElastixAI is an early-stage startup poised to revolutionize AI inference infrastructure. They are seeking a talented Software Engineer to build their core AI inference platform, focusing on designing and developing critical components while collaborating with cross-functional teams.

Artificial Intelligence (AI)Generative AIMachine Learning
check
H1B Sponsor Likelynote

Responsibilities

Design, develop, and maintain core components of our ML deployment platform, focusing on scalability, reliability, and ease of use
Research, prototype, and implement advanced model sharding and distribution strategies to optimize inference performance across diverse hardware targets
Develop and enhance a detailed performance simulator to model and predict the behavior of AI models on various infrastructure configurations
Collaborate with ML engineers to integrate and support optimized models within the deployment pipeline
Work with cloud and systems engineers to ensure efficient utilization of underlying hardware resources
Contribute to the design of APIs and tools that enable seamless integration and management of our inference solutions
Write clean, maintainable, and well-tested code
Participate in the full software development lifecycle, from design and implementation to testing and deployment

Qualification

ML infrastructurePythonDistributed systemsML frameworksC++GoContainerizationCloud platformsProblem-solvingCommunication skills

Required

BS/MS/PhD in Computer Science, Software Engineering, or a related field
3+ years of professional software development experience, with a focus on systems programming, distributed systems, or ML infrastructure
Strong proficiency in one or more programming languages such as Python, C++, or Go
Experience with ML frameworks (e.g., PyTorch, TensorFlow, JAX) and understanding of ML model deployment challenges
Solid understanding of software engineering best practices, including data structures, algorithms, and testing
Excellent problem-solving abilities and a knack for tackling complex technical challenges
Strong communication skills and a proven ability to collaborate effectively in a cross-functional team environment
Ability to thrive in a fast-paced, dynamic startup environment

Preferred

Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes)
Familiarity with performance analysis, profiling, and optimization techniques
Experience building or working with performance modeling or simulation tools
Knowledge of network protocols and distributed computing concepts
Experience with cloud platforms (AWS, GCP, Azure)

Benefits

Comprehensive medical, dental, and vision coverage (100% paid by employer)
Life insurance and AD&D
Flexible Time Off (FTO)
12-paid holidays
Paid parental leave
Gym or fitness benefit
Commuter benefit
Weekly catered lunches in the office
Investment in employee learning & development

Company

ElastixAI

twittertwitter
company-logo
ElastixAI is developing an AI inference platform designed to optimize how large language models are run.

H1B Sponsorship

ElastixAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Early Stage
Total Funding
$16M
Key Investors
FUSE
2025-05-14Series Unknown· $16M
Company data provided by crunchbase