ElastixAI · 5 months ago
Software Engineer, AI Inference Platform
ElastixAI is an early-stage startup poised to revolutionize AI inference infrastructure. They are seeking a talented Software Engineer to build their core AI inference platform, focusing on designing and developing critical components while collaborating with cross-functional teams.
Artificial Intelligence (AI)Generative AIMachine Learning
Responsibilities
Design, develop, and maintain core components of our ML deployment platform, focusing on scalability, reliability, and ease of use
Research, prototype, and implement advanced model sharding and distribution strategies to optimize inference performance across diverse hardware targets
Develop and enhance a detailed performance simulator to model and predict the behavior of AI models on various infrastructure configurations
Collaborate with ML engineers to integrate and support optimized models within the deployment pipeline
Work with cloud and systems engineers to ensure efficient utilization of underlying hardware resources
Contribute to the design of APIs and tools that enable seamless integration and management of our inference solutions
Write clean, maintainable, and well-tested code
Participate in the full software development lifecycle, from design and implementation to testing and deployment
Qualification
Required
BS/MS/PhD in Computer Science, Software Engineering, or a related field
3+ years of professional software development experience, with a focus on systems programming, distributed systems, or ML infrastructure
Strong proficiency in one or more programming languages such as Python, C++, or Go
Experience with ML frameworks (e.g., PyTorch, TensorFlow, JAX) and understanding of ML model deployment challenges
Solid understanding of software engineering best practices, including data structures, algorithms, and testing
Excellent problem-solving abilities and a knack for tackling complex technical challenges
Strong communication skills and a proven ability to collaborate effectively in a cross-functional team environment
Ability to thrive in a fast-paced, dynamic startup environment
Preferred
Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes)
Familiarity with performance analysis, profiling, and optimization techniques
Experience building or working with performance modeling or simulation tools
Knowledge of network protocols and distributed computing concepts
Experience with cloud platforms (AWS, GCP, Azure)
Benefits
Comprehensive medical, dental, and vision coverage (100% paid by employer)
Life insurance and AD&D
Flexible Time Off (FTO)
12-paid holidays
Paid parental leave
Gym or fitness benefit
Commuter benefit
Weekly catered lunches in the office
Investment in employee learning & development
Company
ElastixAI
ElastixAI is developing an AI inference platform designed to optimize how large language models are run.
H1B Sponsorship
ElastixAI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Early StageTotal Funding
$16MKey Investors
FUSE
2025-05-14Series Unknown· $16M
Company data provided by crunchbase