Amazon Web Services (AWS) ยท 2 days ago
Sr. Software Engineer- AI/ML, AWS Neuron Apps
Amazon Web Services (AWS) is at the forefront of AI technology with its AWS Neuron team, which focuses on deploying and optimizing sophisticated AI models. As a Senior Software Engineer, you will pioneer distributed inference solutions and collaborate with silicon architects to enhance AI performance and efficiency.
ConsultingDevOpsInformation TechnologySoftwareWeb Development
Responsibilities
Pioneer distributed inference solutions for industry-leading LLMs such as GPT, Llama, Qwen
Optimize breakthrough language and vision generative AI models
Collaborate directly with silicon architects and compiler teams to push the boundaries of AI acceleration
Drive performance benchmarking and tuning that directly impacts millions of inference calls globally
Spearhead distributed inference architecture for PyTorch and JAX using XLA
Engineer breakthrough performance optimizations for AWS Trainium and Inferentia
Develop ML tools to enhance LLM accuracy and efficiency
Transform complex tensor operations into highly optimized hardware implementations
Pioneer benchmarking methodologies that shape next-gen AI accelerator design
Qualification
Required
5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
5+ years of programming experience using Python or C++ and PyTorch
Experience with AI acceleration via quantization, parallelism, model compression, batching, KV caching, vllm serving
Experience with accuracy debugging & tooling, performance benchmarking of AI accelerators
Fundamentals of Machine learning and deep learning models, their architecture, training and inference lifecycles along with work experience on optimizations for improving the model execution
Preferred
Master's degree in computer science or equivalent
Master's degree in machine learning or equivalent
Experience with accuracy debugging & tooling, performance benchmarking of AI accelerators
Experience in developing CUDA kernels, HPC and inference optimization, tensors operations
Benefits
Flexibility in working hours
Full range of medical, financial, and/or other benefits
Company
Amazon Web Services (AWS)
Launched in 2006, Amazon Web Services (AWS) began exposing key infrastructure services to businesses in the form of web services -- now widely known as cloud computing.
H1B Sponsorship
Amazon Web Services (AWS) has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)
Funding
Current Stage
Late StageTotal Funding
unknownKey Investors
BIRD Foundation
2025-01-22Grant
Leadership Team
Recent News
MarketScreener
2026-01-06
2026-01-06
2026-01-06
Company data provided by crunchbase