Sr. Software Engineer- AI/ML, AWS Neuron Apps jobs in United States
cer-icon
Apply on Employer Site
company-logo

Amazon Web Services (AWS) ยท 2 days ago

Sr. Software Engineer- AI/ML, AWS Neuron Apps

Amazon Web Services (AWS) is at the forefront of AI technology with its AWS Neuron team, which focuses on deploying and optimizing sophisticated AI models. As a Senior Software Engineer, you will pioneer distributed inference solutions and collaborate with silicon architects to enhance AI performance and efficiency.

ConsultingDevOpsInformation TechnologySoftwareWeb Development
check
H1B Sponsor Likelynote

Responsibilities

Pioneer distributed inference solutions for industry-leading LLMs such as GPT, Llama, Qwen
Optimize breakthrough language and vision generative AI models
Collaborate directly with silicon architects and compiler teams to push the boundaries of AI acceleration
Drive performance benchmarking and tuning that directly impacts millions of inference calls globally
Spearhead distributed inference architecture for PyTorch and JAX using XLA
Engineer breakthrough performance optimizations for AWS Trainium and Inferentia
Develop ML tools to enhance LLM accuracy and efficiency
Transform complex tensor operations into highly optimized hardware implementations
Pioneer benchmarking methodologies that shape next-gen AI accelerator design

Qualification

PythonPyTorchMachine LearningC++Distributed SystemsPerformance BenchmarkingCollaborationMentorshipProblem Solving

Required

5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
5+ years of programming experience using Python or C++ and PyTorch
Experience with AI acceleration via quantization, parallelism, model compression, batching, KV caching, vllm serving
Experience with accuracy debugging & tooling, performance benchmarking of AI accelerators
Fundamentals of Machine learning and deep learning models, their architecture, training and inference lifecycles along with work experience on optimizations for improving the model execution

Preferred

Master's degree in computer science or equivalent
Master's degree in machine learning or equivalent
Experience with accuracy debugging & tooling, performance benchmarking of AI accelerators
Experience in developing CUDA kernels, HPC and inference optimization, tensors operations

Benefits

Flexibility in working hours
Full range of medical, financial, and/or other benefits

Company

Amazon Web Services (AWS)

company-logo
Launched in 2006, Amazon Web Services (AWS) began exposing key infrastructure services to businesses in the form of web services -- now widely known as cloud computing.

H1B Sponsorship

Amazon Web Services (AWS) has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22803)
2024 (21175)
2023 (19057)
2022 (24088)
2021 (12233)
2020 (14881)

Funding

Current Stage
Late Stage
Total Funding
unknown
Key Investors
BIRD Foundation
2025-01-22Grant

Leadership Team

leader-logo
Matt Garman
Chief Executive Officer
linkedin
leader-logo
Anand Desikan
CTO, CXO Advisor, and Enterprise Technologist
linkedin
Company data provided by crunchbase