Senior Performance Engineer – AI Platforms jobs in United States
cer-icon
Apply on Employer Site
company-logo

Red Hat · 1 day ago

Senior Performance Engineer – AI Platforms

Red Hat is the world’s leading provider of enterprise open source software solutions, and they are seeking a Senior Performance Engineer to join their Performance and Scale Engineering team. The role involves driving the performance and scalability of distributed inference for Large Language Models (LLMs) and collaborating with cross-functional teams to enhance AI workloads.

Enterprise SoftwareInsurTechLinuxOpen SourceOperating SystemsSoftware
check
Culture & Values
check
H1B Sponsor Likelynote

Responsibilities

Define and track key performance indicators (KPIs) and service level objectives (SLOs) for large-scale, LLM inference services
Formulate and execute performance benchmarks utilizing tools like vLLM, GuideLLM, and PyTorch Profiler and other related tools to characterize performance, drive improvements, and detect issues through data analysis and visualization
Develop and maintain tools, scripts, and automated solutions that streamline performance benchmarking and AI model profiling tasks
Collaborate closely with cross-functional engineering teams to identify and address critical performance bottlenecks within the architecture and inference stacks
Partner with DevOps to bake performance gates into GitHub Actions/RHAIIS Pipelines
Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling
Triage field and customer escalations related to performance; distill findings into upstream issues and product backlog items
Publish results, recommendations, and best practices through internal reports, presentations, external blogs, technical papers, and official documentation
Represent the team at internal and external conferences, presenting key findings and strategies

Qualification

Performance engineeringDistributed systemsPythonPerformance benchmarkingAI fundamentalsBash/Linux skillsOpen-source commitmentCommunication skills

Required

5+ years of experience in performance engineering or systems-level software design
Hands-on experience with operating systems, distributed systems, or system-level performance tooling
Understanding of AI and LLM fundamentals
Fluency in Python (data & ML) and strong Bash/Linux skills
Knowledge of performance benchmarking and profiling for LLMs
Exceptional communication skills—able to translate raw performance data into customer value and executive narratives
Commitment to open-source values

Preferred

Master's or PhD in Computer Science, AI, or a related field
History of upstream contributions and community leadership
Experience publishing blogs or technical papers
Hands-on experience with any of the following Kubernetes/OpenShift/RHAIIS/RHELAI
Familiarity with performance observability stacks such as perf/eBPF tools, Nsight Systems, PyTorch Profiler, among others
Hands-on experience with modern LLM inference server stacks (e.g., vLLM, TensorRT-LLM, TGI, Triton Inference Server)

Benefits

Comprehensive medical, dental, and vision coverage
Flexible Spending Account - healthcare and dependent care
Health Savings Account - high deductible medical plan
Retirement 401(k) with employer match
Paid time off and holidays
Paid parental leave plans for all new parents
Leave benefits including disability, paid family medical leave, and paid military leave
Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!

Company

Red Hat is a software company that offers enterprise open-source software solutions. It is a sub-organization of IBM.

H1B Sponsorship

Red Hat has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (159)
2024 (148)
2023 (156)
2022 (181)
2021 (154)
2020 (106)

Funding

Current Stage
Public Company
Total Funding
unknown
2018-10-28Acquired
1999-08-20IPO
1999-03-09Corporate Round

Leadership Team

leader-logo
Chris Wright
Chief Technology Officer and Senior Vice President Global Engineering
linkedin
leader-logo
Mark Little
CTO JBoss
linkedin
Company data provided by crunchbase