Caterpillar Inc. · 11 hours ago
Senior Site Reliability Engineer
Caterpillar Inc. is a global team dedicated to building stronger, sustainable communities through innovation. The Site Reliability Engineer will ensure the reliability, availability, and performance of D365 ERP systems while collaborating with cross-functional teams to enhance system stability and service delivery.
Construction · Machinery Manufacturing · Manufacturing · Mechanical Engineering
Responsibilities
Monitor and troubleshoot production and QA systems to identify and resolve performance, scalability, and reliability issues proactively
Participate in the on-call rotation to provide 24/7 critical incident support for eCommerce platform systems
Design, implement, and maintain automation and tooling to streamline deployments and releases
Collaborate with cross-functional teams to define, document, and implement operational processes, best practices, and procedures
Implement and maintain system monitoring tools and dashboards to provide real-time insights into system performance and identify potential issues
Work closely with developers to identify and fix bugs and performance bottlenecks in the application code
Ensure that systems and infrastructure comply with security, compliance, and regulatory requirements
Continuously evaluate systems and processes to identify areas for improvement and implement changes as needed
Qualifications
Required
Effective Communications: Strong understanding of communication concepts, tools and techniques; ability to effectively transmit, receive, and accurately interpret ideas, information, and needs through the application of appropriate communication behaviors
Technical Troubleshooting: Extensive knowledge of technical troubleshooting approaches, tools and techniques; ability to anticipate, recognize, and resolve technical issues in hardware, software, applications, or operations
Performance Measurement and Tuning: Knowledge of system performance, testing and programming; ability to monitor, measure, and optimize system performance and network communication
Software Release Management: Knowledge of strategies, practices and tools for managing versions and distribution of software products and enhancements; ability to evaluate and improve release management practices and tools
Software Reliability Management: Knowledge of software reliability management; ability to develop and use principles, methodologies and metrics that increase software product performance and reliability
Bachelor's degree in Computer Science, Information Technology, a related field, or equivalent experience
6+ years of experience in site reliability engineering, DevOps, QA, or a related field
Strong experience with Microsoft D365 or general Azure-based services
Experience with AWS infrastructure and services
Experience with IaC solutions like CloudFormation and Terraform
Experience with CI/CD solutions such as GitHub and Azure DevOps
Strong troubleshooting and critical thinking skills
6+ years of experience and proficiency in one or more programming languages, such as Python or JavaScript (both preferred)
Solid understanding of networking, load balancing, on-premises hosting solutions, and web application architectures
Experience with containerization technologies, such as Docker and Kubernetes
Excellent problem-solving skills and a strong attention to detail
Strong IT and business communication skills and ability to collaborate effectively with cross-functional teams
Benefits
Medical, dental, and vision benefits
Paid time off plan (Vacation, Holidays, Volunteer, etc.)
401(k) savings plans
Health Savings Account (HSA)
Flexible Spending Accounts (FSAs)
Healthy Lifestyle Programs
Employee Assistance Program
Voluntary Benefits and Employee Discounts
Career Development
Incentive bonus
Disability benefits
Life Insurance
Parental leave
Adoption benefits
Tuition Reimbursement
Company
Caterpillar Inc.
For 100 years, we’ve been helping customers build a better, more sustainable world.
Funding
Current Stage: Public Company
Total Funding: $3.51B
Key Investors: US Department of Energy, Advanced Propulsion Centre UK
2025-08-28 · Post-IPO Debt · $3.5B
2024-10-31 · Grant · $5.04M
2019-06-23 · Grant
Company data provided by Crunchbase