Ampstek · 2 hours ago
Site Reliability Engineer (Only W2)
Ampstek is seeking a Site Reliability Engineer to join its team. The role involves monitoring system health, managing incidents, and collaborating with development and infrastructure teams to ensure the reliability and scalability of systems.
Responsibilities
Proactively monitor system health and performance using monitoring and observability tools
Manage and resolve critical, high-severity incidents end-to-end with minimal downtime
Collaborate with development and infrastructure teams to ensure reliability and scalability
Contribute to continuous improvement of system reliability, monitoring coverage, and alerting accuracy
Drive automation and efficiency in incident response and post-incident reviews
Hands-on experience with scripting languages (shell, Python, etc.)
Support and guide the migration of legacy applications to cloud platforms
Working knowledge of Grafana dashboards
Qualifications
Required
Experience: 10 years
Splunk, Dynatrace, AppDynamics, ThousandEyes, etc.
Experience in Site Reliability Engineering, including production support roles
Hands-on expertise in Splunk, Dynatrace, and AppDynamics
Knowledge of the ThousandEyes monitoring tool is a plus
Proven track record of handling critical production issues independently
Strong understanding of cloud migration strategies, tools, and processes
Ability to work effectively in high-pressure environments and cross-functional teams
Excellent troubleshooting, communication, and analytical skills
Preferred
Background in DevOps, Platform, SRE, or Build & Release