Echo IT Solutions · 5 months ago
Site Reliability Engineer
Echo IT Solutions is seeking a dynamic and accomplished Site Reliability Engineer to solve complex reliability challenges in high-impact environments. The role involves owning the reliability of application processes, designing monitoring solutions, and collaborating with development and operations teams to ensure system performance and reliability.
Analytics · Artificial Intelligence (AI) · Cloud Computing · Consulting · Information Technology · Machine Learning · Web Development
Responsibilities
Own the reliability of all processes within the application workflow by designing and implementing proactive solutions that detect and resolve issues before they affect end users
Design and implement monitoring and alerting solutions using tools such as Splunk, ServiceNow (SNOW), xMatters, Dynatrace, and AppDynamics
Assess current monitoring implementations, collaborate with owners of different components across the application ecosystem, and identify improvements so that issues are detected and the appropriate teams are alerted in a timely manner
Explore and implement self-healing agents to automate recurring production support tasks and reduce manual intervention
Use Dynatrace, Splunk, or AppDynamics to identify performance issues and failures, and collaborate with production support, developers, sysadmins, and database administrators to diagnose and fix them
Develop and manage AI agents using Agentic AI frameworks, ensuring they align with operational goals
Collaborate with development and operations teams to ensure system reliability and performance
Participate in incident response, root cause analysis, and continuous improvement initiatives
Qualifications
Required
Proven experience in .NET development, debugging, and production support
Extensive knowledge of and hands-on experience with monitoring tools: Splunk, Dynatrace, AppDynamics (optional)
Strong database troubleshooting skills in SQL Server and Oracle
Experience with Agentic AI: creating, deploying, and managing intelligent agents
Solid understanding of SRE principles, including reliability, scalability, and observability
Prior experience supporting healthcare clients and familiarity with industry standards and compliance
Excellent problem-solving and communication skills
Familiarity with HIPAA and other healthcare compliance frameworks
Application Performance Monitoring & Alerting: Splunk, Dynatrace, AppDynamics, xMatters, ServiceNow
Programming Languages: C#, .NET, PowerShell, SQL, Python (for automation and AI integration)
Databases: SQL Server, Oracle
AI & Automation: Agentic AI frameworks, self-healing systems, intelligent agents
DevOps Tools: Jenkins, Git, Azure DevOps, CI/CD pipelines
Company
Echo IT Solutions
Echo IT Solutions provides IT consulting, managed services, cloud, cybersecurity, data, and custom software development.
H1B Sponsorship
Echo IT Solutions has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (17)
2024 (16)
2023 (7)
2022 (16)
2021 (3)
Funding
Current Stage
Growth Stage
Company data provided by Crunchbase