Lightedge · 2 months ago
Monitoring Engineer
LightEdge Solutions is developing IT solutions to propel businesses forward over the next decade. As a Monitoring Engineer, you will be responsible for the reliable operation of the organization’s systems, focusing on designing and implementing monitoring solutions to enhance system performance and availability.
Cloud ManagementInformation TechnologyInternetWireless
Responsibilities
Design and implement monitoring solutions to track the performance, availability, and health of various systems and services
Establish robust monitoring frameworks, set up alerts, and analyze system metrics to identify and resolve issues proactively
Establish and align metrics, including SLAs, SLOs, and SLIs, to closely tie system performance to business objectives, ensuring that the site reliability engineering efforts support the overall goals and customer satisfaction
Utilize AIOPS techniques to leverage automation in Incident Management and Response
Develop and maintain automated incident response systems that can detect and mitigate issues automatically
This includes automated incident triaging, remediation, and escalation workflows to minimize manual intervention and improve response times
Leverage the IT Service Management (ITSM) platform’s capabilities to integrate monitoring into incident management, change management, and other operational processes, enhancing the efficiency and effectiveness of site reliability engineering practices
Working closely with IT functional owners & SME’s
Perform implementation, monitoring system administration and integration functions
Tasks will consist of developing detailed designs, execution and troubleshooting of strategic solutions in support of effective monitoring, alerting, escalation, automation, reporting and event correlation
Qualification
Required
5 years hands-on experience with enterprise monitoring solutions
Must possess knowledge of Network Switches, Server hardware, Storage, and Virtualization Technologies
Understanding of VMware Infrastructure
Experience working with variety of monitoring systems such as Zabbix, vRealize Operations Manager, Nagios and Science Logic
Experience and proficiency in integrating with ServiceNow or similar IT service management platforms
Experience with managing automations within a monitoring environment
Ability to provide guidance with design, maintenance, and improvements to enterprise level monitoring solutions
Excellent verbal and written communication skills, ability to present complex ideas and designs to a variety of technical or non-technical stakeholders
Experience with design, implementation, and support of monitoring tools in a complex, multi-platform environment
High level of understanding monitoring requirements for Storage, Network, and Compute servers
Applicants must be authorized to work in the United States without the need for visa sponsorship now or in the future
Company
Lightedge
Lightedge delivers technology solutions that bring the operational benefits of the public cloud to enterprise IT workloads.
Funding
Current Stage
Growth StageTotal Funding
$5M2021-09-01Acquired
2004-04-23Private Equity· $5M
Recent News
2024-05-24
Company data provided by crunchbase