TechStar Group · 1 day ago
ML Support Engineer
TechStar Group is hiring for the position of ML Support Engineer. The role involves providing end-to-end support for various ML platforms, ensuring optimal performance and adherence to operational SLAs.
Responsibilities
Own end‑to‑end support for Domino Data Lab, GCP Dataproc, Galileo, and adjacent ML platforms
Perform installation, upgrades, configuration, patching, and environment maintenance
Monitor cluster health, resource utilization, job execution, performance, and alerts
Troubleshoot ML workloads involving Spark, Python, R, GPUs, containers, and orchestrators, working from Jira tickets and within the applicable SLAs
Manage access, security policies, service accounts, and platform governance
Ensure high availability, optimal performance, and adherence to operational SLAs