IMCS Group · 4 weeks ago
Consultant - Infrastructure Management | DevOps | Continuous delivery - Environment management and provisioning
IMCS Group is one of the fastest growing MWBE staffing firms in the U.S., focusing on diversity recruitment for Fortune 500 companies. They are seeking a skilled HPC Slurm Administrator to manage and support high-performance computing environments, ensuring high availability and reliability of HPC clusters.
Staffing & Recruiting
Responsibilities
Administer and maintain HPC clusters using Slurm
Monitor system performance and ensure high availability and reliability
Troubleshoot and resolve issues related to job scheduling, compute nodes, and storage
Manage user accounts, permissions, and security policies
Automate administrative tasks using scripting languages (e.g., Bash, Python)
Collaborate with engineering and research teams to support compute-intensive workloads
Document system configurations, procedures, and operational changes
Participate in upgrades, patching, and scaling of HPC infrastructure
Qualifications
Required
Experience in Linux system administration, preferably in HPC environments
Strong expertise with Slurm workload manager
Proficiency in Bash, Python, or other scripting languages
Familiarity with parallel file systems and high-speed networking (e.g., InfiniBand)
Experience with configuration management tools (e.g., Ansible, Puppet)
Minimum experience: 3+ years
Company
IMCS Group
IMCS Group is an IT, healthcare, and professional staffing company that helps enterprises optimize the business value of their staffing investments and achieve world-class business performance.