Sustainable Talent · 2 hours ago
Systems Engineering Technician
Sustainable Talent is partnering with Nvidia, a global leader in computer graphics and accelerated computing. Nvidia is seeking a Systems Engineering Technician to support its on-premises private cloud infrastructure team by maintaining a compute farm of systems used to test Nvidia hardware and software.
Consulting, Human Resources, Information Technology
Responsibilities
Collaborate closely with engineering teams (system architects, hardware/software engineers, QA, and more) to craft, develop, debug, and release next-generation products
Manage and maintain a high-performing farm of builders, packagers, testers, and core infrastructure
Ensure availability targets are consistently met and lead system recovery efforts
Deploy and qualify systems while supporting exciting new technology bring-ups
Coordinate inventory and lifecycle management tasks across labs and data centers
Maintain a world-class, safe, and well-organized environment
Fix software, hardware, and infrastructure issues alongside engineers and platform operations teams
Plan, deploy, and maintain on-premises infrastructure, collaborating with datacenter and network engineering teams
Implement efficiency improvements that raise availability, throughput, and test accuracy while meeting SLAs and key metrics
Represent the team in meetings with internal collaborators and contribute to global operations
Qualifications
Required
Associate's or Bachelor's Degree in Engineering/Technical Major (or equivalent experience)
Proven experience in data centers or large engineering labs
Familiarity with SCMs like Git/Perforce
Proficiency in DCIM (Nautobot, etc.) and scripting (shell, Python, Ansible)
Working knowledge of protocols/services like TCP/IP, DNS, NFS, SSL, etc.
Experience with Windows, Linux, and Mac operating systems
Hands-on experience with PCBs, GPUs, and system deployments
Outstanding communication skills, both written and verbal
Ability to explain technical concepts to non-technical audiences
Strong problem-solving skills and a collaborative spirit
Preferred
Experience managing HPC clusters using tools like BCM and Slurm
Hands-on knowledge of OpenStack
Relevant certifications such as CCNA or equivalent
Strong background in Windows and Linux administration, with an understanding of dense datacenter design, including compute, storage, and networking
Experience with hypervisors and VM applications
Knowledge of DC infrastructure with an emphasis on liquid cooling
A track record of technical curiosity and innovation
Mechanically inclined and comfortable with tools and physical tasks
Upbeat and hardworking, with a highly developed ability to propel the team to the finish line
Ready to focus and commit to completing tasks efficiently
Benefits
Full benefits
PTO
Amazing company culture
Company
Sustainable Talent
Sustainable Talent provides staffing, consulting, and outsourcing services.