Senior HPC DevOps Engineer
As a Senior HPC DevOps Engineer, you will design, implement, and maintain large-scale HPC and AI clusters, with a focus on state-of-the-art monitoring, logging, and alerting systems. You will work closely with HPC, OS, GPU compute, and systems specialists to develop and optimize performance platforms, automate deployment processes, and troubleshoot complex issues across the stack, from bare metal to the application level.