Linux High Performance Computing (HPC) System Administrator
Administration of Linux HPC clusters (in the local datacenter and in the AWS cloud) and of related Linux workstations
Coordination of software and hardware upgrades for HPC clusters
Troubleshooting of software, configuration and hardware issues, including coordination with suppliers
Monitoring
Continuous improvement of the setup; analysis and implementation of user requests
Technical
Good knowledge of HPC as applied to CAE, AI, machine learning and rendering
(OS: Red Hat Linux | Storage: NetApp, DDN, Dell, HP | Filesystems: Lustre, BeeGFS, Ceph, GPFS | Schedulers: Slurm, PBS/Torque, SGE) (Bright Cluster Manager, HPC networking)
Ability to provide support in the datacenter (installation and maintenance of servers, disks, Ethernet and InfiniBand connectivity)
Knowledge of HPC on the AWS cloud (Terraform, OpenStack)
Knowledge of Linux workstation administration
Basic project management skills (scheduling, cost analysis, project documentation, reporting)
Soft skills
Language: English (fluent)
Good communication with colleagues, users and management
Willingness to learn, to challenge existing technology and to make proposals for the future evolution of the clusters
Autonomous