Job Title:ย Senior HPC Infrastructure Engineer
Primary Location:ย Chicagoland, Hybrid with minimum of 2ย days in-office.
Position Type:ย 12 mos. Contract with Contract-to-Hire potential.
Overview
client is casting a line for aย Sr. HPC Infrastructure Engineer! This is a 12 mos.ย contractย with potential to convert to FTE opportunity. This position plays a key role in supporting the design, deployment, and optimization of high-performance computing (HPC) infrastructureโ both on-prem and in cloud environments. This role combines deep technical system expertise with hands-on administration to ensure scalable, reliable, and secure environments for advanced scientific research and computational workloads.
What You Bring to the Role (Ideal Experience)
โข Strong background in Linux/Unix system administration
โข Experience designing and supporting HPC clusters in research, academic, or scientific computing environments
โข Proficiency with parallel computing frameworks such as MPI and OpenMP
โข Familiarity with job scheduling/resource management systems (e.g., Slurm, Torque, PBS)
โข Hands-on experience with high-speed interconnects (e.g., InfiniBand, Omni-Path)
โข Strong understanding of networking, storage solutions, and system performance tuning
โข Experience with backup, disaster recovery, and data integrity solutions in high-performance environments
โข Fluency in scripting (e.g., Bash, Python)
โข Strong troubleshooting skills and collaborative communication style
โข Bachelor's degree in Computer Science, Engineering, or equivalent experience (Master's preferred)
โข Relevant technical certifications (e.g., Red Hat, CompTIA Linux+) are a plus
What You'll Do (Skills Used in this Position)
โข Design, deploy, and manage scalable HPC systems across both cloud and on-prem environments
โข Define system requirements and optimize Linux-based systems for performance, reliability, and scalability
โข Maintain, monitor, and patch HPC environments to ensure high availability and security
โข Design and manage high-performance storage systems with robust backup, replication, and archival strategies
โข Conduct benchmarking and performance tuning, collaborating with HPC operations to resolve bottlenecks
โข Partner with cybersecurity teams to ensure compliance and security in HPC environments
โข Maintain technical documentation, SOPs, and troubleshooting guides
โข Provide end-user training and technical support, managing on-site computing technologies
โข Contribute to overall operational efficiency through team collaboration and continual improvement initiatives
If you are interested or have any references please share resume at mukul@brightmindsol.com.