1

Linux Hpc System Administrator Jobs (NOW HIRING)

The ideal candidate combines strong Linux systems expertise, HPC workload management experience ... Administer, configure, and optimize HPC job scheduling environments, including IBM Spectrum LSF ...

The Linux/HPC Engineer will primarily work on a small team of HPC Systems Administrators responsible for the installation and operational support of an HPC cluster located in Phoenix, Arizona.

The Linux/HPC Engineer will primarily work on a small team of HPC Systems Administrators responsible for the installation and operational support of an HPC cluster located in Phoenix, Arizona.

We are looking for an HPC System Administrator to join us. The HPC System Administrator will ... Required Skills: * 7 or more years of Linux systems administration, preferably in a Red Hat and/or ...

$180K - $220K/yr

System Administrators (HPC), must provide High Performance Computing (HPC) services in the form of ... Configure and manage Linux, Unix, and Windows (or other applicable) operating systems and installs ...

next page

Showing results 1-20

Linux Hpc System Administrator information

See salary details

$20

$51

$75

How much do linux hpc system administrator jobs pay per hour?

As of Jun 5, 2026, the average hourly pay for linux hpc system administrator in the United States is $51.96, according to ZipRecruiter salary data. Most workers in this role earn between $41.11 and $62.50 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Linux HPC System Administrator, and why are they important?

To thrive as a Linux HPC System Administrator, you need expertise in Linux system administration, networking, and parallel computing environments, typically supported by a degree in computer science or a related field. Familiarity with HPC workload managers (such as Slurm or PBS), scripting languages (like Bash or Python), and monitoring tools is commonly required. Strong problem-solving, communication, and time-management skills help administrators effectively support users and respond to issues. These competencies ensure the reliability, performance, and security of high-performance computing clusters vital for research and enterprise operations.

What are some common challenges faced by Linux HPC System Administrators, and how can I prepare for them?

Linux HPC System Administrators often face challenges such as managing large-scale clusters, ensuring system stability, and maintaining high performance for compute-intensive workloads. You'll regularly troubleshoot hardware and software issues, coordinate scheduled downtimes for maintenance, and implement security updates without disrupting users. Preparing by gaining experience with cluster management tools (like Slurm or PBS), scripting for automation, and staying up-to-date with the latest in HPC technologies will help you succeed in this dynamic environment.

What is a Linux HPC System Administrator?

A Linux HPC (High Performance Computing) System Administrator is a professional responsible for managing, maintaining, and optimizing large-scale computing clusters that run on Linux operating systems. These systems are used for scientific research, engineering, and data-intensive applications that require significant computational power. The role involves installing and configuring hardware and software, monitoring system performance, troubleshooting issues, and ensuring security and data integrity. Additionally, Linux HPC System Administrators often support researchers and users by managing job scheduling systems, resource allocation, and software updates.

What is the difference between Linux Hpc System Administrator vs Linux Server Administrator?

AspectLinux Hpc System AdministratorLinux Server Administrator
CredentialsLinux certifications (e.g., RHCE), HPC-specific trainingLinux certifications, server management courses
Work EnvironmentHigh-performance computing clusters, research labsData centers, enterprise server rooms
Employer & IndustryResearch institutions, scientific organizationsBusinesses, IT service providers
Common Search/ComparisonFocus on HPC, parallel computing, cluster managementFocus on server setup, maintenance, and virtualization

The main difference is that Linux Hpc System Administrators specialize in managing high-performance computing clusters used for scientific and research purposes, while Linux Server Administrators focus on maintaining and supporting enterprise servers in data centers. Both roles require Linux expertise and certifications but serve different operational environments and industry needs.

More about Linux Hpc System Administrator jobs
What states have the most Linux Hpc System Administrator jobs? States with the most job openings for Linux Hpc System Administrator jobs include:
What job categories do people searching Linux Hpc System Administrator jobs look for? The top searched job categories for Linux Hpc System Administrator jobs are:
CAE HPC System Administrator

CAE HPC System Administrator

Prolim Global

Saline, MI • On-site

Contractor

Posted 10 days ago


Job description

CAE HPC System Administrator
Saline, Michigan (Hybrid)

Description

We are seeking a highly skilled CAE HPC Systems Administrator to manage, optimize, and support enterprise-level High-Performance Computing (HPC) environments dedicated to Computer-Aided Engineering (CAE) workloads.

This role is responsible for ensuring system stability, scalability, and performance of HPC clusters while supporting CAE applications, job scheduling systems, and underlying Linux infrastructure. The ideal candidate combines strong Linux systems expertise, HPC workload management experience, and a solid understanding of CAE engineering environments.

Key Responsibilities:

  1. HPC Job Queuing & Workload Management
    • Administer, configure, and optimize HPC job scheduling environments, including IBM Spectrum LSF, Open PBS,  or equivalent schedulers.
    • Design and tune job queues, resource allocation policies, and scheduling strategies to support diverse CAE workloads.
    • Monitor system performance and utilization trends and implement improvements to maximize efficiency and throughput.
  2. CAE Application and Licensing Support
    • Install, upgrade, test, and support CAE applications and simulation tools in production environments.
    • Provide integration support between CAE applications and HPC scheduling systems.
    • Manage CAE software licensing systems (e.g., FlexLM, RLM) and ensure availability.
    • Troubleshoot application-related issues and ensure minimal disruption to engineering activities.
  3. Linux Systems Administration & Automation
    • Administer and maintain Red Hat Enterprise Linux (RHEL) environments across HPC clusters.
    • Perform OS provisioning, deployment, and patch management using automated tools (e.g., PXE, or configuration management solutions).
    • Develop and maintain scripts (Bash, Korn shell, C Shell, Perl, Awk, or equivalent) to automate system monitoring, health checks, and routine administrative tasks.

o   Maintain system logs, monitoring processes, and standard operating procedures.

  1. Hardware & Infrastructure Management
    • Troubleshoot and resolve issues related to servers, storage systems, and high-performance networking (e.g., InfiniBand, high-speed Ethernet).
    • Support hardware lifecycle activities including installation, maintenance, and upgrades.
    • Conduct capacity planning based on system utilization trends and future demand.
  2. Operations, Monitoring & Continuous Improvement
    • Perform system health checks, monitoring, and incident tracking for HPC and CAE environments.
    • Document system configurations, procedures, incidents, and best practices.
    • Track outages, analyze root causes, and implement preventive measures.
    • Follow change management processes for system updates and deployments.
    • Provide accurate reporting (e.g., utilization, incidents, system performance) and support project initiatives.

Requirements

• 3+ years of Linux system administration experience (preferably RHEL environments).

• Hands-on experience managing HPC clusters and job schedulers (LSF, Slurm, PBS, or similar).

• Proven experience in CAE application support and integration.

• Strong scripting skills (Bash, Shell, Perl, or equivalent).

• Experience with OS deployment, patching, and system automation.

• Solid understanding of enterprise server hardware, storage, and networking fundamentals.

• Experience with CAE tools such as Ansys, LS-DYNA, Nastran, or similar.

• Familiarity with high-performance networking technologies is plus (e.g., InfiniBand).

• Experience developing internal tools or dashboards are plus (e.g., PHP or web-based tooling).

Position Type / Expected Hours

• Hybrid Full-time: Standard business hours with flexibility required to support maintenance windows and critical production issues.

• Occasional after-hours or weekend work may be required based on business needs.