1

Hpc System Engineer Jobs (NOW HIRING)

HPC Systems Engineer Contract: 3 to 6 Months + Possibility to extend Location: Remote, USA Required Experience: The HPC Systems Engineer supports storage, networking, GPU systems, and compute ...

AI/HPC System Engineer Office Location: San Jose, CA Job Type: Full-Time Work Model: Onsite About SK hynix America At SK hynix America, we're at the forefront of semiconductor innovation, developing ...

AI/HPC System Engineer Office Location: San Jose, CA Job Type: Full-Time Work Model: Onsite About SK hynix America At SK hynix America, we're at the forefront of semiconductor innovation, developing ...

Transforming the Future with the Convergence of Simulation and Data Systems Engineer - HPC Do you like a challenge, are you a complex thinker who likes to solve problems? If so, then you might be the ...

Transforming the Future with the Convergence of Simulation and Data Systems Engineer - HPC Do you like a challenge, are you a complex thinker who likes to solve problems? If so, then you might be the ...

Transforming the Future with the Convergence of Simulation and Data Systems Engineer - HPC Do you like a challenge, are you a complex thinker who likes to solve problems? If so, then you might be the ...

As an AI/HPC System Performance Engineer on the Network Infrastructure Engineering team, you will drive end-to-end performance characterization, bottleneck analysis, and optimization of large-scale ...

RedLine Performance Solutions (RedLine) has been in the HPC solutions engineering services business ... We are looking for an HPC System Administrator to join us. The HPC System Administrator will ...

Analyzes system requirements and leads design and development activities. Guides users in ... Provides system engineering services and support HPC System Design & Engineering organization in ...

next page

Showing results 1-20

Hpc System Engineer information

See salary details

$53.5K

$127.2K

$167K

How much do hpc system engineer jobs pay per year?

As of May 28, 2026, the average yearly pay for hpc system engineer in the United States is $127,215.00, according to ZipRecruiter salary data. Most workers in this role earn between $98,000.00 and $157,000.00 per year, depending on experience, location, and employer.

What is an HPC System Engineer job?

An HPC (High-Performance Computing) System Engineer designs, deploys, and manages supercomputing environments used for complex computations. They optimize hardware and software components, ensuring system performance, scalability, and reliability. Responsibilities include configuring clusters, troubleshooting performance issues, and maintaining parallel file systems. They work with researchers and developers to optimize code for maximum efficiency. Strong knowledge of Linux, networking, and parallel computing is essential for this role.

What are the key skills and qualifications needed to thrive in the Hpc System Engineer position, and why are they important?

Excelling as an HPC System Engineer requires strong expertise in Linux systems administration, parallel computing, and networking, often supported by a degree in computer science or a related field. Familiarity with HPC resource managers (such as Slurm or PBS), file systems like Lustre or GPFS, and certifications like CompTIA Linux+ or RHCE are highly valuable. Effective problem-solving, teamwork, and communication skills help engineers address complex technical issues and interact with diverse research and engineering teams. These competencies are essential to ensure optimized system performance and support for high-demand computational workloads.

What are some common challenges faced by HPC System Engineers?

HPC System Engineers often encounter challenges related to managing large-scale clusters, troubleshooting performance bottlenecks, and ensuring system reliability under demanding workloads. Keeping up with evolving hardware, software updates, and security requirements is also a key part of the job. The role frequently involves responding to urgent issues, supporting a variety of users with different computational needs, and balancing maintenance with ongoing project deadlines. Successfully navigating these challenges requires both strong technical troubleshooting skills and the ability to communicate solutions effectively with researchers and IT peers.
What cities are hiring for Hpc System Engineer jobs? Cities with the most Hpc System Engineer job openings:
What are the most commonly searched types of Hpc System Engineer jobs? The most popular types of Hpc System Engineer jobs are:
What job categories do people searching Hpc System Engineer jobs look for? The top searched job categories for Hpc System Engineer jobs are:
HPC System Engineer

HPC System Engineer

3B Staffing LLC

Murphy, TX โ€ข On-site

Full-time

This job post hasย expired today.ย Applications are no longer accepted.


Job description

Role: HPC Systems Engineer
Contract: 3 to 6 Months + Possibility to extend
Location: Remote, USA
Required Experience:
The HPC Systems Engineer supports storage, networking, GPU systems, and compute environments, ensuring system performance, availability, and reliability while troubleshooting issues and supporting users. Storage Administration (NetApp)
  • Administer NetApp storage systems (volumes, aggregates, qtrees, snapshots)
  • Manage replication technologies (SnapMirror, SnapVault)
  • Monitor storage performance (I/O, latency, capacity) and report on trends
  • Troubleshoot storage issues impacting HPC workloads
  • Maintain backup, recovery, and data protection policies Network Administration (Arista)
  • Configure and maintain Arista switches within HPC environments
  • Manage VLANs, ACLs, and link aggregation
  • Support network documentation, topology diagrams, and change management NVIDIA DGX & GPU Systems
  • Support NVIDIA DGX systems including health checks, driver updates, and OS maintenance
  • Monitor GPU utilization, thermal performance, and interconnects (DCGM, nvidia-smi)
  • Troubleshoot and escalate hardware or performance issues HPC Operations
  • Perform system health checks, patching, and firmware updates on HPE servers
  • Support HPC schedulers such as Slurm or PBS (queue monitoring, job troubleshooting)
  • Maintain documentation, runbooks, and operational logs Requirements:
  • 3-5 years of Linux systems administration or HPC infrastructure experience
  • Experience supporting GPU-based systems (NVIDIA preferred)
  • Strong command-line troubleshooting across distributed systems
  • Solid communication and documentation skills
  • Preferred: Advanced Linux experience (7-10+ years)
  • Preferred: Experience with Slurm or similar schedulers
  • Preferred: Exposure to HPCM or parallel file systems