2

Remote Hpc System Engineer Jobs in Arizona (NOW HIRING)

The Linux/HPC Engineer will primarily work on a small team of HPC Systems Administrators ... The position can be remote, but will be required to support the normal business hours for the ...

Position Overview This role provides first-contact remote technical support for network, server ... system-impacting incidents and accurately documenting all actions in Thrive's tools. * Perform ...

next page

Showing results 1-20

Remote Hpc System Engineer information

What are the key skills and qualifications needed to thrive as a Remote HPC System Engineer, and why are they important?

To thrive as a Remote HPC System Engineer, you need expertise in Linux system administration, parallel computing, networking, and a degree in computer science or related field. Familiarity with job schedulers (like Slurm), cluster management tools, scripting languages (such as Python or Bash), and certifications like CompTIA Linux+ or Red Hat Certified Engineer are highly valuable. Strong problem-solving abilities, effective communication, and self-motivation are essential soft skills for remote collaboration and troubleshooting. These skills ensure the reliable operation, optimization, and scalability of HPC systems in distributed environments.

What are some common challenges faced by Remote HPC System Engineers, and how can they be managed effectively?

Remote HPC System Engineers often encounter challenges such as troubleshooting complex hardware or software issues without physical access, ensuring seamless system performance, and coordinating with geographically dispersed teams. These can be managed by leveraging strong remote monitoring tools, maintaining clear documentation, and establishing effective communication channels with on-site staff. Proactively scheduling regular system health checks and participating in virtual team meetings can also help address problems quickly and maintain high system reliability.

What are Remote HPC System Engineers?

Remote HPC (High Performance Computing) System Engineers are IT professionals who design, implement, manage, and troubleshoot HPC systems and clusters from a remote location. They work with advanced computing infrastructure that supports scientific research, complex simulations, and large-scale data processing. Their responsibilities include configuring hardware and software, monitoring system performance, ensuring security, and providing technical support to users, all while working off-site. This role requires strong expertise in HPC technologies, operating systems like Linux, networking, and scripting, as well as effective communication skills for collaborating with distributed teams.

What is the difference between Remote Hpc System Engineer vs Remote Cloud Infrastructure Engineer?

AspectRemote Hpc System EngineerRemote Cloud Infrastructure Engineer
CredentialsTypically requires Linux certifications, HPC-specific trainingOften requires cloud platform certifications (AWS, Azure, GCP)
Work EnvironmentHigh-performance computing clusters, research labsCloud platforms, data centers, virtualized environments
Industry UsageResearch, scientific computing, academiaTech, finance, enterprise IT
Search/Comparison IntentUnderstanding HPC-specific roles vs cloud rolesComparing on-premise HPC vs cloud infrastructure

The Remote Hpc System Engineer focuses on managing and optimizing high-performance computing clusters, often in research or scientific environments. In contrast, the Remote Cloud Infrastructure Engineer specializes in designing and maintaining cloud-based infrastructure across various industries. While both roles require technical expertise in system management, their environments and certifications differ, catering to distinct operational needs.

What are popular job titles related to Remote Hpc System Engineer jobs in Arizona? For Remote Hpc System Engineer jobs in Arizona, the most frequently searched job titles are:
What job categories do people searching Remote Hpc System Engineer jobs in Arizona look for? The top searched job categories for Remote Hpc System Engineer jobs in Arizona are:
What cities in Arizona are hiring for Remote Hpc System Engineer jobs? Cities in Arizona with the most Remote Hpc System Engineer job openings:
Linux HPC Engineer (Remote)

Full-time

Medical, Retirement, PTO

Posted 29 days ago


Job description

RedLine Performance Solutions (RedLine) has been in the High Performance Computing (HPC) solutions engineering services business for over 26 years and is consistently determined to keep the "bar of excellence" quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. RedLine provides IT infrastructure management and technical support services to some of the world's largest supercomputing sites.
The Linux/HPC Engineer will primarily work on a small team of HPC Systems Administrators responsible for the installation and operational support of an HPC cluster located in Phoenix, Arizona. Operations run 24x7 and therefore there will be a rotational on-call requirement. The Linux/HPC Engineer will actively participate in the evolution and maintenance of the technical infrastructure, in addition to supporting the on-site HPC environment.
The position can be remote, but will be required to support the normal business hours for the primary customer site in Phoenix, AZ. In addition to supporting the HPC cluster in Phoenix, the Engineer will also contribute to other infrastructure and customer initiatives as business needs arise. The Engineer will be required to shift priorities, support parallel efforts, and provide technical expertise across multiple projects, including deployments, upgrades, troubleshooting, and documentation. Additional assignments may include short-term tasking in adjacent programs, collaboration with cross-functional engineering teams, and participation in planned maintenance windows or special projects to meet organizational commitments. Travel to different customer sites is expected to be a maximum of 25% of the time.
US citizenship is a mandatory requirement for this position. This full-time (W-2) position offers a full benefits package including paid time off, 401k match, and health care benefits.
Required Skills:
  • 5 or more years of Linux systems administration, preferably in a Red Hat and/or Rocky environment
  • Strong knowledge of TCP/IP networking
  • HPC system administration experience (e.g., parallel file systems, cluster management, archival systems)
  • Strong experience in Bash, Perl, and Python scripting in a version-controlled environment using Git
  • Strong verbal and written communication skills, with the ability to coordinate between multiple team members in remote locations between several disparate projects
  • Strong organizational skills

Preferred Skills/Experience:
  • Experienced with system engineering in addition to system administration
  • Cloud administration (e.g. Azure, GCP, AWS)
  • Experience with deploying and supporting computational models and simulations in HPC infrastructure (e.g., on-premise and cloud, with containers).
  • Knowledge and understanding of application hosting, with experience using Cloud Services in a Commercial Infrastructure as a Service (IAAS) or Platform as Service (PAAS) environment.
  • Red Hat Certification (e.g., RHCSA, RHCE)
  • Server automation experience (e.g., Puppet, Foreman, Ansible)
  • Experience with job scheduling software (e.g., Slurm or Moab)
  • Experience with cluster automation tools (e.g., xCAT, HPCM, or Bright Cluster Manager)
  • Familiarity with a wide range of server and networking hardware (e.g., HPE, SuperMicro, NetGate, Juniper, etc.)
  • Applications such as Atlassian Confluence, Gitlab, or Mediawiki

To learn more about RedLine, please visit our website at www.RedLinePerf.com