2

Remote Hpc System Engineer Jobs in Michigan (NOW HIRING)

$95K - $110K/yr

This role blends software and systems engineering to enhance service availability, automate ... This is a remote position open to candidates within the United States. * You will participate in an ...

$95K - $110K/yr

This role blends software and systems engineering to enhance service availability, automate ... This is a remote position open to candidates within the United States. * You will participate in an ...

This is a remote position with approximately 50% travel. Qualified candidates can reside anywhere ... Requires working with the customer and engineering teams when implementing solutions including ...

next page

Showing results 1-20

Remote Hpc System Engineer information

What are the key skills and qualifications needed to thrive as a Remote HPC System Engineer, and why are they important?

To thrive as a Remote HPC System Engineer, you need expertise in Linux system administration, parallel computing, networking, and a degree in computer science or related field. Familiarity with job schedulers (like Slurm), cluster management tools, scripting languages (such as Python or Bash), and certifications like CompTIA Linux+ or Red Hat Certified Engineer are highly valuable. Strong problem-solving abilities, effective communication, and self-motivation are essential soft skills for remote collaboration and troubleshooting. These skills ensure the reliable operation, optimization, and scalability of HPC systems in distributed environments.

What are some common challenges faced by Remote HPC System Engineers, and how can they be managed effectively?

Remote HPC System Engineers often encounter challenges such as troubleshooting complex hardware or software issues without physical access, ensuring seamless system performance, and coordinating with geographically dispersed teams. These can be managed by leveraging strong remote monitoring tools, maintaining clear documentation, and establishing effective communication channels with on-site staff. Proactively scheduling regular system health checks and participating in virtual team meetings can also help address problems quickly and maintain high system reliability.

What is the difference between Remote Hpc System Engineer vs Remote Cloud Infrastructure Engineer?

AspectRemote Hpc System EngineerRemote Cloud Infrastructure Engineer
CredentialsTypically requires Linux certifications, HPC-specific trainingOften requires cloud platform certifications (AWS, Azure, GCP)
Work EnvironmentHigh-performance computing clusters, research labsCloud platforms, data centers, virtualized environments
Industry UsageResearch, scientific computing, academiaTech, finance, enterprise IT
Search/Comparison IntentUnderstanding HPC-specific roles vs cloud rolesComparing on-premise HPC vs cloud infrastructure

The Remote Hpc System Engineer focuses on managing and optimizing high-performance computing clusters, often in research or scientific environments. In contrast, the Remote Cloud Infrastructure Engineer specializes in designing and maintaining cloud-based infrastructure across various industries. While both roles require technical expertise in system management, their environments and certifications differ, catering to distinct operational needs.

What are Remote HPC System Engineers?

Remote HPC (High Performance Computing) System Engineers are IT professionals who design, implement, manage, and troubleshoot HPC systems and clusters from a remote location. They work with advanced computing infrastructure that supports scientific research, complex simulations, and large-scale data processing. Their responsibilities include configuring hardware and software, monitoring system performance, ensuring security, and providing technical support to users, all while working off-site. This role requires strong expertise in HPC technologies, operating systems like Linux, networking, and scripting, as well as effective communication skills for collaborating with distributed teams.
What job categories do people searching Remote Hpc System Engineer jobs in Michigan look for? The top searched job categories for Remote Hpc System Engineer jobs in Michigan are:
What cities in Michigan are hiring for Remote Hpc System Engineer jobs? Cities in Michigan with the most Remote Hpc System Engineer job openings:
Infographic showing various Remote Hpc System Engineer job openings in Michigan as of June 2026, with employment types broken down into 2% As Needed, 86% Full Time, and 12% Contract. Highlights an 87% Physical, 5% Hybrid, and 8% Remote job distribution.

Expert Site Reliability Engineer

Harriscomputer

On-site, Remote

$95K - $110K/yr

Full-time

Posted 8 days ago


Job description

Site Reliability Engineer (SRE) - Remote

Overview
As a Site Reliability Engineer (SRE) at Altera, you will be responsible for ensuring the reliability, scalability, and performance of our hosted healthcare platforms. This role blends software and systems engineering to enhance service availability, automate operations, and improve the customer experience. You will act as a technical leader in monitoring, troubleshooting, incident response, and continuous improvement across our cloud and hybrid environments.

Key Responsibilities

  • Maintain and improve the reliability, availability, and performance of our production environments.
  • Lead the investigation and resolution of complex application, database, and infrastructure issues.
  • Participate in incident management, conduct root cause analysis (RCA), and contribute to post-incident reviews to prevent future occurrences.
  • Define and measure Service Level Indicators (SLIs) and Objectives (SLOs) to meet our service commitments.
  • Develop proactive monitoring and alerting strategies to identify and resolve issues before they impact customers.
  • Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency.
  • Partner with engineering and cloud teams to refine deployment, monitoring, and support processes.
  • Provide technical leadership during major incidents and act as a key escalation point for critical issues.

Qualifications

Experience:

  • 7+ years of experience supporting enterprise applications, infrastructure, or cloud environments.
  • Monitoring & Observability: Strong experience with APM tools such as LogicMonitor, AppDynamics, Azure Monitor, SentryOne, Dynatrace, Datadog, or New Relic.
  • Microsoft Stack: Deep knowledge of Windows Server administration, IIS, .NET applications, Windows Clustering, MSMQ, Event Logs, and PerfMon.
  • Database Skills: Strong SQL Server experience, including performance tuning, query optimization, blocking analysis, and Always On Availability Groups.
  • Cloud & Networking: Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls).
  • ITSM & ITIL: Familiarity with ServiceNow (or other ITSM platforms) and ITIL principles.

Preferred Skills:

  • Scripting with PowerShell, Python, or similar languages.
  • Infrastructure as Code (Terraform, ARM Templates, Bicep).
  • CI/CD pipelines and deployment automation (Azure DevOps, GitHub Actions).
  • Experience with Kubernetes and containerized workloads.
  • Experience implementing SLOs, SLIs, and Error Budgets.
  • Experience in a healthcare technology or patient care environment.

Education:

  • Bachelor's Degree in Computer Science, Information Technology, or Engineering is preferred; equivalent professional experience will be considered.

Working Arrangements

  • This is a remote position open to candidates within the United States.
  • You will participate in an on-call rotation to support our 24x7 healthcare environment.
  • Occasional after-hours work is required for activations, upgrades, and major incidents.

Travel

  • Travel is not a requirement for this role.

Our company complies with all local/state regulations in regard to displaying salary ranges. If required, the salary range(s) are displayed below and are specifically for those potential hires who will perform work in or reside in the location(s) listed, if selected for the role. Any offered salary is determined based on internal equity, internal salary ranges, market data, ranges, applicant's skills and prior relevant experience, certain degrees and certifications (e.g. JD, technology), for example.

Salary Range
$95,000-$110,000

Why Altera?
At Altera Digital Health, you will have the opportunity to profoundly impact the lives of patients by empowering healthcare providers to deliver superior care. You will join a passionate and gifted team committed to innovation and excellence. We offer a competitive compensation and benefits package and the opportunity to work in a fast-paced and dynamic environment.