2

Remote Hpc System Engineer Jobs in Michigan (NOW HIRING)

The successful candidate will be able to perform system integration of data migration solutions for ... REMOTE Basic Requirements Required Skills: * High School diploma, Bachelor's degree in Engineering ...

This role is not eligible for remote work. WHAT YOU'LL DO * Implement and maintain test plans, test ... Execute manual and automated tests to validate system functionality, performance, and reliability.

Systems Engineer

Troy, MI ยท On-site +1

$50K/yr

The Systems Engineer I is responsible for handling service requests and incidents escalated from ... Remote & onsite workstation troubleshooting. * Communicating with clients and co-workers and ...

(Remote) Senior Software Engineer

Wyoming, MI ยท Remote

$111K - $146K/yr

You will be responsible for providing accurate effort estimates, identifying system and process ... This remote role welcomes candidates anywhere in the US. Salary: 110K - 139K What your impact will ...

(Remote) Senior Software Engineer

Wyoming, MI ยท Remote

$111K - $146K/yr

You will be responsible for providing accurate effort estimates, identifying system and process ... This remote role welcomes candidates anywhere in the US. Salary: 110K - 139K What your impact will ...

next page

Showing results 1-20

Remote Hpc System Engineer information

What are the key skills and qualifications needed to thrive as a Remote HPC System Engineer, and why are they important?

To thrive as a Remote HPC System Engineer, you need expertise in Linux system administration, parallel computing, networking, and a degree in computer science or related field. Familiarity with job schedulers (like Slurm), cluster management tools, scripting languages (such as Python or Bash), and certifications like CompTIA Linux+ or Red Hat Certified Engineer are highly valuable. Strong problem-solving abilities, effective communication, and self-motivation are essential soft skills for remote collaboration and troubleshooting. These skills ensure the reliable operation, optimization, and scalability of HPC systems in distributed environments.

What are some common challenges faced by Remote HPC System Engineers, and how can they be managed effectively?

Remote HPC System Engineers often encounter challenges such as troubleshooting complex hardware or software issues without physical access, ensuring seamless system performance, and coordinating with geographically dispersed teams. These can be managed by leveraging strong remote monitoring tools, maintaining clear documentation, and establishing effective communication channels with on-site staff. Proactively scheduling regular system health checks and participating in virtual team meetings can also help address problems quickly and maintain high system reliability.

What is the difference between Remote Hpc System Engineer vs Remote Cloud Infrastructure Engineer?

AspectRemote Hpc System EngineerRemote Cloud Infrastructure Engineer
CredentialsTypically requires Linux certifications, HPC-specific trainingOften requires cloud platform certifications (AWS, Azure, GCP)
Work EnvironmentHigh-performance computing clusters, research labsCloud platforms, data centers, virtualized environments
Industry UsageResearch, scientific computing, academiaTech, finance, enterprise IT
Search/Comparison IntentUnderstanding HPC-specific roles vs cloud rolesComparing on-premise HPC vs cloud infrastructure

The Remote Hpc System Engineer focuses on managing and optimizing high-performance computing clusters, often in research or scientific environments. In contrast, the Remote Cloud Infrastructure Engineer specializes in designing and maintaining cloud-based infrastructure across various industries. While both roles require technical expertise in system management, their environments and certifications differ, catering to distinct operational needs.

What are Remote HPC System Engineers?

Remote HPC (High Performance Computing) System Engineers are IT professionals who design, implement, manage, and troubleshoot HPC systems and clusters from a remote location. They work with advanced computing infrastructure that supports scientific research, complex simulations, and large-scale data processing. Their responsibilities include configuring hardware and software, monitoring system performance, ensuring security, and providing technical support to users, all while working off-site. This role requires strong expertise in HPC technologies, operating systems like Linux, networking, and scripting, as well as effective communication skills for collaborating with distributed teams.
What job categories do people searching Remote Hpc System Engineer jobs in Michigan look for? The top searched job categories for Remote Hpc System Engineer jobs in Michigan are:
What cities in Michigan are hiring for Remote Hpc System Engineer jobs? Cities in Michigan with the most Remote Hpc System Engineer job openings:
Infographic showing various Remote Hpc System Engineer job openings in Michigan as of June 2026, with employment types broken down into 2% As Needed, 86% Full Time, and 12% Contract. Highlights an 87% Physical, 5% Hybrid, and 8% Remote job distribution.
Senior HPC Software Engineer

Senior HPC Software Engineer

Ford Motor Company

Dearborn, MI โ€ข Remote

$113K - $192K/yr

Full-time

Medical, Dental, Life, PTO

Posted 14 days ago


Job description

We are seeking a senior technical contributor to help support, modernize, and scale our on premise high performance computing platform. This role will work across Linux systems administration, HPC operations, Kubernetes-based services, automation, observability, software tooling, and user-facing platform delivery. The ideal candidate has deep experience administering RHEL based systems in complex compute environments and is comfortable troubleshooting issues across operating systems, schedulers, storage, networking, containers, applications, and user workloads.

This person will play a key role in improving the reliability, usability, and operational maturity of the platform. They will help develop and maintain core HPC services, support users running demanding engineering and AI/ML workloads, and create tooling, scripts, APIs, and integrations. Strong software engineering fundamentals are important, including experience with Python, Go, or similar languages, Git-based development workflows, code reviews, testing practices, CI/CD pipelines, documentation, and maintainable code design. Experience with Slurm or other workload managers is highly valued.

We are looking for someone who can balance strong technical depth with a user-focused delivery mindset. This role requires the ability to work collaboratively with platform engineers, application teams, and technical users to identify pain points, resolve production issues, document repeatable processes, and build durable improvements. The right candidate will be pragmatic, a team player, comfortable in a fast-moving environment, and motivated by making complex, massive on-prem infrastructure easier to operate, automate, observe, and continuously improve.ย 

  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience
  • 10+ years of experience in systems engineering, infrastructure engineering, platform engineering, or a related technical role.
  • Strong Linux systems administration experience, preferably with RHEL.
  • Experience with Slurm, PBS, or another HPC workload manager.
  • Experience creating APIs, applications, and services that support platform operations and user workflows.
  • Experience supporting production compute, infrastructure, and large-scale technical environments.
  • Hands-on experience with scripting and software development using Python, Go, Bash, or similar languages.

  • Familiarity with CI/CD concepts, GitHub, and modern software delivery practices.
  • Strong troubleshooting skills across operating systems, services, networking, storage, and application layers.
  • Ability to write clear documentation and communicate effectively with both technical and non-technical stakeholders.
  • Strong ownership mindset with the ability to drive issues to resolution.

  • Ability to use independent judgement to make sound technical decisions.

You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!

As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder...or all of the above? No matter what you choose, we offer a work life that works for you, including:

  • Immediate medical, dental, and prescription drug coverage
  • Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
  • Vehicle discount program for employees and family members, and management leases
  • Tuition assistance
  • Established and active employee resource groups
  • Paid time off for individual and team community service
  • A generous schedule of paid holidays, including the week between Christmas and New Year's Day
  • Paid time off and the option to purchase additional vacation time.

For a detailed look at our benefits, click here:ย Benefit Summaryย 

This position is a salary grade 8.ย 

This position is a salary grade 8 and ranges from $113,580-192,900.

*Visa Sponsorship is not provided for this role*

Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.

We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call 1-888-336-0660.
ย 

#LI-Remote

#LI-GH2

  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience
  • 10+ years of experience in systems engineering, infrastructure engineering, platform engineering, or a related technical role.
  • Strong Linux systems administration experience, preferably with RHEL.
  • Experience with Slurm, PBS, or another HPC workload manager.
  • Experience creating APIs, applications, and services that support platform operations and user workflows.
  • Experience supporting production compute, infrastructure, and large-scale technical environments.
  • Hands-on experience with scripting and software development using Python, Go, Bash, or similar languages.

  • Familiarity with CI/CD concepts, GitHub, and modern software delivery practices.
  • Strong troubleshooting skills across operating systems, services, networking, storage, and application layers.
  • Ability to write clear documentation and communicate effectively with both technical and non-technical stakeholders.
  • Strong ownership mindset with the ability to drive issues to resolution.

  • Ability to use independent judgement to make sound technical decisions.

  • Administer, troubleshoot, and improve RHEL based high performance computing environments supporting CPU and GPU workloads.
  • Create and maintain HPC services across compute, storage, networking, scheduling, Kubernetes, and observability.
  • Develop tools, scripts, APIs, integrations, and automation using Python, Go, Bash, or similar languages.
  • Apply software engineering best practices, including Git workflows, code reviews, testing, modular design, and CI/CD.
  • Support and help update HPC scheduling environments, with Slurm experience preferred.

  • Improve monitoring, alerting, dashboards, and operational visibility using Grafana, Prometheus, Dynatrace, and related tools.

  • Partner with users, customers, and internal engineering teams to understand requirements, resolve issues, and improve platform usability.
  • Create and maintain documentation, architecture notes, user guides, and operational procedures.
  • Drive platform modernization focused on reliability, scalability, automation, security, and maintainability.


Ford logo

About Ford

Sourced by ZipRecruiter

At Ford Motor Company, we believe freedom of movement drives human progress. With our incredible plans for the future of mobility, we have a wide variety of opportunities for you to accelerate your career and help us define tomorrow's transportation.

Industry

Civil engineering construction

Company size

51 - 200 Employees

Headquarters location

Doral, FL, US

Year founded

1982