As a Kubernetes Platform Engineer for the HPC Platform teams, you will work to support all activities of our supercomputer center. Our primary platform is the OLCF Slate Service, built on Kubernetes ...
As a Kubernetes Platform Engineer for the HPC Platform teams, you will work to support all activities of our supercomputer center. Our primary platform is the OLCF Slate Service, built on Kubernetes ...
Kubernetes Software Engineer
Knoxville, TN ยท On-site
... supercomputer center. The primary platform is the OLCF Slate Service, built on Kubernetes and RKE2, which provides a container orchestration service for running critical operation applications and ...
Kubernetes Software Engineer
Knoxville, TN ยท On-site
... supercomputer center. The primary platform is the OLCF Slate Service, built on Kubernetes and RKE2, which provides a container orchestration service for running critical operation applications and ...
Senior Platform Engineer
Knoxville, TN ยท On-site
$93K - $127K/yr
As a Senior Platform Engineer for the HPC Platform teams, you will work to support all activities of our supercomputer center. Our primary platform is the OLCF Slate Service, built on Kubernetes and ...
Senior Platform Engineer
Knoxville, TN ยท On-site
$93K - $127K/yr
As a Senior Platform Engineer for the HPC Platform teams, you will work to support all activities of our supercomputer center. Our primary platform is the OLCF Slate Service, built on Kubernetes and ...
Home to the world's fastest supercomputer, Oak Ridge also hosts Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the world's fastest supercomputer, Oak Ridge also hosts Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Knoxville TN - GI need - Employed
Knoxville, TN ยท On-site
$381K/yr
Home to the world's fastest supercomputer, Oak Ridge also hosts Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Knoxville TN - GI need - Employed
Knoxville, TN ยท On-site
$381K/yr
Home to the world's fastest supercomputer, Oak Ridge also hosts Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the world's fastest supercomputer, Oak Ridge also hosts Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the world's fastest supercomputer, Oak Ridge also hosts Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to the worlds fastest supercomputer, Oak Ridge is home to Oak Ridge National Laboratory, whose research and technology initiatives in scientific discovery, clean energy, and security have ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Independently operate and maintain critical mechanical, electrical, and plumbing (MEP) systems in supercomputing data centers, ensuring high reliability and performance through hands-on maintenance ...
Independently operate and maintain critical mechanical, electrical, and plumbing (MEP) systems in supercomputing data centers, ensuring high reliability and performance through hands-on maintenance ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Independently operate and maintain critical mechanical, electrical, and plumbing (MEP) systems in supercomputing data centers, ensuring high reliability and performance through hands-on maintenance ...
Quick apply
Independently operate and maintain critical mechanical, electrical, and plumbing (MEP) systems in supercomputing data centers, ensuring high reliability and performance through hands-on maintenance ...
Facilities Operations Technician
Memphis, TN ยท On-site
As a Facilities Operations Technician at xAI, you'll dive in, operating and maintaining the critical facility systems that power our supercomputing data centers. Working hands-on with MEP systems ...
Facilities Operations Technician
Memphis, TN ยท On-site
As a Facilities Operations Technician at xAI, you'll dive in, operating and maintaining the critical facility systems that power our supercomputing data centers. Working hands-on with MEP systems ...
Facilities Operations Technician
Memphis, TN ยท On-site
As a Facilities Operations Technician at xAI, you'll dive in, operating and maintaining the critical facility systems that power our supercomputing data centers. Working hands-on with MEP systems ...
Quick apply
Facilities Operations Technician
Memphis, TN ยท On-site
As a Facilities Operations Technician at xAI, you'll dive in, operating and maintaining the critical facility systems that power our supercomputing data centers. Working hands-on with MEP systems ...
Geospatial Analyst
Oak Ridge, TN ยท On-site
Support prototype and production workflows within HPC, Supercomputer and Cloud processing environments. * Maintain regular communication with customer, subcontractors and vendor staff members. * Take ...
Geospatial Analyst
Oak Ridge, TN ยท On-site
Support prototype and production workflows within HPC, Supercomputer and Cloud processing environments. * Maintain regular communication with customer, subcontractors and vendor staff members. * Take ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Home to one of the world's fastest supercomputers, this location hosts a national laboratory whose research and technology initiatives in scientific discovery, clean energy, and security have ushered ...
Support prototype and production workflows within HPC, Supercomputer and Cloud processing environments. * Maintain regular communication with customer, subcontractors and vendor staff members. * Take ...
Support prototype and production workflows within HPC, Supercomputer and Cloud processing environments. * Maintain regular communication with customer, subcontractors and vendor staff members. * Take ...
Supercomputer information
See Tennessee salary details
$8.98 - $13.83
16% of jobs
$15.31 is the 25th percentile. Wages below this are outliers.
$13.83 - $18.68
29% of jobs
The median wage is $19.90 / hr.
$18.68 - $23.54
19% of jobs
$27.84 is the 75th percentile. Wages above this are outliers.
$23.54 - $28.39
12% of jobs
$28.39 - $33.24
8% of jobs
$33.24 - $38.09
5% of jobs
$38.09 - $42.95
4% of jobs
$42.95 - $47.80
2% of jobs
$47.80 - $52.65
2% of jobs
$52.65 - $57.50
1% of jobs
$57.50 - $62.36
1% of jobs
$8
$26
$62
How much do supercomputer jobs pay per hour?
What are the key skills and qualifications needed to thrive in the Supercomputer position, and why are they important?
To thrive as a Supercomputer Engineer, you need expertise in high-performance computing (HPC), computer architecture, parallel programming, and advanced mathematics, often supported by a degree in computer science, engineering, or a related field. Familiarity with tools such as MPI, OpenMP, Linux systems, and certifications like Certified HPC Professional can be critical. Strong problem-solving abilities, collaboration, and communication skills set exceptional candidates apart in multidisciplinary environments. These competencies are essential for building, optimizing, and managing supercomputing resources that drive scientific discovery and innovation.
What are the typical responsibilities of a Supercomputer Engineer on a daily basis?
Supercomputer Engineers are responsible for designing, configuring, and maintaining high-performance computing systems to support complex computations in fields such as scientific research, weather modeling, and data analytics. On a daily basis, they might monitor system performance, troubleshoot hardware or software issues, optimize code for scalability, and collaborate closely with researchers and IT professionals to ensure workloads run efficiently. Additionally, they often assist in upgrading systems and implementing the latest technologies to maximize computational power. Working in this role offers opportunities for ongoing professional development and cross-functional teamwork, making each day both challenging and rewarding.
What is a Supercomputer job?
A Supercomputer job typically involves working with high-performance computing (HPC) systems to process complex calculations at extremely high speeds. Professionals in this field may develop software, optimize system performance, manage hardware infrastructure, or support scientific and engineering research. These roles are common in fields such as climate modeling, artificial intelligence, biomedical research, and financial simulations.

Full-time
Medical, Dental, Vision, Retirement, PTO
Posted 9 days ago
Job description
**Please note: The first step in the interview process requires candidates to join a Microsoft Teams meeting with the video turned on.**
- Working with highly talented team members
- 3 weeksโ vacation
- Excellent medical insurance, including employer-paid benefits
As a Kubernetes Platform Engineer for the HPC Platform teams, you will work to support all activities of our supercomputer center. Our primary platform is the OLCF Slate Service, built on Kubernetes and RKE2, which provides a container orchestration service for running critical operation applications and user-managed persistent applications that run alongside the OLCF Supercomputer systems and other OLCF managed HPC clusters.
- Work with the team to define and implement best practices and standards within the organization
- Keeping the Kubernetes platform reliable, available, and fast
- Architecting solutions to problems that improve the reliability, scalability, performance, and efficiency of our services
- Respond to, investigate, and fix service issues all the way from bare metal through the OS to the application code
- Coordinate with vendors to resolve hardware and software problems
- Participate in an on-call rotation providing 24-hour, 7-day support and off-hours maintenance windows
- Work with users to help them use Kubernetes
- Bachelorโs degree in a scientific field and a minimum of 5-8 years of relevant experience. An equivalent combination of education and experience will be considered.
- Experience with Kubernetes as a cluster administrator for on-premises deployments
- Excellent interpersonal/communications skills, and the ability to work as part of a team
- Strong working knowledge of Linux systems fundamentals and networked computing environment concepts
-
Experience with code reviews, code quality, CI/CD tooling, GitOps, SCM (e.g. GitLab)
-
Ability to identify requirements and to define, plan, and implement requisite solutions for small and medium projects
-
Ability to develop and maintain programs and scripts that aid in the operation and automation of tasks using various shell and scripting languages (primarily bash, Python, and Go)
-
Experience with on-call rotation
- The ability to obtain and maintain a Department of Energy "Q" clearance is required. This requires US Citizenship.
- Bachelorโs degree in a scientific field and 8-10 years of relevant experience.
- Subject matter expert in Kubernetes as a cluster administrator for bare metal, on-premises deployments
- Excellent interpersonal/communications skills, be able to effectively communicate with other teams and organizational leadership. Convey technical details to a non-or semi-technical audience.
- Ability to identify requirements and to define, plan, and implement requisite solutions for large, organizationally impactful projects.
- Self-driven with the ability to work in a dynamic, loosely structured research amp; development environment.
- Experience with RKE2 (nice to haves: Red Hat OpenShift and Talos). Multi-cluster management tools for Kubernetes (e.g. Fleet), and container security tools (Neuvector, SCC, pod admission control)
- Experiencing with managing image registries such as Quay or Harbor
- Experience using tools such as Prometheus, Nagios, and Grafana to monitor systems, metrics and create dashboards
- Experience designing and implementing highly-available systems/services
- Experience with Infrastructure-as-Code tooling such as Terraform, Helm, and Puppet
- Experience implementing systems-level security technologies (e.g. SELinux, Seccomp, linux capabilities), experience with DevSecOps, and general security best practices.
- Experience with AIOps and MLOps tooling โ e.g. KServe, Kubeflow, vLLM, NVidia Enterprise AI, AMD Silo AI, ClearML, MLFlow
- Experience using HPC hardware for Kubernetes โ e.g. RDMA, DPUs, Infiniband, many-core CPUs
- Experience with declarative CI/CD tools such as ArgoCD
- Experience with workflow engines such as Apache Airflow or Argo Workflows
- Experience with infrastructure automation
- Cloud engineering experience with at least one cloud service provider
- Experience with reusable, automated workflows such as PagerDuty playbooks
About Cadre5
Sourced by ZipRecruiter
Industry
Software development
Company size
11 - 50 Employees
Headquarters location
Knoxville, TN, US
Year founded
1999