NVIDIA is looking for a highly-motivated Technical Program Manager (TPM) to join our Applied ... supercomputing systems. This TPM will play a crucial role throughout the lifecycle of the latest AI ...
NVIDIA is looking for a highly-motivated Technical Program Manager (TPM) to join our Applied ... supercomputing systems. This TPM will play a crucial role throughout the lifecycle of the latest AI ...
NVIDIA is looking for a highly-motivated Technical Program Manager (TPM) to join our Applied ... supercomputing systems. This TPM will play a crucial role throughout the lifecycle of the latest AI ...
NVIDIA is looking for a highly-motivated Technical Program Manager (TPM) to join our Applied ... supercomputing systems. This TPM will play a crucial role throughout the lifecycle of the latest AI ...
... Supercomputers) . As a Systems Engineer, you will serve as a vital catalyst within a premier ... You will be tasked with the sophisticated management of Compute Clusters, Software Environments ...
New
... Supercomputers) . As a Systems Engineer, you will serve as a vital catalyst within a premier ... You will be tasked with the sophisticated management of Compute Clusters, Software Environments ...
New
Member of Technical Staff, Supercomputing Platform & Infrastructure
San Francisco, CA · On-site +1
$200K - $550K/yr
About the role As an engineer on the Supercomputing Platform & Infrastructure team, you will design ... Deploy, operate, and optimize K8s clusters used to schedule and manage AI workloads * Develop ...
Member of Technical Staff, Supercomputing Platform & Infrastructure
San Francisco, CA · On-site +1
$200K - $550K/yr
About the role As an engineer on the Supercomputing Platform & Infrastructure team, you will design ... Deploy, operate, and optimize K8s clusters used to schedule and manage AI workloads * Develop ...
IT Systems Engineer V
Louisville, KY · On-site
This position is responsible for management and administration of the university's central research computing infrastructure, including supercomputing clusters, software, data storage, and networking.
IT Systems Engineer V
Louisville, KY · On-site
This position is responsible for management and administration of the university's central research computing infrastructure, including supercomputing clusters, software, data storage, and networking.
Senior Hardware Technical Program Manager
Sunnyvale, CA · On-site
$180K - $230K/yr
The Role As a Senior Hardware Technical Program Manager at Cerebras, you will spearhead operational ... Work on one of the fastest AI supercomputers in the world. * Enjoy job stability with startup ...
Senior Hardware Technical Program Manager
Sunnyvale, CA · On-site
$180K - $230K/yr
The Role As a Senior Hardware Technical Program Manager at Cerebras, you will spearhead operational ... Work on one of the fastest AI supercomputers in the world. * Enjoy job stability with startup ...
Strategic Sourcing Manager, Compute
San Francisco, CA · On-site
$226K - $285K/yr
... the roadmap to scale our supercomputing footprint globally. From site planning to system ... About the Role We are seeking a Strategic Sourcing Manager who is ready to take on global-scale ...
Strategic Sourcing Manager, Compute
San Francisco, CA · On-site
$226K - $285K/yr
... the roadmap to scale our supercomputing footprint globally. From site planning to system ... About the Role We are seeking a Strategic Sourcing Manager who is ready to take on global-scale ...
Software Engineer, Frontier Systems - Power Management
San Francisco, CA · On-site
$203K - $241K/yr
As a Software Engineer on the Frontier Systems team, you will work on critical infrastructure for large-scale supercomputers focused on power management, optimizing power usage, and ensuring ...
Software Engineer, Frontier Systems - Power Management
San Francisco, CA · On-site
$203K - $241K/yr
As a Software Engineer on the Frontier Systems team, you will work on critical infrastructure for large-scale supercomputers focused on power management, optimizing power usage, and ensuring ...
Senior Hardware Technical Program Manager
Sunnyvale, CA · On-site
$180K - $230K/yr
... engine supercomputers. Your role will be critical in ensuring seamless translation of product ... Develop and manage comprehensive program plans, including schedules, material plans, and validation ...
Senior Hardware Technical Program Manager
Sunnyvale, CA · On-site
$180K - $230K/yr
... engine supercomputers. Your role will be critical in ensuring seamless translation of product ... Develop and manage comprehensive program plans, including schedules, material plans, and validation ...
$127K/yr
This position is responsible for management and administration of the university's central research computing infrastructure, including supercomputing clusters, software, data storage, and networking.
$127K/yr
This position is responsible for management and administration of the university's central research computing infrastructure, including supercomputing clusters, software, data storage, and networking.
Member of Technical Staff - Compute Infrastructure
Palo Alto, CA · On-site
$180K - $440K/yr
We are building one of the world's largest AI supercomputers from the ground up. As part of the ... Work on Linux kernel internals, scheduling, memory management, and resource isolation at cluster ...
Member of Technical Staff - Compute Infrastructure
Palo Alto, CA · On-site
$180K - $440K/yr
We are building one of the world's largest AI supercomputers from the ground up. As part of the ... Work on Linux kernel internals, scheduling, memory management, and resource isolation at cluster ...
IT Systems Engineer V
Louisville, KY · On-site
$127K/yr
This position is responsible for management and administration of the university's central research computing infrastructure, including supercomputing clusters, software, data storage, and networking.
IT Systems Engineer V
Louisville, KY · On-site
$127K/yr
This position is responsible for management and administration of the university's central research computing infrastructure, including supercomputing clusters, software, data storage, and networking.
Network Admin Sr
Stennis Space Center, MS · On-site
$79K - $134K/yr
The Navy DoW Supercomputing Resource Center (DSRC) hosts some of the fastest supercomputers in the ... Excellent knowledge of management, control, and monitoring of server infrastructure. * Ability to ...
Network Admin Sr
Stennis Space Center, MS · On-site
$79K - $134K/yr
The Navy DoW Supercomputing Resource Center (DSRC) hosts some of the fastest supercomputers in the ... Excellent knowledge of management, control, and monitoring of server infrastructure. * Ability to ...
... like research and supercomputing facilities,commandand control centers, and SCIFs, as well as ... Manage scheduling, budgets, staffing, and project set-up with clients, sub-contractors,vendorsand ...
... like research and supercomputing facilities,commandand control centers, and SCIFs, as well as ... Manage scheduling, budgets, staffing, and project set-up with clients, sub-contractors,vendorsand ...
... Supercomputer systems and other OLCF managed HPC clusters. Job Responsibilities: * Work with the team to define and implement best practices and standards within the organization * Keeping the ...
... Supercomputer systems and other OLCF managed HPC clusters. Job Responsibilities: * Work with the team to define and implement best practices and standards within the organization * Keeping the ...
... managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global ... These multi-exaflop supercomputers are deployed in some of the biggest datacenters. These ...
... managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global ... These multi-exaflop supercomputers are deployed in some of the biggest datacenters. These ...
ASRC Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support the NASA Advanced Supercomputing (NAS) facility, home to some of the world's fastest ...
ASRC Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support the NASA Advanced Supercomputing (NAS) facility, home to some of the world's fastest ...
ASRC Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support the NASA Advanced Supercomputing (NAS) facility, home to some of the world's fastest ...
ASRC Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support the NASA Advanced Supercomputing (NAS) facility, home to some of the world's fastest ...
ASRC Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support the NASA Advanced Supercomputing (NAS) facility, home to some of the world's fastest ...
ASRC Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support the NASA Advanced Supercomputing (NAS) facility, home to some of the world's fastest ...
Operations Engineer, Fleet Reliability
Richmond, VA · On-site
$55.75 - $74.25/hr
The Operations Engineer, Fleet Reliability will manage the provisioning and uptime of supercomputing clusters, troubleshoot issues, and improve team processes in a fast-paced environment.
Operations Engineer, Fleet Reliability
Richmond, VA · On-site
$55.75 - $74.25/hr
The Operations Engineer, Fleet Reliability will manage the provisioning and uptime of supercomputing clusters, troubleshoot issues, and improve team processes in a fast-paced environment.
Manager Supercomputer information
What is the difference between Manager Supercomputer vs Supercomputing Systems Engineer?
| Aspect | Manager Supercomputer | Supercomputing Systems Engineer |
|---|---|---|
| Required Credentials | Bachelor's or master's in computer science, engineering, or related field; management experience | Bachelor's or master's in computer science, computer engineering, or related field; technical certifications |
| Work Environment | Oversees supercomputing facilities, manages teams, strategic planning | Designs, develops, and maintains supercomputing systems, works hands-on with hardware/software |
| Employer & Industry Usage | Research labs, government agencies, large tech companies | Research institutions, high-performance computing centers, tech firms |
The Manager Supercomputer primarily oversees supercomputing operations and manages teams, focusing on strategic and administrative tasks. In contrast, the Supercomputing Systems Engineer is more technically involved, designing and maintaining supercomputing systems. Both roles require strong technical backgrounds, but their responsibilities differ in scope and focus.

Full-time
Posted 4 days ago
Job description
NVIDIA is looking for a highly-motivated Technical Program Manager (TPM) to join our Applied Systems Engineering Team to drive datacenter integration for the next generation of NVIDIA AI supercomputing systems. This TPM will play a crucial role throughout the lifecycle of the latest AI systems at scale, from datacenter design and requirements definition, through systems integration of AI clusters into the datacenter environment, and support for these systems as they enter production.
This role will drive collaboration between engineering leaders across multiple hardware and software teams, helping us work together to build AI supercomputers for NVIDIA engineers and develop reference architectures to advise customers and partners.
What you'll be doing:
Collaborate with outstanding engineers and architects to build and deploy large scale GPU computing systems based on NVIDIA's reference supercomputing architectures
Lead the integration of new AI clusters with datacenter facilities with demanding requirements on power, cooling, and instrumentation
Coordinate design and fit-out of new datacenter builds, working with both internal engineering teams and external contractors
Own and produce detailed documentation for the end-to-end process for datacenter fit-out and integration
Communicate internally with engineering leadership to prioritize and address key issues essential to the success of our largest customers
What we need to see:
BS in Applied Science or Engineering (or equivalent experience)
8+ years of overall experience
Experience with high-performance computing systems and GPU clusters deployed in on-premises datacenters
A passion for understanding challenging technical problems and driving the process of finding a solution
Strong teamwork and interpersonal skills, to facilitate building a collaborative workflow for coordination between many teams
Ways to stand out from the crowd:
Understanding of datacenter design, including familiarity with power and cooling technologies
Expertise in system monitoring and instrumentation of large clusters, using technologies such as Prometheus, Grafana, Splunk, Modbus, and BACNet
Experience working with the engineering or academic research community supporting high-performance computing or deep learning
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.About Nvidia
Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Santa Clara, CA, US
Year founded
1993