Nvidia

60 Nvidia Site Reliability Engineer Jobs Hiring Near You

Nvidia

Technical Product Manager - AI Infra Resilience

Santa Clara, CA · On-site

$196K - $226K/yr

Background as an SRE or building SRE focused products ... NVIDIA is widely considered one of the technology world's most desirable employers. We have some of the world's most forward-thinking and ...

Nvidia

Senior Director, Reliability Engineering

Santa Clara, CA · On-site

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than ... You will own the reliability engineering strategy for NVIDIA's broad boards and systems product ...

Nvidia Corporation

Senior Director, Reliability Engineering

Santa Clara, CA · On-site

Nvidia

Senior DevOps Engineer - Robotics

Santa Clara, CA · On-site

$152K - $196K/yr

... reliability, scalability, and efficiency of NVIDIA's build, test, and release processes for our ... SRE, or infrastructure engineering roles, including ownership of CI or lab environments, with a ...

Nvidia

Senior Software Engineer, Resilience Engineering - DGX Cloud

Santa Clara, CA

$143K - $189K/yr

Are you passionate about building world-class reliability systems? Join NVIDIA as a Senior Software ... Experience within a world-class reliability function like Google SRE or Meta production engineering.

Nvidia

Senior Technical Program Manager - DGX Cloud Infra Security

Santa Clara, CA · Hybrid

... NVIDIA Cloud Partners (NCPs). You will lead security efforts by embedding compliance controls ... Compliance, SRE, and Engineering to continually advance and strengthen the DGX Cloud Security ...

Nvidia

Manager, Systems Software Engineering - NV Cloud Functions

Santa Clara, CA · On-site

... NVIDIA Cloud Functions (NVCF). NVCF is a platform for deploying, managing, and running GPU ... Collaborate closely with cross-functional teams, including product, security, site reliability ...

NVIDIA

Senior Staff AI Platform Engineer

Santa Clara, CA · On-site

$122K - $168K/yr

Required : • 10+ years in cloud, platform, or SRE roles with relevant education or equivalent ... NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

NVIDIA

Software DevOps Engineer, Networking

Santa Clara, CA · On-site

$62 - $84.75/hr

... SRE, or Systems Integration roles. • Deep knowledge of Linux distributions (Ubuntu/RHEL) and ... NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

NVIDIA

Senior Software Engineer - Datacenter Systems

Santa Clara, CA · On-site

Preferred : • Demonstrated experience implementing SRE practices, specifically defining and ... NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

Nvidia Corporation

Senior Software Engineer, Resilience Engineering - DGX Cloud

Santa Clara, CA · On-site

$143K - $189K/yr

Nvidia Corporation

Senior Technical Program Manager - DGX Cloud Infra Security

Santa Clara, CA · On-site

Nvidia Corporation

Manager, Systems Software Engineering - NV Cloud Functions

Santa Clara, CA · On-site

Nvidia

Senior Staff AI Platform Engineer

Santa Clara, CA · On-site

$122K - $168K/yr

NVIDIA is looking to hire a deeply technical, creative, and Senior AI Platform Engineer to build ... What we need to see: * 10+ years in cloud, platform, or SRE roles with relevant education or ...

Nvidia

Senior Storage Software Engineer, DGXC Data Services

Santa Clara, CA · Hybrid

$143K - $189K/yr

Work closely with internal AI teams, platform teams, SRE, and operations to validate storage ... NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High ...

Nvidia

Systems Quality and Reliability Engineer - LPU

Santa Clara, CA · Hybrid

We are seeking Systems Quality and Reliability Engineer to join our LPU team ... NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled ...

Nvidia

Senior Cloud Software Engineer, DGXC Data Services

Santa Clara, CA · On-site

$143K - $189K/yr

Collaborate with SRE, operations, and support teams to improve service reliability, performance ... NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High ...

NVIDIA

Principal Software Engineer, At-Scale Reliability and Fleet Intelligence -- CSP Engagements

Santa Clara, CA · On-site

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High ... They are seeking a Principal Software Engineer to focus on fleet-scale reliability and work with ...

Nvidia Corporation

Senior Staff AI Platform Engineer

Santa Clara, CA · On-site

$122K - $168K/yr

Nvidia Corporation

Senior Storage Software Engineer, DGXC Data Services

Santa Clara, CA · On-site

$143K - $189K/yr

Work closely with internal AI teams, platform teams, SRE, and operations to validate storage ... NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High ...

Nvidia Jobs Information

What is it like to work at Nvidia?

Nvidia is known for its collaborative and innovative culture, prioritizing teamwork and creativity to drive technological advancements. The company's structure is organized into various teams, including research and development, engineering, and sales, with a focus on fostering open communication and knowledge sharing across departments. Working at Nvidia may appeal to candidates who are passionate about artificial intelligence, graphics, and high-performance computing, as the company offers opportunities to contribute to cutting-edge projects and collaborate with experts in the field.

How easy is it to get time off at Nvidia?

Most people find it easy to get time off.
100% of people report it’s easy to get time off.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

How easy is it to take sick days at Nvidia?

Most people find it easy to take sick days.
100% of people report that it’s easy to take time off if they are sick.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

Do people at Nvidia get to take their breaks without interruption?

Most people get breaks without interruption.
100% of people report that they get to take their breaks without interruption.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

Is it stressful to work at Nvidia?

Some people feel stressed out here.
40% of people say they often feel stressed out at work.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

Do people at Nvidia recommend working with their team?

Only some people recommend working with their team.
40% of people report that they wouldn’t recommend working with their immediate team to a friend.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

Do people get enough training when they start at Nvidia?

Some people didn’t get enough training when they started.
40% of people report they didn’t get enough training when they started working here.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

Do people get support to advance at Nvidia?

Most people are given support to advance their career here.
In the last year, 100% of people report being given support to advance their career here.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

Do workers feel well informed about how Nvidia is doing?

Most people feel well informed about how the company is doing.
80% of people feel that they are kept well informed about how the company is doing as a whole.
Based on data from 5 people who took the Breakroom Quiz between December 2024 and December 2025.

Infographic showing various Site Reliability Engineer job openings at Nvidia in the United States as of July 2026, with employment types broken down into 100% Full Time. Highlights an 86% Physical, 12% Hybrid, and 2% Remote job distribution.

Technical Product Manager - AI Infra Resilience

Nvidia

Santa Clara, CA • On-site

$196K - $226K/yr

Full-time

Posted yesterday

Nvidia rating

9.3

Based on 5 frontline employees who took The Breakroom Quiz

15th of 209 rated software companies

Job description

NVIDIA is driving a vision for AI factories that convert tokens to intelligence at scale to power AI demands of tomorrow. Maintaining AI infrastructure at scale takes more than human involvement; it demands smart automation.

We're hiring a Technical Product Manager to drive AI factory resilience platform features and developer experience. You'll build the underpinnings and glue for the resilience of AI factories, making them more adaptable across all architectures, workloads, and generations of hardware. If you are excited about being part of a 0-to-1 effort to create and establish an open-source project on AI infrastructure resilience, we want to hear from you!

What You'll Be Doing:

Define the resilience platform and own the product roadmap and delivery for specific platform features, such as common telemetry interfaces, health-check contracts, attribution hooks, observability APIs that all products conform to, and a joint deployment experience.
Turn developer pain into features and integration partnerships.
Collaborate with the open-source developer community - prioritizing GitHub issues, gathering feedback, supporting contributors, and channeling community signal back into the roadmap.
Collaborate with engineering on feature design, prioritization, execution, and architecture tradeoffs.
Align cross-functionally with other Product, Engineering, Product Marketing, and Field teams on requirements, roadmaps, messaging, and engagements.

What We Need To See:

12+ years in product management, solutions architecture, or software engineering on a technical product.
Bachelor's degree in Computer Science or equivalent experience.
Technical depth in two or more of: data center operations, GPU infra, network and storage, container orchestration (Kubernetes), developer platforms and SDKs, and agent frameworks.
Proven capability to connect with senior technical customers and translate requirements into product strategy.
Comfort operating in fast-paced environments.
Strong written and verbal communication with audiences ranging from developers to executives.

Ways To Stand Out From The Crowd:

Strong experience building products for data center infra operations and observability.
Practical experience delivering or contributing to an open-source product, including interacting with contributors on GitHub.
Experience crafting developer-facing APIs, SDKs, or CLIs at scale.
Background as an SRE or building SRE-focused products.

NVIDIA is widely considered one of the technology world's most desirable employers. We have some of the world's most forward-thinking and hardworking people on our team. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 208,000 USD - 327,750 USD for Level 5, and 240,000 USD - 379,500 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 16, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About Nvidia

Sourced by ZipRecruiter

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Santa Clara, CA, US

Year founded

1993

Website

nvidia.com
