... GPU capabilities. • Drive joint architecture reviews and "whiteboard" sessions with CSP and ... engineering or solutions-architect background: requirements gathering, PoC ownership, roadmap ...
... GPU capabilities. • Drive joint architecture reviews and "whiteboard" sessions with CSP and ... engineering or solutions-architect background: requirements gathering, PoC ownership, roadmap ...
Machine Learning & Operations Engineer
Durham, NC · Remote
$71K - $96K/yr
Optimize GPU/compute utilization across cloud and on-prem environments. * Deploy, monitor, and ... More typical DevOps responsibilities for software development as required. Requirements Required ...
Quick apply
Machine Learning & Operations Engineer
Durham, NC · Remote
$71K - $96K/yr
Optimize GPU/compute utilization across cloud and on-prem environments. * Deploy, monitor, and ... More typical DevOps responsibilities for software development as required. Requirements Required ...
Machine Learning & Operations Engineer
Durham, NC · Remote
$67K - $90K/yr
Optimize GPU/compute utilization across cloud and on-prem environments. * Deploy, monitor, and ... More typical DevOps responsibilities for software development as required. Requirements Required ...
Machine Learning & Operations Engineer
Durham, NC · Remote
$67K - $90K/yr
Optimize GPU/compute utilization across cloud and on-prem environments. * Deploy, monitor, and ... More typical DevOps responsibilities for software development as required. Requirements Required ...
Stay updated on industry trends and advancements in UEFI firmware, GPU technologies, and ... Strong firmware programming and debugging skills. * Experience with hardware and firmware bring-up.
Stay updated on industry trends and advancements in UEFI firmware, GPU technologies, and ... Strong firmware programming and debugging skills. * Experience with hardware and firmware bring-up.
Machine Learning & Operations Engineer
Durham, NC · Remote
$67K - $90K/yr
Optimize GPU/compute utilization across cloud and on-prem environments. * Deploy, monitor, and ... More typical DevOps responsibilities for software development as required. Required Qualifications ...
Machine Learning & Operations Engineer
Durham, NC · Remote
$67K - $90K/yr
Optimize GPU/compute utilization across cloud and on-prem environments. * Deploy, monitor, and ... More typical DevOps responsibilities for software development as required. Required Qualifications ...
Required : • A grasp of the CUDA programming model and experience employing GPU profiling tools like NVIDIA Nsight Systems/Compute to address PCIe bottlenecks and kernel stalls. • Extensive ...
Required : • A grasp of the CUDA programming model and experience employing GPU profiling tools like NVIDIA Nsight Systems/Compute to address PCIe bottlenecks and kernel stalls. • Extensive ...
An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can ... We are now looking for a motivated ASIC Timing Engineer to join our dynamic and growing team. If ...
An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can ... We are now looking for a motivated ASIC Timing Engineer to join our dynamic and growing team. If ...
A grasp of the CUDA programming model and experience employing GPU profiling tools like NVIDIA Nsight Systems/Compute to address PCIe bottlenecks and kernel stalls. * Extensive knowledge of profiling ...
A grasp of the CUDA programming model and experience employing GPU profiling tools like NVIDIA Nsight Systems/Compute to address PCIe bottlenecks and kernel stalls. * Extensive knowledge of profiling ...
Join our global Developer Technology (DevTech) team at NVIDIA, where we drive innovation and ... Knowledge of CPU and GPU architecture fundamentals and low-level performance optimizations
Join our global Developer Technology (DevTech) team at NVIDIA, where we drive innovation and ... Knowledge of CPU and GPU architecture fundamentals and low-level performance optimizations
Stay updated on industry trends and advancements in UEFI firmware, GPU technologies, and ... Strong firmware programming and debugging skills. Experience with hardware and firmware bring-up.
Stay updated on industry trends and advancements in UEFI firmware, GPU technologies, and ... Strong firmware programming and debugging skills. Experience with hardware and firmware bring-up.
Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern ... As a software engineer, you will craft highly efficient software to automate and facilitate chip ...
Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern ... As a software engineer, you will craft highly efficient software to automate and facilitate chip ...
Senior Software Engineer, AI Inference
Raleigh, NC · On-site +1
$133K - $220K/yr
As leading developers and maintainers of the vLLM project, and inventors of state-of-the-art ... Manage and scale multi-cloud GPU infrastructure using Terraform and Ansible, including both bare ...
Senior Software Engineer, AI Inference
Raleigh, NC · On-site +1
$133K - $220K/yr
As leading developers and maintainers of the vLLM project, and inventors of state-of-the-art ... Manage and scale multi-cloud GPU infrastructure using Terraform and Ansible, including both bare ...
An era in which our tightly coupled CPU, GPU and DPU technology acts as the brains of computers ... NVIDIA is searching for a highly motivated, technical engineer to join the Tegra system-on-chip ...
An era in which our tightly coupled CPU, GPU and DPU technology acts as the brains of computers ... NVIDIA is searching for a highly motivated, technical engineer to join the Tegra system-on-chip ...
Senior ASIC Front End Infrastructure Engineer
Durham, NC · On-site
$104K - $142K/yr
Keep the GPU Continuous Integration system at thecutting edgeofsource management methodologies ... MastersDegree in Electrical Engineering, Computer Engineering, Computer Science or related or ...
Senior ASIC Front End Infrastructure Engineer
Durham, NC · On-site
$104K - $142K/yr
Keep the GPU Continuous Integration system at thecutting edgeofsource management methodologies ... MastersDegree in Electrical Engineering, Computer Engineering, Computer Science or related or ...
Senior Software Engineer, CUTLASS Performance
Durham, NC · On-site
$118K - $156K/yr
... key GPU kernel and fusion opportunities. * Identify gaps between theoretical and realized ... Strong programming skills in Python and C++. * Experience in software performance analysis and ...
Senior Software Engineer, CUTLASS Performance
Durham, NC · On-site
$118K - $156K/yr
... key GPU kernel and fusion opportunities. * Identify gaps between theoretical and realized ... Strong programming skills in Python and C++. * Experience in software performance analysis and ...
... Engineering practice, you will design and drive deployment of fully integrated architectures for GPU-accelerated AI factories and high-performance computing infrastructure in close partnership with ...
... Engineering practice, you will design and drive deployment of fully integrated architectures for GPU-accelerated AI factories and high-performance computing infrastructure in close partnership with ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site +1
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site +1
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site +1
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site +1
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Senior Container Platform Engineer*** Hybrid in Raleigh, NC
Raleigh, NC · On-site
$86K - $144K/yr
You will collaborate closely with DevOps, SRE, and Security teams to architect scalable, self ... Experience supporting AI/ML or GPU-based workloads on Kubernetes (e.g., NVIDIA GPU Operator, ML ...
Gpu Engineer information
See Raleigh, NC salary details
$37.9K - $46.6K
3% of jobs
$46.6K - $55.3K
3% of jobs
$55.3K - $64K
4% of jobs
$64K - $72.7K
7% of jobs
$72.7K - $81.4K
6% of jobs
$82.2K is the 25th percentile. Wages below this are outliers.
$81.4K - $90.1K
6% of jobs
The median wage is $98K / yr.
$90.1K - $98.8K
21% of jobs
$98.8K - $107.5K
4% of jobs
$113.2K is the 75th percentile. Wages above this are outliers.
$107.5K - $116.3K
29% of jobs
$116.3K - $125K
2% of jobs
$125K - $133.7K
13% of jobs
$37.9K
$98.9K
$133.7K
How much do gpu engineer jobs pay per year?
What engineers make $500,000?
What engineers make $300,000 a year?
What jobs pay $400 an hour?
What are the key skills and qualifications needed to thrive in the Gpu Engineer position, and why are they important?
To thrive as a GPU Engineer, you need strong knowledge of computer architecture, proficiency in C/C++, and experience with parallel programming models such as CUDA or OpenCL, along with a degree in computer science, electrical engineering, or a related field. Familiarity with debugging tools, driver development, performance profiling utilities, and hardware simulation platforms is typically required. Excellent problem-solving abilities, attention to detail, and effective teamwork and communication skills help distinguish top candidates. These skills ensure that GPU Engineers can develop high-performance solutions, efficiently troubleshoot hardware and software issues, and collaborate successfully in multidisciplinary environments.
What does a GPU engineer do?
What does a GPU Engineer do?
A GPU Engineer designs, develops, and optimizes graphics processing units (GPUs) for applications like gaming, artificial intelligence, and high-performance computing. They work on hardware architecture, driver development, and parallel computing optimizations to maximize performance. GPU Engineers collaborate with software developers, hardware designers, and researchers to improve graphics rendering, machine learning acceleration, and computational efficiency.
What are some common challenges faced by GPU Engineers, and how are they addressed?
GPU Engineers often face challenges such as optimizing code for maximum parallel efficiency, debugging complex hardware-software interactions, and keeping pace with rapidly evolving GPU architectures. Addressing these issues typically requires a combination of deep architectural understanding, use of specialized profiling and debugging tools, and ongoing collaboration with hardware, software, and QA teams. Many companies provide ongoing training and encourage knowledge sharing within engineering teams to help individuals stay current and effectively tackle new technical hurdles. Overcoming these challenges not only sharpens technical expertise but also opens doors for career growth into architect, team lead, or principal engineer roles.
Full-time
Posted 2 days ago
Job description
NVIDIA is a leading technology company known for its groundbreaking developments in Artificial Intelligence and High-Performance Computing. They are seeking a Senior Software Engineer for their CSP Engagements team to focus on the cloud-native stack for advanced AI/ML datacenters, tackling complex scheduling challenges and enhancing Kubernetes and Slurm functionalities.
Responsibilities:
• Perform deep-dive debugging of multi-rack, multi-tenant clusters: scheduler behavior, container runtime issues, device-plugin crashes, RDMA/IB fabric anomalies, etc.
• Gather customer requirements and prototype feature extensions for Kubernetes operators, Slurm plugins, and custom micro-services that expose new GPU capabilities.
• Drive joint architecture reviews and “whiteboard” sessions with CSP and internal platform teams; convert findings into RFCs and upstream pull requests.
• Create reproducible testbeds (Helm/Ansible/Terraform) that mirror customer environments; automate validation and benchmark suites.
• Deliver technical collateral-design docs, how-to guides, demo scripts-and present at customer on-sites, KubeCon, and SlurmUG.
• Collaborate with AE, FAE, and Solution Architect teams to deliver integrated customer solutions and technical documentation.
Qualifications:
Required:
• Strong source-level expertise in Kubernetes internals (scheduler, CRI/CNI/CSI, operators) and Slurm (federation, power-save, plugins).
• Hands-on experience integrating next-gen GPUs (Blackwell/GB200/GB300) or comparable accelerators into containerized clusters.
• Proven track record debugging large-scale, cloud-native stacks across networking (RDMA/RoCE), storage, and control planes.
• Customer-facing engineering or solutions-architect background: requirements gathering, PoC ownership, roadmap influence.
• Familiarity with CI/CD (GitHub Actions, Tekton), observability (Prometheus, OpenTelemetry), and infrastructure-as-code.
• Excellent communication-able to switch between deep technical detail and high-level business impact.
• 10+ years of professional software development experience in distributed systems (Go, Rust, C/C++ or Python for tooling).
• BS or MS (or equivalent experience) in Computer Engineering, Computer Science, or related field.
Preferred:
• Upstream contributions to Kubernetes, Slurm, Volcano, or similar projects.
• Experience with GPU computing (CUDA), deep learning workloads
Company:
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. Founded in 1993, the company is headquartered in Santa Clara, USA, with a team of 10001+ employees. The company is currently Late Stage.
About Nvidia
Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Santa Clara, CA, US
Year founded
1993