OR ยท On-site
You will work across NVIDIA's GPU driver, CUDA, firmware, BMC, and AI software teams, collaborate closely with Microsoft and ODM/OEM partners, and ensure that developers and enterprise customers have ...
OR ยท On-site
You will work across NVIDIA's GPU driver, CUDA, firmware, BMC, and AI software teams, collaborate closely with Microsoft and ODM/OEM partners, and ensure that developers and enterprise customers have ...
OR ยท On-site
Experience with GPU programming and performance optimization (CUDA or equivalent). * Proven track record leading large, cross-team efforts from concept through production, including navigating ...
OR ยท On-site
Work with cross-functional partners to promote adoption of NVIDIA technologies like RAPIDS, CUDA-X ... Map and monitor the developer ecosystem to identify growth opportunities, collaborating with ...
OR ยท On-site
$172K - $204K/yr
We are seeking an engineering leader responsible for end-to-end delivery of every DGX compute ... CUDA, networking, and AI applications work together seamlessly, while driving architecture and ...
OR ยท On-site
Familiarity with GPU acceleration and NVIDIA platforms (e.g., CUDA-X libraries, and/or AI ... Demonstrated success in building and scaling developer communities within capital markets or ...
OR ยท Hybrid
$122K - $161K/yr
Contribute to CUDA kernel and operator development for critical transformer components such as ... Proficient programming ability with modern C++ (C++11/14/17 and beyond). * Familiarity with popular ...
OR ยท On-site
Proven understanding of AI and ML techniques, parallel programming techniques, and software engineering * Experience with NVIDIA's CUDA-X Platform, including SDKs like Physics NeMo, Warp, CUDA, and ...
OR ยท On-site
We are seeking a highly technical and strategic Developer Relations Manager to join our team ... Familiarity with NVIDIA's libraries and SDKs (CUDA, CUDA-X, AI) and an understanding of how GPU ...
OR ยท On-site
... Compute (CUDA, PTX, OpenCL, Fortran, C++). This team is comprised of worldwide leading compiler ... Excellent hands-on C++ programming skills applied to industry standard C++ compilers and ...
OR ยท On-site
$126K - $166K/yr
Strong technical foundation in GPU computing, CUDA, or parallel programming models * Excellent communication and presentation skills, with the ability to work across engineering, marketing, and ...
OR ยท On-site
Expertise with advanced AI and GPU-accelerated frameworks including CUDA-X, RAPIDS, TensorRT-LLM ... Demonstrated history of launching and growing developer ecosystems across federal agencies, defense ...
OR ยท On-site
$104K - $143K/yr
Experienced in parallel programming, including CUDA/OpenCL GPU programming or other parallel models such as OpenMP. * Solid understanding of computer architecture and hands-on experience with ...
$133K - $175K/yr
Collaborate with GPU architecture, CUDA, and NVVM/PTX compiler teams to provide feedback on programming models and to assess the performance of future GPU hardware features. What we need to see:
$133K - $175K/yr
Collaborate with GPU architecture, CUDA, and NVVM/PTX compiler teams to provide feedback on programming models and to assess the performance of future GPU hardware features. What we need to see:
OR ยท On-site
$104K - $143K/yr
CUDA or OpenCL programming experience is desired but not required. * Experience with the following technologies is a huge plus: XLA, TVM, MLIR, LLVM, OpenAI Triton, deep learning models and ...
OR ยท On-site
$134K - $180K/yr
Strong programming skills in Rust, C++, Python, CUDA; ability to read, modify, and optimize performance-critical code across layers. * Experience with GPU performance analysis tools and methodologies ...
CUDA toolkit, cuDNN, TensorRT, NCCL, Triton Inference Server, DCGM, and DOCA/OFED. Ensure version ... systems software engineering with hands-on experience in AI/ML workload optimization, GPU ...
Work directly with startup founders and engineering teams toarchitect and optimize AIworkloadsusing NVIDIA technologies including CUDA-X libraries, TensorRT-LLM, Triton Inference Server, NVIDIA NeMo ...
OR ยท On-site
Engage with developers to use NVIDIA's Agentic AI stack-including Nemo Microservices, Nemo Agent Toolkit, Dynamo, Nemotron models, and CUDA-accelerated pipelines-to improve performance and ...
OR ยท On-site
$122K - $161K/yr
GPU programming experience (CUDA, OAI TRITON or CUTLASS). NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams arecomposed ...
OR ยท On-site
$122K - $161K/yr
Strong programming skills in Python plus C++ and/or CUDA; ability to debug and optimize performancecritical code. * Experience with profiling and performance investigation (microbenchmarks, flame ...
$29.48 - $34.68
5% of jobs
$34.68 - $39.88
10% of jobs
$39.88 - $45.08
9% of jobs
$46.19 is the 25th percentile. Wages below this are outliers.
$45.08 - $50.28
7% of jobs
$50.28 - $55.48
15% of jobs
The median wage is $57.08 / hr.
$55.48 - $60.67
14% of jobs
$65.39 is the 75th percentile. Wages above this are outliers.
$60.67 - $65.87
17% of jobs
$65.87 - $71.07
14% of jobs
$71.07 - $76.27
6% of jobs
$76.27 - $81.47
3% of jobs
$81.47 - $86.67
0% of jobs
$29
$57
$86
| Aspect | Cuda Programming | GPU Developer |
|---|---|---|
| Required Credentials | Knowledge of CUDA, C/C++, parallel computing | Knowledge of GPU architecture, CUDA, OpenCL, C/C++ |
| Work Environment | High-performance computing, scientific research, AI | Graphics, gaming, scientific visualization, AI |
| Industry Usage | Tech companies, research labs, AI firms | Gaming, entertainment, tech, research |
While Cuda Programming focuses specifically on writing code using NVIDIA's CUDA platform for parallel processing, GPU Developers have a broader role that includes designing, optimizing, and implementing GPU-based solutions across various platforms and technologies. Both roles require knowledge of GPU architecture and programming languages like C/C++, but GPU Developers often work on a wider range of applications beyond CUDA-specific projects.

Full-time
Posted 7 days ago
DGX Station is NVIDIA's next-generation personal AI supercomputer-a deskside workstation built on the NVIDIA Grace Blackwell GB300 Superchip with massive coherent CPU+GPU memory, designed to bring data-center-class AI capabilities directly to the desks of researchers, developers, and AI engineers. As NVIDIA brings DGX Station to a broad set of customers, we need an engineer who can own full-stack OS enablement-from firmware and drivers through OS integration to ensuring AI applications run seamlessly on day one, with a primary focus on Windows and strong coverage of Linux.
This is a hands-on, technically deep role where you will be the go-to engineer for making DGX Station a first-class Windows platform while also driving its Linux bring-up and validation. You will work across NVIDIA's GPU driver, CUDA, firmware, BMC, and AI software teams, collaborate closely with Microsoft and ODM/OEM partners, and ensure that developers and enterprise customers have a polished, production-ready experience on DGX Station across both operating systems.
What you'll be doing:
Windows Platform Ownership (primary): Own end-to-end Windows enablement for DGX Station-driving the platform from initial bring-up on Windows through WHQL certification to customer-ready shipping quality. You are the single point of accountability for "DGX Station works on Windows."
Linux Bring-up & Enablement: Drive Linux bring-up and continuous enablement for DGX Station on DGX OS / Ubuntu, including kernel module integration, device tree and ACPI configuration, systemd services, initramfs, and dkms packaging. Partner with the DGX OS and kernel teams to land platform support upstream and in NVIDIA's distribution.
Firmware & Driver Enablement: Enable and validate BIOS/UEFI, BMC, and system-level firmware for Windows and Linux on the Grace (Arm) + Blackwell GB300 architecture. Work with firmware teams to ensure ACPI tables, SMBIOS, Secure Boot, measured boot, power management, and hardware abstraction layers are correct on both OSes.
GPU Driver Integration: Coordinate GPU driver, display driver, and compute driver bring-up and validation on Windows (WDDM, MCDM) and Linux (open-gpu-kernel-modules, DRM/KMS). Work with the NVIDIA driver team and Microsoft to resolve compatibility issues, achieve WHQL certification, and ensure driver stability across Windows Update and Linux kernel revisions.
CUDA & AI Stack Readiness: Ensure the CUDA toolkit, cuDNN, TensorRT, NCCL, and NVIDIA's AI SDK stack are fully functional on DGX Station on both Windows and Linux. Validate AI/DL workload performance-training, fine-tuning, and inference-and work with the CUDA team to resolve gaps on the Arm + GB300 platform.
Application Validation: Validate that NVIDIA AI applications-NIM microservices, NemoClaw, AI Workbench, and developer tools-run correctly on DGX Station across Windows and Linux. Define and implement test plans covering single-user and multi-user scenarios, container runtimes, application installation flows, and developer workflows.
System Validation & Quality: Drive the overall test strategy for DGX Station on Windows and Linux: functional testing, stress testing, power/thermal validation, sleep/resume and S-state cycles, Windows Update and Linux kernel-upgrade compatibility, and long-duration reliability. Own bug triage and resolution across firmware, BMC, driver, and OS layers.
Partner Engagement: Be the primary technical interface with Microsoft (Windows on Arm, WHQL, driver signing) and ODM/OEM partners shipping DGX Station. Coordinate schedules, resolve cross-company technical blockers, and represent NVIDIA's platform requirements on both OSes.
Performance Optimization: Profile and optimize system performance-boot time, GPU compute throughput, NVLink-C2C and memory bandwidth utilization, power efficiency, and thermal behavior. Identify bottlenecks across the stack on Windows and Linux and drive fixes with the appropriate teams.
Documentation & Enablement: Create and maintain platform documentation for DGX Station on Windows and Linux: bring-up guides, known issues, driver compatibility matrices, recovery and re-imaging procedures, and developer setup instructions. Enable field and support teams for customer deployments.
What we need to see:
BS or MS in Computer Science, Electrical Engineering, or related field (or equivalent experience) and 12+ yrs of confirmed experience in systems software engineering with deep expertise in Windows platform enablement, driver development, or OS integration, and proven hands-on experience bringing up Linux on new hardware platforms.
Strong hands-on experience with Windows internals: kernel-mode drivers, ACPI, power management, Secure Boot, UEFI, WDM/WDF driver frameworks, and the WHQL certification process.
Solid understanding of Linux platform enablement: kernel modules, device tree / ACPI on Arm, systemd, initramfs, dkms, and packaging for Ubuntu / DGX OS.
Experience with GPU driver stack, display drivers, or compute drivers on Windows and/or Linux. Familiarity with DirectX, WDDM, DRM/KMS, and GPU compute APIs is a strong plus.
Experience enabling hardware platforms-bring-up, driver integration, validation, and certification for shipping products on Windows and Linux.
Strong debugging and root-cause analysis skills across firmware, driver, and OS boundaries. Comfortable with WinDbg, kernel debugging (kd, kgdb/crash), crash dump analysis, ftrace/ETW, and performance profiling tools.
Ability to work across organizational boundaries-coordinating with GPU driver, CUDA, firmware, BMC, and AI software teams as well as external partners (Microsoft, ODM/OEMs).
Proficiency in C/C++ and Python. Experience with Arm architecture is a plus.
Ways to stand out from the crowd:
Experience with Windows on Arm platforms-driver enablement, performance optimization, or application compatibility on Arm-based Windows devices.
Hands-on experience with CUDA, TensorRT, or AI/ML frameworks on Windows and Linux-especially on Arm + NVIDIA GPU systems.
Prior experience working with OEM/ODM partners or silicon vendors on Windows and Linux platform certification for workstation- or server-class hardware.
Track record shipping workstation or server hardware products-from bring-up through general availability-with both Windows and Linux support.
Experience with BMC, Redfish, out-of-band management, or platform manageability software on high-end workstations or servers.Experience with GPU-accelerated applications: AI training and inference, content creation tools, or scientific computing on Windows and Linux.
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!
We also welcome out-of-the-box problem solvers who can provide new ideas with a strong execution bias. Expect to be constantly challenged, improving, and evolving for the better. For two decades, we have pioneered visual computing, the art and science of computer graphics. Since the creation of the GPU, the engine of modern visual computing, the field has grown. It now involves video games, movie production, product composition, medical diagnosis, and scientific research. Today, we stand at the beginning of the next era, the AI computing era, ignited by a new computing model, GPU deep learning.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Computer and electronic product manufacturing
10,000+ Employees
Santa Clara, CA, US
1993