OR · On-site
$122K - $161K/yr
At the core of this platform are the CUDA Core Libraries: C++ and Python libraries that enable developers to write fast, reliable, and scalable GPU-accelerated software! We are hiring a full-time ...
OR · On-site
$122K - $161K/yr
NVIDIA Nsight Compute helps CUDA engineers around the world to innovate in Artificial Intelligence (AI) and High Performance Computing. Join our team and help develop groundbreaking performance tools ...
OR · On-site
$122K - $161K/yr
Familiarity with deep learning accelerator architectures such as the GPU and hands-on experience with CUDA programming and kernel optimization. * A strong analytical approach with experience using ...
OR · On-site
CUDA-Q is the open-source programming framework bridging classical accelerated computing and quantum processors, to enable fault-tolerant quantum-GPU supercomputing. This role sits where quantum ...
OR · On-site
$122K - $161K/yr
We are hiring software engineers for the CUDA Tile team. NVIDIA GPUs are at the center of the deep learning revolution and continue to enable breakthroughs in generative AI, large language models ...
OR · On-site
$122K - $161K/yr
We are looking for a motivated Deep Learning engineer to bring advanced CUDA features and Distributed Runtime technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX, etc. You will ...
OR · On-site
NVIDIA seeks a Developer Relations Manager to lead our work in architecting impactful usage and adoption of our core CUDA Math Libraries. We are interested in finding a leader in high performance ...
OR · On-site
Join us in developing the CUDA-Q platform for programming powerful hybrid quantum-classical multi-processor systems. We are looking for a dedicated engineer with expertise building extensible ...
OR · On-site
$122K - $161K/yr
NVIDIA is seeking a Senior Software Engineer, NCCL and CUDA specialization to join our Cloud Service Provider (CSP) Engagements team, focusing on ML software stack functionality and performance for ...
OR · On-site
$122K - $161K/yr
We are hiring software engineers to work on the CUDA driver for Windows. CUDA is NVIDIA's platform for accelerating general purpose computation on the GPU. Our team delivers features and improvements ...
OR · On-site
$122K - $161K/yr
Help define forward-looking improvements to the CUDA APIs and programming model * Write effective, maintainable, and well-tested code * Develop code for multiple operating systems What we need to see ...
You will work across PyTorch, CUDA, C++, and GPU profiling to optimize training and rendering ... Collaborate with researchers, CUDA engineers, ML engineers, and production teams to turn promising ...
OR · On-site
$104K - $143K/yr
We are seeking a talented Deep Learning Compiler & Tools Engineer focused on CUDA Tile (Performance & Infrastructure) to join our team. You will collaborate closely with compiler developers ...
OR · On-site
$122K - $161K/yr
Develop advanced C++/CUDA libraries and algorithms for speed-of-light performance * Remove ... Exceptional C++ programming skills NVIDIA is widely considered to be one of the technology world ...
OR · On-site
$122K - $161K/yr
Work with NVIDIA GPU Architecture and CUDA Programming model teams to build abstractions to expose new GPU features in portable and performant ways in PTX ISA. PTX Compiler (PTXAS) apart from ...
OR · On-site
$104K - $143K/yr
Excellent C++, Python, and CUDA programming skills * Strong collaboration, communication, and documentation habits and ideally experience with working in a globally distributed organization Ways to ...
Hillsboro, OR · On-site
$133K - $175K/yr
Experience with CUDA, OpenCL, HIP, SYCL, Mojo, Pallas, Triton, Mosaic, Halide, or any general-purpose or domain-specific programming language targeting highly parallel accelerators. * Deep ...
OR · On-site
$104K - $143K/yr
Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf benchmarking * Experience with Machine Learning and Deep Learning concepts, algorithms and models * Familiarity with InfiniBand with ...
OR · On-site
$121K - $163K/yr
... CUDA programming skills * Strong understanding of fundamental numerical methods, dense and sparse array computing * Deep familiarity with Python numerical computing libraries (e.g. NumPy, SciPy ...
Hourly wage distribution:
$12.71 - $18.16: 4% of jobs
$18.16 - $23.61: 9% of jobs
$23.61 - $29.07: 17% of jobs
$29.07 - $34.52: 13% of jobs
$34.52 - $39.97: 13% of jobs
$39.97 - $45.42: 10% of jobs
$45.42 - $50.88: 9% of jobs
$50.88 - $56.33: 9% of jobs
$56.33 - $61.78: 7% of jobs
$61.78 - $67.24: 6% of jobs
$67.24 - $72.69: 4% of jobs
$27.53 is the 25th percentile. Wages below this are outliers.
The median wage is $37.70 / hr.
$51.90 is the 75th percentile. Wages above this are outliers.
CUDA Programmers often encounter challenges related to optimizing code performance and efficiently managing memory on GPU architectures. Debugging and profiling can be complex, as issues may arise from both the code and hardware-specific elements, requiring close attention to parallelization and bottlenecks. Collaboration is key, as you’ll typically work closely with software engineers, data scientists, or researchers to integrate and optimize code for specialized workflows. Successfully navigating these challenges helps drive significant performance improvements and innovation in high-performance computing applications.
To thrive as a CUDA Programmer, you need strong programming skills in C/C++ and parallel computing, with a solid understanding of GPU architectures and CUDA development. Familiarity with CUDA libraries, performance profiling tools, and platforms like NVIDIA Nsight or Visual Studio is often required, while certifications from NVIDIA can be advantageous. Problem-solving abilities, attention to detail, and effective teamwork and communication skills help set candidates apart. These competencies ensure you can optimize complex algorithms, work efficiently on high-performance computing projects, and collaborate smoothly with multidisciplinary teams.
A CUDA Programmer develops high-performance parallel computing applications using NVIDIA's CUDA (Compute Unified Device Architecture) framework. They optimize algorithms to run efficiently on GPUs, accelerating tasks such as machine learning, scientific simulations, and real-time data processing. This role requires proficiency in C/C++, an understanding of GPU architectures, and experience with parallel computing concepts to maximize performance.

$122K - $161K/yr
Full-time
Posted 8 days ago
NVIDIA's accelerated computing platform is the foundation of modern HPC and AI. At the core of this platform are the CUDA Core Libraries: C++ and Python libraries that enable developers to write fast, reliable, and scalable GPU-accelerated software! We are hiring a full-time Software Engineer to work on the CUDA Core Libraries that power GPU computing for both C++ and Python developers. This includes projects such as CCCL (Thrust, CUB, libcudacxx), cuda-python, and numba-cuda. You will join the team building the foundational libraries, algorithms, and language/runtime infrastructure that make CUDA a speed-of-light experience for developers across deep learning, scientific computing, and data analytics!
What you'll be doing:
Develop and implement CUDA Core Libraries in C++ and/or Python, including parallel algorithms and idiomatic language bindings for core CUDA functionality.
Compose, optimize, and evolve GPU algorithms and APIs, from high-level interfaces down to low-level performance tuning involving memory, parallelism, and synchronization.
Own features end-to-end: design, implementation, testing, benchmarking, documentation, and long-term maintenance.
Improve developer experience across the stack: CI, tests, benchmarks, packaging, examples, and docs.
Collaborate with senior CUDA engineers in design reviews, code reviews, and open-source-style workflows.
Engage with real users through issues, performance investigations, and API feedback.
What we need to see:
BS, MS, or PhD in Computer Science, Computer Engineering, or a related field, or equivalent experience.
Minimum of 8 years of related development experience.
Strong programming skills in C++, Python, or both, with proven interest in systems-level software (performance, memory, concurrency, API design).
Solid understanding of modern C++ (templates, generics, standard library) and/or Python library development and packaging.
Practical experience with parallel or heterogeneous programming (CUDA, OpenMP, GPU-accelerated Python, or similar).
Experience contributing to production software or open-source libraries, including testing, profiling, and code review.
Ability to work independently, scope problems, and drive projects to completion.
Clear written communication for technical design and documentation.
Comfort navigating large, multi-language codebases (C++, Python, CMake, Pixi, CI systems).
Ways to stand out from the crowd:
Strong understanding of CPU/GPU architecture and how hardware details affect performance.
Hands-on experience with CUDA C++, CUDA Python, PyTorch, JAX, Numba, CuPy, or similar GPU-accelerated stacks.
Familiarity with Thrust, CUB, libcudacxx, or other modern C++/GPU libraries.
Experience with compiler infrastructure or tooling (LLVM, Clang tooling, MLIR).
Demonstrated interest in developer tools, library design, and making other developers faster.
If you care deeply about performance, enjoy working at the C++/Python boundary, and want to shape the core CUDA libraries relied on by thousands of developers, this role is a direct fit.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Computer and electronic product manufacturing
10,000+ Employees
Santa Clara, CA, US
1993