Extend important CUDA programming models and functionality such as CUDA Graphs * Explore ways to use Graphs to improve the scheduling of AI/ML workloads on our GPUS to be more efficient and faster.
Extend important CUDA programming models and functionality such as CUDA Graphs * Explore ways to use Graphs to improve the scheduling of AI/ML workloads on our GPUS to be more efficient and faster.
Senior Software Engineer, CUDA Deep Learning Systems
Santa Clara, CA · On-site
$143K - $189K/yr
... Python programming. • Solid background in the fundamentals of Deep Learning with a focus on ... CUDA programming and kernel optimization. • A strong analytical approach with experience using ...
Senior Software Engineer, CUDA Deep Learning Systems
Santa Clara, CA · On-site
$143K - $189K/yr
... Python programming. • Solid background in the fundamentals of Deep Learning with a focus on ... CUDA programming and kernel optimization. • A strong analytical approach with experience using ...
Principal Engineer, CUDA UMD - GPU Kernel Scheduling
Santa Clara, CA · On-site
$158K - $212K/yr
Extend important CUDA programming models and functionality such as CUDA Graphs * Explore ways to use Graphs to improve the scheduling of AI/ML workloads on our GPUS to be more efficient and faster.
Principal Engineer, CUDA UMD - GPU Kernel Scheduling
Santa Clara, CA · On-site
$158K - $212K/yr
Extend important CUDA programming models and functionality such as CUDA Graphs * Explore ways to use Graphs to improve the scheduling of AI/ML workloads on our GPUS to be more efficient and faster.
Senior Software Engineer, CUDA UMD - Graphs and GPU Sharing
Santa Clara, CA · On-site
$143K - $189K/yr
Extend important CUDA programming models and functionality such as CUDA Graphs and MPS (Multi-Process Service) * Write effective, maintainable, and well-tested code * Develop code for multiple ...
Senior Software Engineer, CUDA UMD - Graphs and GPU Sharing
Santa Clara, CA · On-site
$143K - $189K/yr
Extend important CUDA programming models and functionality such as CUDA Graphs and MPS (Multi-Process Service) * Write effective, maintainable, and well-tested code * Develop code for multiple ...
Senior Software Engineer, CUDA UMD - Graphs and GPU Sharing
Santa Clara, CA · On-site
$143K - $189K/yr
Extend important CUDA programming models and functionality such as CUDA Graphs and MPS (Multi-Process Service) * Write effective, maintainable, and well-tested code * Develop code for multiple ...
Senior Software Engineer, CUDA UMD - Graphs and GPU Sharing
Santa Clara, CA · On-site
$143K - $189K/yr
Extend important CUDA programming models and functionality such as CUDA Graphs and MPS (Multi-Process Service) * Write effective, maintainable, and well-tested code * Develop code for multiple ...
Strong proficiency in C++ and Python programming. * Solid background in the fundamentals of Deep ... Hands-on experience with CUDA, communication libraries (e.g., NCCL, MPI, UCX) and distributed ...
Strong proficiency in C++ and Python programming. * Solid background in the fundamentals of Deep ... Hands-on experience with CUDA, communication libraries (e.g., NCCL, MPI, UCX) and distributed ...
Senior Software Engineer, CUDA Deep Learning Systems
Santa Clara, CA · On-site
$143K - $189K/yr
Strong proficiency in C++ and Python programming. * Solid background in the fundamentals of Deep ... Hands-on experience with CUDA, communication libraries (e.g., NCCL, MPI, UCX) and distributed ...
Senior Software Engineer, CUDA Deep Learning Systems
Santa Clara, CA · On-site
$143K - $189K/yr
Strong proficiency in C++ and Python programming. * Solid background in the fundamentals of Deep ... Hands-on experience with CUDA, communication libraries (e.g., NCCL, MPI, UCX) and distributed ...
CUDA Kernel Optimization Specialist
San Francisco, CA · Remote
$80 - $120/hr
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role Responsibilities * Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization.
Quick apply
CUDA Kernel Optimization Specialist
San Francisco, CA · Remote
$80 - $120/hr
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role Responsibilities * Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization.
Senior Software Engineer, CUDA Core Libraries
Santa Clara, CA · On-site
$143K - $189K/yr
Strong programming skills in C++, Python, or both, with proven interest in systems-level software ... Hands-on experience with CUDA C++, CUDA Python, PyTorch, JAX, Numba, CuPy, or similar GPU ...
Senior Software Engineer, CUDA Core Libraries
Santa Clara, CA · On-site
$143K - $189K/yr
Strong programming skills in C++, Python, or both, with proven interest in systems-level software ... Hands-on experience with CUDA C++, CUDA Python, PyTorch, JAX, Numba, CuPy, or similar GPU ...
Senior Software Engineer, CUDA Core Libraries
$143K - $189K/yr
At the core of this platform are the CUDA Core Libraries. C++ and Python libraries that enable ... Strong programming skills inC++, Python, or both, with proven interest in systems-level software ...
Senior Software Engineer, CUDA Core Libraries
$143K - $189K/yr
At the core of this platform are the CUDA Core Libraries. C++ and Python libraries that enable ... Strong programming skills inC++, Python, or both, with proven interest in systems-level software ...
CUDA Kernel Optimization Specialist - AI Trainer
San Francisco, CA · Remote
$80 - $120/hr
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role Responsibilities * Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization.
Quick apply
CUDA Kernel Optimization Specialist - AI Trainer
San Francisco, CA · Remote
$80 - $120/hr
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role Responsibilities * Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization.
Embedded AI Engineer
Sunnyvale, CA · On-site
$156K - $206K/yr
... with CUDA programming and PyTorch framework • In-depth knowledge of deep learning models, particularly Large Language Models (LLMs) • Proficiency in C++ and Python programming languages • ...
Embedded AI Engineer
Sunnyvale, CA · On-site
$156K - $206K/yr
... with CUDA programming and PyTorch framework • In-depth knowledge of deep learning models, particularly Large Language Models (LLMs) • Proficiency in C++ and Python programming languages • ...
Strong CUDA programming skills with production kernel development * Deep understanding of GPU architecture (memory hierarchy, SMs, warps) * Track record of achieving significant performance ...
Quick apply
Strong CUDA programming skills with production kernel development * Deep understanding of GPU architecture (memory hierarchy, SMs, warps) * Track record of achieving significant performance ...
Strong CUDA programming skills with production kernel development * Deep understanding of GPU architecture (memory hierarchy, SMs, warps) * Track record of achieving significant performance ...
Strong CUDA programming skills with production kernel development * Deep understanding of GPU architecture (memory hierarchy, SMs, warps) * Track record of achieving significant performance ...
Senior / Staff AI Research Engineer, Real-Time Inference
Milpitas, CA · On-site
$121K - $167K/yr
Deep expertise in CUDA programming, GPU architecture, and low-level kernel optimization, including custom kernel authoring with tools such as Triton. * Hands-on experience with model quantization ...
Senior / Staff AI Research Engineer, Real-Time Inference
Milpitas, CA · On-site
$121K - $167K/yr
Deep expertise in CUDA programming, GPU architecture, and low-level kernel optimization, including custom kernel authoring with tools such as Triton. * Hands-on experience with model quantization ...
The CUDA programming language defines a unified programming model across a range of system configurations and hardware capabilities. The compiler is responsible for translating parallel programs ...
The CUDA programming language defines a unified programming model across a range of system configurations and hardware capabilities. The compiler is responsible for translating parallel programs ...
The CUDA programming language defines a unified programming model across a range of system configurations and hardware capabilities. The compiler is responsible for translating parallel programs ...
The CUDA programming language defines a unified programming model across a range of system configurations and hardware capabilities. The compiler is responsible for translating parallel programs ...
The CUDA programming language defines a unified programming model across a range of system configurations and hardware capabilities. The compiler is responsible for translating parallel programs ...
The CUDA programming language defines a unified programming model across a range of system configurations and hardware capabilities. The compiler is responsible for translating parallel programs ...
Senior Research Engineer - AI Coding Tools
Santa Clara, CA · On-site
$143K - $189K/yr
Generate, curate, and validate synthetic training and evaluation data for CUDA programming * Deliver "net new knowledge" to frontier LLMs through RAG and skill-based systems that keep models current ...
Senior Research Engineer - AI Coding Tools
Santa Clara, CA · On-site
$143K - $189K/yr
Generate, curate, and validate synthetic training and evaluation data for CUDA programming * Deliver "net new knowledge" to frontier LLMs through RAG and skill-based systems that keep models current ...
Deep expertise in CUDA programming, GPU architecture, and low-level kernel optimization, including custom kernel authoring with tools such as Triton. * Hands-on experience with model quantization ...
Deep expertise in CUDA programming, GPU architecture, and low-level kernel optimization, including custom kernel authoring with tools such as Triton. * Hands-on experience with model quantization ...
Cuda Programming information
See California salary details
$27.52 - $32.37
5% of jobs
$32.37 - $37.22
10% of jobs
$37.22 - $42.08
9% of jobs
$43.12 is the 25th percentile. Wages below this are outliers.
$42.08 - $46.93
7% of jobs
$46.93 - $51.78
15% of jobs
The median wage is $53.28 / hr.
$51.78 - $56.63
14% of jobs
$61.03 is the 75th percentile. Wages above this are outliers.
$56.63 - $61.49
17% of jobs
$61.49 - $66.34
14% of jobs
$66.34 - $71.19
6% of jobs
$71.19 - $76.05
3% of jobs
$76.05 - $80.90
0% of jobs
$27
$53
$80
How much do cuda programming jobs pay per hour?
What is the difference between Cuda Programming vs GPU Developer?
| Aspect | Cuda Programming | GPU Developer |
|---|---|---|
| Required Credentials | Knowledge of CUDA, C/C++, parallel computing | Knowledge of GPU architecture, CUDA, OpenCL, C/C++ |
| Work Environment | High-performance computing, scientific research, AI | Graphics, gaming, scientific visualization, AI |
| Industry Usage | Tech companies, research labs, AI firms | Gaming, entertainment, tech, research |
While Cuda Programming focuses specifically on writing code using NVIDIA's CUDA platform for parallel processing, GPU Developers have a broader role that includes designing, optimizing, and implementing GPU-based solutions across various platforms and technologies. Both roles require knowledge of GPU architecture and programming languages like C/C++, but GPU Developers often work on a wider range of applications beyond CUDA-specific projects.

$158K - $212K/yr
Full-time
Posted 6 days ago
Job description
NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing - with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. We're looking to grow our company, and form teams with the smartest people in the world. Join us at the forefront of technological advancement.
Are you a motivated system software engineer with a deep understanding of device drivers who has phenomenal C/C++ skills? If so, this role might be for you. We are looking for a seasoned software professional to work on the CUDA Driver, a core component of our platform for accelerating general purpose computation on the GPU. You will be an integral part of a team that delivers features and improvements to better realize the potential of NVIDIA hardware for a growing range of computational workloads, ranging from deep learning, scientific computation, data science and self-driving cars to video games and virtual reality.
What you'll be doing:
As a member of our team, you will use your design abilities, coding expertise, and creativity to deliver the best compute platform in the world. You will craft elegant solutions to exciting problems and shape the future direction of CUDA as you collaborate with your peers across NVIDIA.
Evangelize, architect, and implement new features
Coordinate and drive development efforts across multiple teams
Help define forward-looking improvements to the CUDA APIs and programming model
Extend important CUDA programming models and functionality such as CUDA Graphs
Explore ways to use Graphs to improve the scheduling of AI/ML workloads on our GPUS to be more efficient and faster.
Write effective, maintainable, and well-tested code
Develop code for multiple operating systems
What we need to see:
BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience)
Strong C and C++ programming skills
Minimum of 15+ years of related development experience (multiple positions for varying experience levels open)
Experience driving projects across multiple teams
Experience working with large codebases
Background with operating system interfaces for threads, process control, and virtual memory
Experience writing and debugging multithreaded programs
Good written communication as well as presentation skills
Ways to stand out from the crowd:
Prior experience with parallel computing - preferably writing CUDA Programs or Libraries that use CUDA
Understanding of system level architecture, such as interconnects, memory hierarchy, interrupts, and memory-mapped IO
Knowledge of memory coherence and consistency models
Background with kernel mode development
Experience with Linux Systems Software development as well as experience maintaining and extending programming models or higher-level language support for similar environments
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.About Nvidia
Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Santa Clara, CA, US
Year founded
1993