OR ยท On-site
We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is building the future of computer graphics, simulation, robotics, and embodied AI. Neural reconstruction and ...
OR ยท On-site
We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is building the future of computer graphics, simulation, robotics, and embodied AI. Neural reconstruction and ...
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role Responsibilities * Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization.
Quick apply
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role Responsibilities * Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization.
Members of this team possess deep technical expertise in our GPU architecture and programming models. We use this to develop workflows and tools for deep performance analysis capabilities, which we ...
Members of this team possess deep technical expertise in our GPU architecture and programming models. We use this to develop workflows and tools for deep performance analysis capabilities, which we ...
As a System Software Engineer on ML & Robotics Infra focused on GPU and accelerated compute, you'll own how every accelerated workload on the robot from model inference, SLAM/perception, and more ...
As a System Software Engineer on ML & Robotics Infra focused on GPU and accelerated compute, you'll own how every accelerated workload on the robot from model inference, SLAM/perception, and more ...
$164K - $202K/yr
GPU Software Engineer/GPU Architect Location: San Jose, CA Duration: Long-term >> ongoing contract Overview: We're looking for a strong GPU Software Engineer/GPU Architect to join a highimpact ...
$164K - $202K/yr
GPU Software Engineer/GPU Architect Location: San Jose, CA Duration: Long-term >> ongoing contract Overview: We're looking for a strong GPU Software Engineer/GPU Architect to join a highimpact ...
San Mateo, CA ยท On-site
$197K - $234K/yr
... GPU performance engineering), pushing the limits of what's possible with the current hardware. โข Contribute to the long-term vision for Genesis' infra platform. Qualifications : Required : โข ...
San Mateo, CA ยท On-site
$197K - $234K/yr
... GPU performance engineering), pushing the limits of what's possible with the current hardware. โข Contribute to the long-term vision for Genesis' infra platform. Qualifications : Required : โข ...
We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is building the future of computer graphics, simulation, robotics, and embodied AI. Neural reconstruction and ...
We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is building the future of computer graphics, simulation, robotics, and embodied AI. Neural reconstruction and ...
Redwood City, CA ยท On-site
$211K - $251K/yr
As a System Software Engineer on ML & Robotics Infra focused on GPU and accelerated compute, you'll own how every accelerated workload on the robot from model inference, SLAM/perception, and more ...
Redwood City, CA ยท On-site
$211K - $251K/yr
As a System Software Engineer on ML & Robotics Infra focused on GPU and accelerated compute, you'll own how every accelerated workload on the robot from model inference, SLAM/perception, and more ...
Cupertino, CA ยท On-site
$172K - $213K/yr
Apple's GGML team provides developers access to harness the power of the GPU across all of Apple's innovative products, from iPhone, iPad, Apple TV, Apple Watch to the Mac product line. Apple Silicon ...
Cupertino, CA ยท On-site
$172K - $213K/yr
Apple's GGML team provides developers access to harness the power of the GPU across all of Apple's innovative products, from iPhone, iPad, Apple TV, Apple Watch to the Mac product line. Apple Silicon ...
Redmond, WA ยท On-site
$158K - $258K/yr
We are looking for a Senior Researcher - GPU Performance - Hardware/Software Codesign researcher to ... Reliable C++ programming skills. Other Requirements: Ability to meet Microsoft, customer and/or ...
Redmond, WA ยท On-site
$158K - $258K/yr
We are looking for a Senior Researcher - GPU Performance - Hardware/Software Codesign researcher to ... Reliable C++ programming skills. Other Requirements: Ability to meet Microsoft, customer and/or ...
Kernel engineering means demonstrating mastery in designing complex, scalable systems using modern C++, coupled with a fundamental grasp of GPU architectures (HIP/CUDA), memory hierarchies, and ...
Kernel engineering means demonstrating mastery in designing complex, scalable systems using modern C++, coupled with a fundamental grasp of GPU architectures (HIP/CUDA), memory hierarchies, and ...
Strong knowledge of GPU programming models and workloads (graphics, compute, and AI inference) to translate workload characteristics into architectural innovation. * Demonstrated ability to build and ...
Strong knowledge of GPU programming models and workloads (graphics, compute, and AI inference) to translate workload characteristics into architectural innovation. * Demonstrated ability to build and ...
Cupertino, CA ยท On-site
$172K - $213K/yr
Apple Silicon GPU SW architecture team within the Media, Graphics & Compute Technologies group is seeking a senior/principal engineer to lead server-side ML acceleration and multi-node distribution ...
Cupertino, CA ยท On-site
$172K - $213K/yr
Apple Silicon GPU SW architecture team within the Media, Graphics & Compute Technologies group is seeking a senior/principal engineer to lead server-side ML acceleration and multi-node distribution ...
Experience with low-level GPU programming (CUDA, Triton, CUTLASS, etc.) and performance engineering techniques. Preferred qualifications: * Master's degree or PhD in Engineering, Computer Science, or ...
Experience with low-level GPU programming (CUDA, Triton, CUTLASS, etc.) and performance engineering techniques. Preferred qualifications: * Master's degree or PhD in Engineering, Computer Science, or ...
Strong knowledge of GPU programming models and workloads (graphics, compute, and AI inference) to translate workload characteristics into architectural innovation. * Demonstrated ability to build and ...
Strong knowledge of GPU programming models and workloads (graphics, compute, and AI inference) to translate workload characteristics into architectural innovation. * Demonstrated ability to build and ...
Strong knowledge of GPU programming models and workloads (graphics, compute, and AI inference) to translate workload characteristics into architectural innovation. * Demonstrated ability to build and ...
Strong knowledge of GPU programming models and workloads (graphics, compute, and AI inference) to translate workload characteristics into architectural innovation. * Demonstrated ability to build and ...
Apple's GGML team provides developers access to harness the power of the GPU across all of Apple's innovative products, from iPhone, iPad, Apple TV, Apple Watch to the Mac product line. Apple Silicon ...
Apple's GGML team provides developers access to harness the power of the GPU across all of Apple's innovative products, from iPhone, iPad, Apple TV, Apple Watch to the Mac product line. Apple Silicon ...
San Diego, CA ยท On-site
Engineering Group, Engineering Group > GPU ASICS Engineering General Summary: Qualcomm's GPU Research Team is looking for talented GPU architects to help advance state-of-the-art 3D GPU capabilities ...
San Diego, CA ยท On-site
Engineering Group, Engineering Group > GPU ASICS Engineering General Summary: Qualcomm's GPU Research Team is looking for talented GPU architects to help advance state-of-the-art 3D GPU capabilities ...
Hands-on experience with GPU programming and compute frameworks - CUDA, ROCm, or OpenCL - with real performance profiling and optimization work, not just running tutorials * Strong Linux systems ...
Hands-on experience with GPU programming and compute frameworks - CUDA, ROCm, or OpenCL - with real performance profiling and optimization work, not just running tutorials * Strong Linux systems ...
Santa Clara, CA ยท On-site
$203K - $240K/yr
Are you a hard-working GPU programmer? Do you want to help craft the architecture of future GPUs? A key part of NVIDIA's strength is our sophisticated software platforms and simulation environments ...
Santa Clara, CA ยท On-site
$203K - $240K/yr
Are you a hard-working GPU programmer? Do you want to help craft the architecture of future GPUs? A key part of NVIDIA's strength is our sophisticated software platforms and simulation environments ...
$39K - $48K
3% of jobs
$48K - $56.9K
3% of jobs
$56.9K - $65.9K
4% of jobs
$65.9K - $74.8K
7% of jobs
$74.8K - $83.8K
6% of jobs
$84.5K is the 25th percentile. Wages below this are outliers.
$83.8K - $92.7K
6% of jobs
The median wage is $100.8K / yr.
$92.7K - $101.7K
21% of jobs
$101.7K - $110.6K
4% of jobs
$116.4K is the 75th percentile. Wages above this are outliers.
$110.6K - $119.6K
29% of jobs
$119.6K - $128.5K
2% of jobs
$128.5K - $137.5K
13% of jobs
$39K
$101.8K
$137.5K
To thrive as a GPU Engineer, you need strong knowledge of computer architecture, proficiency in C/C++, and experience with parallel programming models such as CUDA or OpenCL, along with a degree in computer science, electrical engineering, or a related field. Familiarity with debugging tools, driver development, performance profiling utilities, and hardware simulation platforms is typically required. Excellent problem-solving abilities, attention to detail, and effective teamwork and communication skills help distinguish top candidates. These skills ensure that GPU Engineers can develop high-performance solutions, efficiently troubleshoot hardware and software issues, and collaborate successfully in multidisciplinary environments.
A GPU Engineer designs, develops, and optimizes graphics processing units (GPUs) for applications like gaming, artificial intelligence, and high-performance computing. They work on hardware architecture, driver development, and parallel computing optimizations to maximize performance. GPU Engineers collaborate with software developers, hardware designers, and researchers to improve graphics rendering, machine learning acceleration, and computational efficiency.
GPU Engineers often face challenges such as optimizing code for maximum parallel efficiency, debugging complex hardware-software interactions, and keeping pace with rapidly evolving GPU architectures. Addressing these issues typically requires a combination of deep architectural understanding, use of specialized profiling and debugging tools, and ongoing collaboration with hardware, software, and QA teams. Many companies provide ongoing training and encourage knowledge sharing within engineering teams to help individuals stay current and effectively tackle new technical hurdles. Overcoming these challenges not only sharpens technical expertise but also opens doors for career growth into architect, team lead, or principal engineer roles.

Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We are now looking for a GPU Performance Engineer for Neural Reconstruction!
NVIDIA is building the future of computer graphics, simulation, robotics, and embodied AI. Neural reconstruction and Gaussian Splatting are changing how 3D worlds are collected, represented, optimized, and rendered. These workloads push the limits of GPU computing, differentiable rendering, computer vision, and production ML systems. In this role, you will help make neural reconstruction faster, more scalable, and more reliable. You will work across PyTorch, CUDA, C++, and GPU profiling to optimize training and rendering workflows used in sophisticated 3D reconstruction systems. The ideal candidate enjoys working close to the hardware while understanding the ML and 3D vision goals behind the system.
What You'll Be Doing:
Profile end-to-end neural reconstruction workflows and identify bottlenecks across data loading, initialization, training, rendering, evaluation, and export.
Improve CUDA and PyTorch performance for Gaussian Splatting and neural reconstruction workloads, including camera/lidar data, multiview batching, large-scene rendering, and memory-sensitive training paths.
Analyze GPU performance using tools such as Nsight Systems, Nsight Compute, NVTX, PyTorch Profiler, CUDA events, and benchmark dashboards.
Optimize sparse and irregular rendering workloads, including tile-level masking/culling, sparse gradients, batching, and multi-GPU execution.
Translate high-impact Python, NumPy, or PyTorch bottlenecks into efficient CUDA/C++ or PyTorch-native implementations when appropriate.
Validate that performance improvements preserve reconstruction quality, numerical behavior, camera/lidar correctness, and production reliability.
Build repeatable benchmarks, regression tests, and profiling workflows to catch performance and quality regressions early.
Collaborate with researchers, CUDA engineers, ML engineers, and production teams to turn promising prototypes into maintainable, reviewable, production-quality code.
What We Need To See:
BS, MS, PhD, or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, Applied Math, Robotics, Computer Vision, Machine Learning, or a related field (or equivalent experience) with 12+ years of experience.
Strong programming skills in Python and C++!
Hands-on experience with PyTorch or a similar tensor/autograd framework.
Experience optimizing GPU-accelerated workloads using CUDA, C++/CUDA extensions, or related GPU programming approaches.
Practical experience with profiling and performance analysis, including root-causing CPU/GPU bottlenecks, synchronization overhead, memory pressure, kernel launch overhead, and framework-level inefficiencies.
Ability to develop benchmarks and validate that optimizations preserve correctness, numerical behavior, and user-visible quality.
Strong communication skills, including the ability to explain performance tradeoffs, risks, and results to research and engineering partners.
Ways To Stand Out From The Crowd:
Experience with Gaussian Splatting, NeRF, differentiable rendering, rasterization, neural rendering, SLAM, 3D reconstruction, or robotics/autonomous-vehicle perception pipelines.
Deep CUDA performance experience, including memory access patterns, shared memory, atomics, occupancy, launch configuration, synchronization, and numerical stability.
Experience optimizing PyTorch workloads with custom operators, fused kernels, sparse tensors, distributed training, or distributed rendering.
Familiarity with camera and lidar geometry, projection models, calibration, rolling shutter, depth rendering, or multi-sensor reconstruction.
Experience improving large production ML systems where quality metrics, training speed, memory footprint, and developer velocity must be balanced.
Widely considered to be one of the technology world's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 431,250 USD for Level 6.You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Computer and electronic product manufacturing
10,000+ Employees
Santa Clara, CA, US
1993