Responsibilities : • Design and implement kernels that support high-performance long-context behavior • Ownership of kernel design, implementation, deployment, and production reliability • ...
Responsibilities : • Design and implement kernels that support high-performance long-context behavior • Ownership of kernel design, implementation, deployment, and production reliability • ...
Research Engineer, Infrastructure, Kernels
San Francisco, CA · On-site
$350K - $475K/yr
You will develop high-performance ML kernels (e.g., CUDA, CuTe, Triton), enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training large models possible.
Research Engineer, Infrastructure, Kernels
San Francisco, CA · On-site
$350K - $475K/yr
You will develop high-performance ML kernels (e.g., CUDA, CuTe, Triton), enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training large models possible.
Senior Formal Verification Engineer, GPU Kernels
Santa Clara, CA · On-site
$143K - $189K/yr
Modern AI performance relies on highly optimized GPU kernels - performance-critical code where bugs can be hard to catch and expensive to miss. NVIDIA's Deep Learning Safety Team is hiring engineers ...
Senior Formal Verification Engineer, GPU Kernels
Santa Clara, CA · On-site
$143K - $189K/yr
Modern AI performance relies on highly optimized GPU kernels - performance-critical code where bugs can be hard to catch and expensive to miss. NVIDIA's Deep Learning Safety Team is hiring engineers ...
Senior Formal Verification Engineer, GPU Kernels
$143K - $189K/yr
Modern AI performance relies on highly optimized GPU kernels - performance-critical code where bugs can be hard to catch and expensive to miss. NVIDIA's Deep Learning Safety Team is hiring engineers ...
Senior Formal Verification Engineer, GPU Kernels
$143K - $189K/yr
Modern AI performance relies on highly optimized GPU kernels - performance-critical code where bugs can be hard to catch and expensive to miss. NVIDIA's Deep Learning Safety Team is hiring engineers ...
Research Engineer, Infrastructure, Kernels
$350K - $475K/yr
You will develop high-performance ML kernels (e.g., CUDA, CuTe, Triton), enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training large models possible.
Research Engineer, Infrastructure, Kernels
$350K - $475K/yr
You will develop high-performance ML kernels (e.g., CUDA, CuTe, Triton), enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training large models possible.
Senior Software Engineer, CUTLASS Kernels
Santa Clara, CA · On-site
$143K - $189K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
Santa Clara, CA · On-site
$143K - $189K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
Durham, NC · On-site
$118K - $156K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
Durham, NC · On-site
$118K - $156K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
$121K - $160K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
$121K - $160K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
Redmond, WA · On-site
$137K - $180K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
Redmond, WA · On-site
$137K - $180K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
$133K - $175K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
$133K - $175K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
$143K - $189K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior Software Engineer, CUTLASS Kernels
$143K - $189K/yr
Optimize kernels for peak throughput on both silicon and software performance simulators. * Collaborate with teams across NVIDIA including the GPU architecture, NVVM/PTX compiler, CUDA library, and ...
Senior GenAI Research Engineer - Optimization and Kernels
San Francisco, CA · On-site
$123K - $169K/yr
As a Senior GenAI Research Engineer, you will drive performance improvements and design high-performance GPU kernels for large language model training while collaborating with a diverse team of ...
Senior GenAI Research Engineer - Optimization and Kernels
San Francisco, CA · On-site
$123K - $169K/yr
As a Senior GenAI Research Engineer, you will drive performance improvements and design high-performance GPU kernels for large language model training while collaborating with a diverse team of ...
Software Engineer - Kernels
Mountain View, CA · On-site +1
$175K - $400K/yr
Design and optimize kernels that interface directly with our hardware * Work in partnership with our ML Research and Hardware Engineering teams * Provide expertise and guidance on hardware ...
Software Engineer - Kernels
Mountain View, CA · On-site +1
$175K - $400K/yr
Design and optimize kernels that interface directly with our hardware * Work in partnership with our ML Research and Hardware Engineering teams * Provide expertise and guidance on hardware ...
Software Engineer - Kernels
Mountain View, CA · On-site
$175K - $400K/yr
Design and optimize kernels that interface directly with our hardware * Work in partnership with our ML Research and Hardware Engineering teams * Provide expertise and guidance on hardware ...
Software Engineer - Kernels
Mountain View, CA · On-site
$175K - $400K/yr
Design and optimize kernels that interface directly with our hardware * Work in partnership with our ML Research and Hardware Engineering teams * Provide expertise and guidance on hardware ...
Senior GenAI Research Engineer - Optimization and Kernels
Mountain View, CA · On-site
$166K - $225K/yr
Design, implement, and optimize high-performance GPU kernels for training workloads (e.g., attention mechanisms, custom layers, gradient computation, activation functions) targeting NVIDIA ...
Senior GenAI Research Engineer - Optimization and Kernels
Mountain View, CA · On-site
$166K - $225K/yr
Design, implement, and optimize high-performance GPU kernels for training workloads (e.g., attention mechanisms, custom layers, gradient computation, activation functions) targeting NVIDIA ...
Experience writing kernels to accelerate Neural Network execution on custom hardware accelerators (not on CPU's) * Design, prototype, and execute low-level, adaptable C++ programs (kernels) for ...
Quick apply
Experience writing kernels to accelerate Neural Network execution on custom hardware accelerators (not on CPU's) * Design, prototype, and execute low-level, adaptable C++ programs (kernels) for ...
Senior ML Accelerator Engineer - GPU
Sunnyvale, CA · On-site
$128K - $261K/yr
For the AI Kernels & Compilers team, that mission shows up in the details: turning cutting-edge perception, prediction, and planning research into production-grade software that can run efficiently ...
Senior ML Accelerator Engineer - GPU
Sunnyvale, CA · On-site
$128K - $261K/yr
For the AI Kernels & Compilers team, that mission shows up in the details: turning cutting-edge perception, prediction, and planning research into production-grade software that can run efficiently ...
Staff Software Engineer - GenAI Performance and Kernel
San Francisco, CA · On-site
$164K/yr
In this role, you will own the design and optimization of high-performance GPU kernels for GenAI inference, leading development and mentoring others in performance engineering. Responsibilities : • ...
Staff Software Engineer - GenAI Performance and Kernel
San Francisco, CA · On-site
$164K/yr
In this role, you will own the design and optimization of high-performance GPU kernels for GenAI inference, leading development and mentoring others in performance engineering. Responsibilities : • ...
Senior Neural Network Kernel Software Development Engineer
San Francisco, CA · On-site
$110K - $140K/yr
Experience writing kernels to accelerate Neural Network execution on custom hardware accelerators (not on CPU's) * Design, prototype, and execute low-level, adaptable C++ programs (kernels) for ...
Quick apply
Senior Neural Network Kernel Software Development Engineer
San Francisco, CA · On-site
$110K - $140K/yr
Experience writing kernels to accelerate Neural Network execution on custom hardware accelerators (not on CPU's) * Design, prototype, and execute low-level, adaptable C++ programs (kernels) for ...
Experience writing kernels to accelerate Neural Network execution on custom hardware accelerators (not on CPU's) * Design, prototype, and execute low-level, adaptable C++ programs (kernels) for ...
Quick apply
Experience writing kernels to accelerate Neural Network execution on custom hardware accelerators (not on CPU's) * Design, prototype, and execute low-level, adaptable C++ programs (kernels) for ...
Kernels information
See salary details
$132K - $135.8K
0% of jobs
$135.8K - $139.5K
0% of jobs
$139.5K - $143.3K
0% of jobs
$143.3K - $147.1K
0% of jobs
$147.1K - $150.9K
0% of jobs
$150.9K - $154.6K
0% of jobs
$154.6K - $158.4K
0% of jobs
$158.4K - $162.2K
0% of jobs
$162.2K - $166K
0% of jobs
$166K - $169.7K
0% of jobs
$170.7K is the 25th percentile. Wages below this are outliers.
$169.7K - $173.5K
100% of jobs
$132K
$173.5K
How much do kernels jobs pay per year?
What are some common challenges faced when working as a Kernel Developer, and how can new team members best overcome them?
What are the key skills and qualifications needed to thrive as a Kernel Engineer, and why are they important?
What is the difference between Kernels vs Network Administrators?
| Aspect | Kernels | Network Administrators |
|---|---|---|
| Required Credentials | Knowledge of operating systems, programming, Linux/Unix | Networking certifications (e.g., CCNA), IT experience |
| Work Environment | System-level development, OS configuration | Network setup, maintenance, troubleshooting |
| Industry Usage | Software development, OS design | IT services, corporate networks |
While Kernels focus on developing and maintaining core operating system components, Network Administrators manage and troubleshoot network infrastructure. Both roles require technical expertise but differ in scope and daily tasks, with Kernels working at the system level and Network Administrators handling network connectivity and security.
What are kernels in computing?

Job description
Magic is on a mission to build safe AGI to accelerate humanity's progress on critical problems. They are seeking a Kernel Engineer to design, implement, and maintain high-performance kernels aimed at optimizing throughput and latency during training and inference.
Responsibilities:
• Design and implement kernels that support high-performance long-context behavior
• Ownership of kernel design, implementation, deployment, and production reliability
• Focus on robustness, extensive testing, and functional correctness, while pushing on performance
• Evaluate porting Magic’s compute kernels to alternative hardware options
• Co-design kernels with understanding and interaction with training, inference, and RL teams
Qualifications:
Required:
• Low-level programming experience targeting AI accelerators such as NVIDIA Blackwell or Google TPUs
• Develop and optimize GPU kernels in frameworks such as NCCL, MSCCLPP, CUTLASS, CuTeDSL, Triton, Quack, Flash-Attention, and similar frameworks
• Experience in other kernel authoring frameworks such as Pallas/Mosaic (GPU or TPU), or Mojo also maps well to the work on Magic's kernel team
• Strong depth over shallow breadth: for kernel engineering, we prefer candidates with deep expertise in computer architecture, low-level machine optimizations, and code generation, with breadth across ML
• Agility, ownership mindset, and grit
Company:
Magic is an AI coding startup that enables developers to work with AI to find code for building apps. Founded in 2022, the company is headquartered in San Francisco, USA, with a team of 51-200 employees. The company is currently Growth Stage.