We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing NVIDIA CUDA kernels from scratch . You will work on the GPU performance layer powering large-scale ...
Quick apply
We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing NVIDIA CUDA kernels from scratch . You will work on the GPU performance layer powering large-scale ...
Quick apply
We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing NVIDIA CUDA kernels from scratch . You will work on the GPU performance layer powering large-scale ...
San Francisco, CA · On-site +1
We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing NVIDIA CUDA kernels from scratch . You will work on the GPU performance layer powering large-scale ...
San Francisco, CA · On-site +1
We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing NVIDIA CUDA kernels from scratch . You will work on the GPU performance layer powering large-scale ...
San Francisco, CA · Remote
$80 - $120/hr
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role ... Review GPU kernel implementations to identify bottlenecks without needing extensive algorithmic ...
Quick apply
San Francisco, CA · Remote
$80 - $120/hr
CUDA Engineering Expert Type: Contract Compensation: $80-$120/hour Location: Remote Role ... Review GPU kernel implementations to identify bottlenecks without needing extensive algorithmic ...
San Francisco, CA · On-site
$190K - $250K/yr
About the role We are seeking a highly skilled GPU Kernel Engineer who is passionate about pushing ... Design, implement, and optimize custom GPU kernels using C++, PTX, CUDA, ROCm, Triton, and/or JAX ...
San Francisco, CA · On-site
$190K - $250K/yr
About the role We are seeking a highly skilled GPU Kernel Engineer who is passionate about pushing ... Design, implement, and optimize custom GPU kernels using C++, PTX, CUDA, ROCm, Triton, and/or JAX ...
Burlingame, CA · On-site
$110K - $270K/yr
Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels ... CUDA, DSP, NEON, Triton-lang * Proficiency in C/C++ and Python, experience with assembly language a ...
Burlingame, CA · On-site
$110K - $270K/yr
Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels ... CUDA, DSP, NEON, Triton-lang * Proficiency in C/C++ and Python, experience with assembly language a ...
About the Role You will develop, integrate, and optimize state-of-the-art CUDA kernels to power AI ... Working closely with researchers and engineers, you'll help make Voltai the world's leading AI ...
Quick apply
About the Role You will develop, integrate, and optimize state-of-the-art CUDA kernels to power AI ... Working closely with researchers and engineers, you'll help make Voltai the world's leading AI ...
Burlingame, CA · On-site
$110K - $270K/yr
Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels ... CUDA, DSP, NEON, Triton-lang * Proficiency in C/C++ and Python, experience with assembly language a ...
Quick apply
Burlingame, CA · On-site
$110K - $270K/yr
Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels ... CUDA, DSP, NEON, Triton-lang * Proficiency in C/C++ and Python, experience with assembly language a ...
About the Role You will develop, integrate, and optimize state-of-the-art CUDA kernels to power AI ... Working closely with researchers and engineers, you'll help make Voltai the world's leading AI ...
About the Role You will develop, integrate, and optimize state-of-the-art CUDA kernels to power AI ... Working closely with researchers and engineers, you'll help make Voltai the world's leading AI ...
$110K - $270K/yr
Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels ... CUDA, DSP, NEON, Triton-lang * Proficiency in C/C++ and Python, experience with assembly language a ...
$110K - $270K/yr
Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels ... CUDA, DSP, NEON, Triton-lang * Proficiency in C/C++ and Python, experience with assembly language a ...
Drive kernel DSL design decisions - thread spawn mechanisms, register passing conventions, and ... CUDA or equivalent accelerator programming - deep experience writing GPU kernels, understanding ...
Drive kernel DSL design decisions - thread spawn mechanisms, register passing conventions, and ... CUDA or equivalent accelerator programming - deep experience writing GPU kernels, understanding ...
Mountain View, CA · On-site
$260K - $320K/yr
Drive kernel DSL design decisions -- thread spawn mechanisms, register passing conventions, and ... CUDA or equivalent accelerator programming -- deep experience writing GPU kernels, understanding ...
Quick apply
Mountain View, CA · On-site
$260K - $320K/yr
Drive kernel DSL design decisions -- thread spawn mechanisms, register passing conventions, and ... CUDA or equivalent accelerator programming -- deep experience writing GPU kernels, understanding ...
Milpitas, CA · On-site
$121K - $167K/yr
In this role, you will drive the full stack of model optimization - from CUDA kernel engineering to quantization and compression - to deploy high-performance AI models on edge compute platforms ...
Milpitas, CA · On-site
$121K - $167K/yr
In this role, you will drive the full stack of model optimization - from CUDA kernel engineering to quantization and compression - to deploy high-performance AI models on edge compute platforms ...
Santa Clara, CA · On-site
$143K - $189K/yr
... CUDA, kernel libraries, compilers, and robotics to deliver high-performance, production-ready solutions. • Contribute to CUDA kernel and operator development for critical transformer components ...
Santa Clara, CA · On-site
$143K - $189K/yr
... CUDA, kernel libraries, compilers, and robotics to deliver high-performance, production-ready solutions. • Contribute to CUDA kernel and operator development for critical transformer components ...
San Francisco, CA · On-site
$150K - $250K/yr
You'll work at the intersection of kernel engineering and applied AI to scale up AI agents that ... Strong CUDA C experience, with hands-on work implementing or optimizing kernels for ML or other GPU ...
San Francisco, CA · On-site
$150K - $250K/yr
You'll work at the intersection of kernel engineering and applied AI to scale up AI agents that ... Strong CUDA C experience, with hands-on work implementing or optimizing kernels for ML or other GPU ...
San Francisco, CA · On-site
$150K - $250K/yr
You'll work at the intersection of kernel engineering and applied AI to scale up AI agents that ... Strong CUDA C experience, with hands-on work implementing or optimizing kernels for ML or other GPU ...
Quick apply
San Francisco, CA · On-site
$150K - $250K/yr
You'll work at the intersection of kernel engineering and applied AI to scale up AI agents that ... Strong CUDA C experience, with hands-on work implementing or optimizing kernels for ML or other GPU ...
About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the ... Develop and maintain GPU code in CUDA and C++, including low-level assembly when needed * Implement ...
About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the ... Develop and maintain GPU code in CUDA and C++, including low-level assembly when needed * Implement ...
Burlingame, CA · On-site
$120K - $160K/yr
Role The AI Kernel Engineer (New Grad) at Quadric plays a key role in enabling a large number of AI ... CUDA, DSP, NEON, or Triton-lang. * Familiarity with assembly language or compiler internals is a ...
Burlingame, CA · On-site
$120K - $160K/yr
Role The AI Kernel Engineer (New Grad) at Quadric plays a key role in enabling a large number of AI ... CUDA, DSP, NEON, or Triton-lang. * Familiarity with assembly language or compiler internals is a ...
$143K - $189K/yr
We are hiring senior engineers to work on the CUDA driver, a core component of our platform for ... Background with kernel mode development * Experience with Windows, Linux, or macOS driver ...
$143K - $189K/yr
We are hiring senior engineers to work on the CUDA driver, a core component of our platform for ... Background with kernel mode development * Experience with Windows, Linux, or macOS driver ...
$120K - $160K/yr
Role The AI Kernel Engineer (New Grad) at Quadric plays a key role in enabling a large number of AI ... CUDA, DSP, NEON, or Triton-lang. * Familiarity with assembly language or compiler internals is a ...
$120K - $160K/yr
Role The AI Kernel Engineer (New Grad) at Quadric plays a key role in enabling a large number of AI ... CUDA, DSP, NEON, or Triton-lang. * Familiarity with assembly language or compiler internals is a ...
... with CUDA, kernel, and compiler engineering teams to integrate agents with compilers, profilers, execution sandboxes, and runtimes in a safe, observable way. • We collaborate with internal and ...
... with CUDA, kernel, and compiler engineering teams to integrate agents with compilers, profilers, execution sandboxes, and runtimes in a safe, observable way. • We collaborate with internal and ...

Full-time
Medical, Dental, Vision, Retirement
Posted 13 days ago
Be an early applicant
Location: Remote US
Start date: ASAP
Languages: English (required)
About the Role
Pragmatike is hiring on behalf of a fast-growing AI startup recognized as a Top 10 GenAI company by GTM Capital, founded by MIT CSAIL researchers.
We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing NVIDIA CUDA kernels from scratch. You will work on the GPU performance layer powering large-scale, high-throughput AI systems used by Fortune 500 customers.
This role is ideal for someone who deeply understands NVIDIA GPU architecture, memory hierarchy, warp-level execution, and profiling workflowsnot someone coming from generic hardware, FPGA, or non-NVIDIA compute backgrounds. You will directly influence the GPU efficiency, throughput, and scalability of mission-critical AI systems.
What Youll Do
What Were Looking For
Bonus Points
Why This Role Will Pivot Your Career
Benefits
Pragmatike is an Equal Opportunity Employer and is committed to providing equal employment opportunities to all applicants without discrimination. We recruit on behalf of our clients and prohibit discrimination and harassment based on race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.We are committed to a fair and inclusive hiring process. We process your personal data solely for recruitment purposes, in accordance with applicable privacy laws, and maintain reasonable safeguards to protect your information. Your data may be shared with our client(s) for hiring consideration, but will not be disclosed to third parties outside of the recruitment process.