
Remote Kernel Developer Jobs (NOW HIRING)

GPU Software Engineer

$138K - $185K/yr

USA(Remote) Role Summary We are seeking expert-level GPU Software Engineers to support a high ... on kernel development) Key Differentiators (Critical Expectation) • This is NOT a DevOps / ...

Work on a small team of System Software and Linux Kernel Engineers to design, develop, and deploy ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...

SRE Engineer - Remote, Only W2

$58.25 - $77.50/hr

Role: SRE Engineer (OpenStack) - Remote Experience: SRE / OpenStack Platform / Private Cloud ... Linux internals, kernel tuning (RHCE-adjacent), filesystems, partitions b) Storage: LVM, SCSI ...

Software Engineer, Telco

Raleigh, NC · On-site +1

$96K - $154K/yr

Good knowledge of Linux Kernel internals and architecture * Ability to work with Linux Kernel ... For positions with Remote-US locations, the actual salary range for the position may differ based ...

Remote Employment Type: Sixteen-month Contract Company: Alloy Digital We are seeking a talented and ... Develop and maintain software on Linux-based systems, including kernel modifications, device ...


Remote Kernel Developer information


Hourly pay chart values: $43, $63, $94

How much do remote kernel developer jobs pay per hour?

As of Jun 7, 2026, the average hourly pay for remote kernel developers in the United States is $63.57, according to ZipRecruiter salary data. Most workers in this role earn between $52.88 and $66.83 per hour, depending on experience, location, and employer.
More about Remote Kernel Developer jobs
Infographic showing various Remote Kernel Developer job openings in the United States as of May 2026, with employment types broken down into 10% As Needed, 50% Full Time, 30% Part Time, and 10% Nights. Highlights an 87% Physical, 6% Hybrid, and 7% Remote job distribution, with an average salary of $132,222 per year, or $63.6 per hour.
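The annual and hourly figures quoted above can be related with a simple conversion. The sketch below assumes a full-time year of 40 hours per week for 52 weeks (2,080 hours); that convention is an assumption, since the listing does not state how the figures were derived, and the function names are illustrative only.

```python
# Assumption: a full-time year is 40 hours/week * 52 weeks = 2,080 hours.
# The listing does not say which convention it uses.
HOURS_PER_YEAR = 40 * 52


def annual_to_hourly(annual_salary: float, hours_per_year: int = HOURS_PER_YEAR) -> float:
    """Convert an annual salary to an equivalent hourly rate."""
    return annual_salary / hours_per_year


def hourly_to_annual(hourly_rate: float, hours_per_year: int = HOURS_PER_YEAR) -> float:
    """Convert an hourly rate to an equivalent annual salary."""
    return hourly_rate * hours_per_year


if __name__ == "__main__":
    # Average annual salary quoted in the infographic text.
    print(f"${annual_to_hourly(132_222):.2f}/hr")
    # Hourly range quoted for the SRE contract listing.
    print(f"${hourly_to_annual(58.25):,.0f} - ${hourly_to_annual(77.50):,.0f}/yr")
```

Under that assumption, $132,222 per year works out to roughly $63.57 per hour, which agrees with the hourly average quoted on the page.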

$138K - $185K/yr

Full-time

Posted 24 days ago


Job description

Job Title: GPU Software Engineer
Location: USA (Remote)
Role Summary
We are seeking expert-level GPU Software Engineers to support a high-visibility platform initiative within the Maya program, focused on building software tooling on top of a custom compiler and SDK.
The role involves developing, optimizing, and porting GPU kernels and AI workloads to a specialized hardware platform.
This is a critical and time-sensitive engagement with immediate onboarding expectations and long-term roadmap alignment (~18 months).
Key Responsibilities
• Develop GPU kernels for specialized hardware platforms using PyTorch/Triton frameworks
• Build software solutions leveraging custom compiler and SDK capabilities
• Design and implement kernel-level optimizations to control hardware execution behavior
• Port open-source AI/ML models to custom SDK environments
• Port and adapt high-performance computing benchmarks and stress workloads such as:
  • Linpack (High Performance Linpack)
  • BERT/benchmark-style workloads (referred to as "Babu bench")
• Develop stress testing and validation workloads aligned to hardware behavior and platform validation
• Support testing and stress testing of current and next-generation hardware platforms
• Collaborate closely with platform architects and compiler teams to enhance system capabilities

Core Technical Skills (Must-Have)
Programming & Frameworks
• Python
• C/C++ (systems-level programming)
• PyTorch
• Triton (Triton language / kernel development)
GPU & Systems Expertise
• GPU kernel development (mandatory and critical)
• Strong understanding of GPU architecture and compute optimization
• Experience with compiler-based optimizations / runtime execution layers
• Experience with custom SDKs or hardware abstraction layers
Performance & Workloads
• Experience in:
  • GEMM kernel development (matrix multiplication kernels)
  • Porting ML models to new hardware platforms
  • Performance tuning and stress testing at system level

Nice-to-Have Skills
• Experience working with custom silicon / hardware platforms
• Exposure to high-performance computing (HPC) workloads
• Familiarity with:
  • Linpack benchmarks
  • AI workload benchmarking tools
• Experience in compiler optimization ecosystems

Engagement Model & Structure
• Number of roles: 3 developers (initial hiring may start with 2)
• Location flexibility:
  • Onsite / Offshore / Hybrid mix allowed
• Timeline:
  • Immediate start required
• Duration:
  • ~18 months program duration with phased platform evolution

Interview Process
• Candidates will undergo direct technical evaluation by the program lead
• Strong preference for candidates who can showcase real implementations / past work (hands-on kernel development)

Key Differentiators (Critical Expectation)
• This is NOT a DevOps / support / debugging role
• Requires deep hands-on engineering expertise in:
  • Kernel programming
  • GPU workloads
  • ML framework internals
• Candidates must demonstrate build-level competence, not just theoretical knowledge

Success Criteria
• Ability to deliver:
  • High-performance kernels
  • Production-ready software for hardware platforms
• Successful porting of models and workloads to custom environments
• Contribution to next-generation platform readiness and validation

✅ Recommended Screening Criteria
To help you send the right candidates quickly, prioritize profiles with:
• Proven GPU kernel development experience (non-negotiable)
• Hands-on PyTorch + Triton kernel implementation
• Evidence of systems-level programming (C/C++)
• Contributions to AI infrastructure, HPC, or compiler-level work