1

Day Cuda Programmer Jobs (NOW HIRING)

Software Engineer

Cardiff By The Sea, CA · On-site +1

$87K - $157K/yr

Translate and enhance existing code for GPU/CUDA acceleration and parallel/distributed execution ... days with an anticipated close date of no earlier than 3 days after the original posting date as ...

Software Engineer

Rancho Santa Fe, CA · On-site +1

$87K - $157K/yr

Translate and enhance existing code for GPU/CUDA acceleration and parallel/distributed execution ... days with an anticipated close date of no earlier than 3 days after the original posting date as ...

Software Engineer

Chula Vista, CA · On-site +1

$87K - $157K/yr

Translate and enhance existing code for GPU/CUDA acceleration and parallel/distributed execution ... days with an anticipated close date of no earlier than 3 days after the original posting date as ...

Software Engineer

La Jolla, CA · On-site +1

$87K - $157K/yr

Translate and enhance existing code for GPU/CUDA acceleration and parallel/distributed execution ... days with an anticipated close date of no earlier than 3 days after the original posting date as ...

Software Engineer

El Cajon, CA · On-site +1

$87K - $157K/yr

Translate and enhance existing code for GPU/CUDA acceleration and parallel/distributed execution ... days with an anticipated close date of no earlier than 3 days after the original posting date as ...

Software Engineer

San Diego, CA · On-site +1

$87K - $157K/yr

Translate and enhance existing code for GPU/CUDA acceleration and parallel/distributed execution ... days with an anticipated close date of no earlier than 3 days after the original posting date as ...

Software Engineer

National City, CA · On-site +1

$87K - $157K/yr

Translate and enhance existing code for GPU/CUDA acceleration and parallel/distributed execution ... days with an anticipated close date of no earlier than 3 days after the original posting date as ...

Senior HPC Cluster Engineer

Austin, TX · On-site

$103K - $142K/yr

... and day to day operation through automation. • Provide technical leadership and strategic ... Preferred : • Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf benchmarking. • ...

Senior HPC Cluster Engineer

Redmond, WA · On-site

$117K - $160K/yr

... and day to day operation through automation. • Provide technical leadership and strategic ... Preferred : • Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf benchmarking. • ...

Senior HPC Cluster Engineer

Redmond, WA · On-site

$117K - $160K/yr

... and day to day operation through automation. • Provide technical leadership and strategic ... Preferred : • Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf benchmarking. • ...

Senior HPC Cluster Engineer

Santa Clara, CA · On-site

$122K - $168K/yr

... and day to day operation through automation. • Provide technical leadership and strategic ... Preferred : • Background with NVIDIA GPUs, CUDA Programming, NCCL and MLPerf benchmarking. • ...

next page

Showing results 1-20

Day Cuda Programmer information

See salary details

$12

$39

$68

How much do day cuda programmer jobs pay per hour?

As of Jun 20, 2026, the average hourly pay for day cuda programmer in the United States is $39.54, according to ZipRecruiter salary data. Most workers in this role earn between $25.72 and $51.44 per hour, depending on experience, location, and employer.
More about Day Cuda Programmer jobs
What cities are hiring for Day Cuda Programmer jobs? Cities with the most Day Cuda Programmer job openings:
What are the most commonly searched types of Cuda Programmer jobs? The most popular types of Cuda Programmer jobs are:
What states have the most Day Cuda Programmer jobs? States with the most job openings for Day Cuda Programmer jobs include:
Infographic showing various Day Cuda Programmer job openings in the United States as of June 2026, with employment types broken down into 8% Internship, 84% Full Time, 4% Part Time, and 4% Temporary. Highlights an 84% In-person, 8% Hybrid, and 8% Remote job distribution, with an average salary of $82,234 per year, or $39.5 per hour.

AI Infrastructure & Experience Engineer

DGN Technologies

Mountain View, CA • On-site

$202K - $240K/yr

Other

This job post has expired 1 day ago. Applications are no longer accepted.


Job description

Job Category: Technical

Job Title: AI Infrastructure & Experience Engineer

Duties: Key Responsibilities

Inference Optimization: Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments.

Systems Engineering & CUDA: Leverage deep knowledge of the CUDA environment to build custom kernels, ensuring maximum utilization of the low cost GPU compute.

Orchestration & Integration: Seamlessly bridge inference backends with orchestration layers (LiteLLM, Ollama, etc.) and frontends like OpenWebUI.

Rapid Prototyping: Build functional, high-fidelity demos showcasing model memory capabilities, agentic workflows, and context-aware web search.

Peripheral Connectivity: Implement communication protocols to bridge local AI compute with peripheral devices, including smart TVs, household appliances, and XR hardware.

Skills: Technical Qualifications

Recent experience in model optimization required Hardware & Compute: Proven experience with NVIDIA eco-systems and ARM64 architecture.

Systems Programming: Advanced proficiency in C++, Python, and Rust. Deep familiarity with CUDA and the ability to author/debug custom CUDA kernels for compute-intensive tasks.

AI/ML Frameworks: Extensive experience with modern inference engines (llama.cpp, TensorRT-LLM, Ollama) and orchestration frameworks (LiteLLM).

Software Engineering: Robust understanding of asynchronous programming (FastAPI), containerization (Docker/Kubernetes), sandbox environments, and API design for low-latency communication.

Full-Stack Prototyping: Ability to quickly spin up modern frontend UIs (React, Next.js, or similar) to present AI-driven intelligence to end users.

Communication Protocols: Familiarity with WebSockets, gRPC, and REST for device-to-device communication in a local network environment.

Keywords:

Education: Ideal Candidate Profile

The "Builder" Mindset: You are energized by the prospect of building proofs-of-concept in days rather than months. You thrive in environments where speed and creativity are paramount.

Problem Solver: You approach unsolved, messy engineering challenges with enthusiasm rather than trepidation.

Architectural Vision: You see the "big picture" of how AI becomes part of the consumer's daily life, not just how the model generates text.

Agile & Adaptable: You are comfortable working in a fast-paced environment where priorities shift based on the results of rapid experimentation.

Degree in Computer Science, Machine Learning or Artificial Intelligence Specialization preferred, but not required

3 years of relevant industry experience required

Skills and Experience:

Required Skills: 

INFERENCE OPTIMIZATION

NVIDIA ECOSYSTEMS

CUSTOM CUDA KERNEL DEVELOPMENT

ARM64 ARCHITECTURE

PYTHON

Additional Skills:

RUST

CUDA

MODERN INFERENCE ENGINES

LLAMA.CPP

TENSORRT-LLM

OLLAMA

ORCHESTRATION FRAMEWORKS

LITELLM

ASYNCHRONOUS PROGRAMMING

FASTAPI

CONTAINERIZATION

DOCKER

KUBERNETES

SANDBOX ENVIRONMENTS

API DESIGN

LOW-LATENCY COMMUNICATION

FRONTEND UI DEVELOPMENT

REACT

NEXT.JS

WEBSOCKETS

GRPC

REST

DEVICE-TO-DEVICE COMMUNICATION

PROBLEM SOLVING

ARCHITECTURAL VISION

AGILITY

ADAPTABILITY

Languages:

English

                Read

                Write

                Speak

Minimum Degree Required: Bachelor's Degree

Patents: No

Publications: No

Veteran Status: No

# of Positions: 1