2

Remote Cuda Developer Jobs in Austin, TX (NOW HIRING)

Remote US Start date: ASAP Languages: English (required) About the Role Pragmatike is hiring on ... We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing ...

Collaborate with DevOps to ensure reproducible, portable training environments. * Write tests to ... Comfortable working in Linux GPU environments (CUDA, ROCm). * Ability to collaborate with backend ...

Collaborate with DevOps to ensure reproducible, portable training environments. Write tests to ... Comfortable working in Linux GPU environments (CUDA, ROCm). Ability to collaborate with backend ...

Collaborate with DevOps to ensure reproducible, portable training environments. * Write tests to ... Comfortable working in Linux GPU environments (CUDA, ROCm). * Ability to collaborate with backend ...

This is for a proposal and will be remote. The High Performance Computing (HPC) Engineer supports ... MPI, OpenMP, CUDA, and container-based environments. * Monitor HPC system utilization and ...

New

Remote Cuda Developer information

See Austin, TX salary details

$82.8K

$101.6K

$134.3K

How much do remote cuda developer jobs pay per year?

As of May 28, 2026, the average yearly pay for remote cuda developer in Austin, TX is $101,598.00, according to ZipRecruiter salary data. Most workers in this role earn between $89,200.00 and $114,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Remote CUDA Developer, and why are they important?

To thrive as a Remote CUDA Developer, you need strong proficiency in C/C++ programming, parallel computing concepts, and a solid understanding of GPU architecture, typically backed by a degree in computer science or a related field. Experience with NVIDIA CUDA toolkit, GPU debugging tools, and version control systems like Git is commonly required. Excellent problem-solving skills, self-motivation, and effective remote communication abilities help distinguish high performers in this role. These skills are vital for efficiently delivering high-performance computing solutions and collaborating seamlessly with distributed teams.

How does a Remote CUDA Developer typically collaborate with team members across different locations?

As a Remote CUDA Developer, you will frequently collaborate with cross-functional teams such as data scientists, software engineers, and product managers through virtual meetings, code reviews, and collaborative platforms like GitHub or GitLab. Clear communication and thorough documentation are essential since team members may be in different time zones. You can expect to participate in regular stand-ups, sprint planning, and peer programming sessions, ensuring alignment and smooth integration of your GPU-accelerated code into larger projects. Tools like Slack, Zoom, and project management platforms help maintain connectivity and workflow efficiency.

What is a Remote CUDA Developer?

A Remote CUDA Developer is a software engineer who specializes in using NVIDIA's CUDA (Compute Unified Device Architecture) platform to develop parallel computing applications, often for high-performance tasks like machine learning, scientific computing, or data analysis. They work remotely, collaborating with teams online rather than being physically present in an office. These developers write and optimize code to run efficiently on NVIDIA GPUs, enabling applications to process large amounts of data much faster than traditional CPU-only solutions.

What is the difference between Remote Cuda Developer vs Remote Machine Learning Engineer?

AspectRemote Cuda DeveloperRemote Machine Learning Engineer
Required CredentialsCUDA programming certifications, computer science degreeMachine learning certifications, data science background
Work EnvironmentSoftware development, GPU optimizationModel development, data analysis
Industry UsageHigh-performance computing, gaming, AIAI, data science, predictive modeling

Remote Cuda Developers focus on GPU programming and optimization using CUDA, primarily in high-performance computing and AI applications. Remote Machine Learning Engineers develop and deploy machine learning models, often utilizing GPU resources but with a broader focus on data and algorithms. While both roles may involve GPU expertise, Cuda Developers specialize in low-level programming, whereas Machine Learning Engineers work on model development and deployment.

What job categories do people searching Remote Cuda Developer jobs in Austin, TX look for? The top searched job categories for Remote Cuda Developer jobs in Austin, TX are:
What cities near Austin, TX are hiring for Remote Cuda Developer jobs? Cities near Austin, TX with the most Remote Cuda Developer job openings:

CUDA Kernel Engineer

PRAGMATIKE

Austin, TX โ€ข Remote

Full-time

Medical, Dental, Vision, Retirement

Posted 24 days ago


Job description

Location: Remote US
Start date: ASAP
Languages: English (required)

About the Role

Pragmatike is hiring on behalf of a fast-growing AI startup recognized as a Top 10 GenAI company by GTM Capital, founded by MIT CSAIL researchers.

We are searching for a CUDA Kernel Engineer who has hands-on experience developing and optimizing NVIDIA CUDA kernels from scratch. You will work on the GPU performance layer powering large-scale, high-throughput AI systems used by Fortune 500 customers.

This role is ideal for someone who deeply understands NVIDIA GPU architecture, memory hierarchy, warp-level execution, and profiling workflowsnot someone coming from generic hardware, FPGA, or non-NVIDIA compute backgrounds. You will directly influence the GPU efficiency, throughput, and scalability of mission-critical AI systems.

What Youll Do

  • Design, implement, and optimize custom CUDA kernels for NVIDIA GPUs, with a focus on maximizing occupancy, memory throughput, and warp efficiency.
  • Profile GPU workloads using tools such as Nsight Compute, Nsight Systems, nvprof, and CUDAโ€MEMCHECK.
  • Analyze and eliminate performance bottlenecks including warp divergence, uncoalesced memory access, register pressure, and PCIe transfer overhead.
  • Improve GPU memory pipelines (global, shared, L2, texture memory) and ensure proper memory coalescing.
  • Collaborate closely with AI systems, model acceleration, and backend distributed systems teams.
  • Contribute to GPU architecture decisions, kernel libraries, and internal performance-engineering best practices.

What Were Looking For

  • Proven track record building NVIDIA CUDA kernels from scratchnot just calling existing libraries.
  • Strong ability to optimize kernels (tiling strategies, occupancy tuning, shared memory design, warp scheduling).
  • Deep understanding of CUDA threads, warps, blocks, and grids, GPU memory hierarchy and memory coalescing, as well as warp divergence (how to detect, analyze, and mitigate it)
  • Experience diagnosing PCIe bottlenecks and optimizing host-device transfers (pinned memory, streams, batching, overlap).
  • Familiarity with C++, CUDA runtime APIs, and GPU debugging/profiling tooling.

Bonus Points

  • Experience with multi-GPU or distributed GPU systems (NCCL, NVLink, MIG).
  • Background in GPU acceleration for ML frameworks or HPC workloads.
  • Knowledge of model inference optimization (TensorRT, CUDA Graphs, CUTLASS).
  • Exposure to compiler-level optimization or PTX/SASS analysis.
  • Startup experience or comfort working in fast-moving, ambiguous environments.

Why This Role Will Pivot Your Career

  • Research pedigree: MIT CSAIL founders recognized for breakthrough AI and systems contributions.
  • Customer impact: Deploy AI solutions powering Fortune 500 clients.
  • Industry momentum: Lab alumni have led high-value acquisitions (MosaicML Databricks, Run:AI Nvidia, W&B CoreWeave).
  • Funding & growth: Oversubscribed seed round, next funding in 2026.
  • Career growth & influence: Lead AI initiatives, optimize pipelines, and directly impact production AI systems at scale.
  • Culture & autonomy: Own critical systems while collaborating with world-class engineers.
  • Aspirational impact: Solve GPU/AI performance challenges few engineers ever face.

Benefits

  • Competitive salary & equity options
  • Sign-on bonus
  • Health, Dental, and Vision
  • 401k

Pragmatike is an Equal Opportunity Employer and is committed to providing equal employment opportunities to all applicants without discrimination. We recruit on behalf of our clients and prohibit discrimination and harassment based on race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.We are committed to a fair and inclusive hiring process. We process your personal data solely for recruitment purposes, in accordance with applicable privacy laws, and maintain reasonable safeguards to protect your information. Your data may be shared with our client(s) for hiring consideration, but will not be disclosed to third parties outside of the recruitment process.