2

Remote Cuda Developer Jobs (NOW HIRING)

Senior Software Engineer

Arlington, VA ยท On-site +1

$141K - $185K/yr

Description Type: Full-Time(W2) On-site/Hybrid, Arlington, VA (Remote option available for the ... CUDA) * Software-defined radio (SDR) experience * Embedded system experience * This opportunity is ...

Remote Rate: $130K per annum Job Responsibilities Develop and optimize features in a modern, low ... CUDA, and TensorRT, or with Python interop such as pybind11 Experience submitting code alongside ...

Build and optimize CUDA kernels, JAX/PyTorch ops, and distributed training loops for physics-aware ... Strong grasp of modern DevOps: code reviews, CI/CD pipelines, observability, and containerization ...

Senior Software Engineer, AI Inference

Boston, MA ยท On-site +1

$133K - $220K/yr

... CUDA kernel compilation to Kubernetes-orchestrated model serving on OpenShift. If you want to work ... For positions with Remote-US locations, the actual salary range for the position may differ based ...

Senior Software Engineer, AI Inference

Raleigh, NC ยท On-site +1

$133K - $220K/yr

... CUDA kernel compilation to Kubernetes-orchestrated model serving on OpenShift. If you want to work ... For positions with Remote-US locations, the actual salary range for the position may differ based ...

Senior Software Engineer, AI Inference

Boston, MA ยท On-site +1

$133K - $220K/yr

... CUDA kernel compilation to Kubernetes-orchestrated model serving on OpenShift. If you want to work ... For positions with Remote-US locations, the actual salary range for the position may differ based ...

next page

Showing results 1-20

Remote Cuda Developer information

See salary details

$83.5K

$102.5K

$135.5K

How much do remote cuda developer jobs pay per year?

As of Jun 14, 2026, the average yearly pay for remote cuda developer in the United States is $102,500.00, according to ZipRecruiter salary data. Most workers in this role earn between $90,000.00 and $115,000.00 per year, depending on experience, location, and employer.

How does a Remote CUDA Developer typically collaborate with team members across different locations?

As a Remote CUDA Developer, you will frequently collaborate with cross-functional teams such as data scientists, software engineers, and product managers through virtual meetings, code reviews, and collaborative platforms like GitHub or GitLab. Clear communication and thorough documentation are essential since team members may be in different time zones. You can expect to participate in regular stand-ups, sprint planning, and peer programming sessions, ensuring alignment and smooth integration of your GPU-accelerated code into larger projects. Tools like Slack, Zoom, and project management platforms help maintain connectivity and workflow efficiency.

What is a Remote CUDA Developer?

A Remote CUDA Developer is a software engineer who specializes in using NVIDIA's CUDA (Compute Unified Device Architecture) platform to develop parallel computing applications, often for high-performance tasks like machine learning, scientific computing, or data analysis. They work remotely, collaborating with teams online rather than being physically present in an office. These developers write and optimize code to run efficiently on NVIDIA GPUs, enabling applications to process large amounts of data much faster than traditional CPU-only solutions.

What are the key skills and qualifications needed to thrive as a Remote CUDA Developer, and why are they important?

To thrive as a Remote CUDA Developer, you need strong proficiency in C/C++ programming, parallel computing concepts, and a solid understanding of GPU architecture, typically backed by a degree in computer science or a related field. Experience with NVIDIA CUDA toolkit, GPU debugging tools, and version control systems like Git is commonly required. Excellent problem-solving skills, self-motivation, and effective remote communication abilities help distinguish high performers in this role. These skills are vital for efficiently delivering high-performance computing solutions and collaborating seamlessly with distributed teams.

What is the difference between Remote Cuda Developer vs Remote Machine Learning Engineer?

AspectRemote Cuda DeveloperRemote Machine Learning Engineer
Required CredentialsCUDA programming certifications, computer science degreeMachine learning certifications, data science background
Work EnvironmentSoftware development, GPU optimizationModel development, data analysis
Industry UsageHigh-performance computing, gaming, AIAI, data science, predictive modeling

Remote Cuda Developers focus on GPU programming and optimization using CUDA, primarily in high-performance computing and AI applications. Remote Machine Learning Engineers develop and deploy machine learning models, often utilizing GPU resources but with a broader focus on data and algorithms. While both roles may involve GPU expertise, Cuda Developers specialize in low-level programming, whereas Machine Learning Engineers work on model development and deployment.

More about Remote Cuda Developer jobs
What cities are hiring for Remote Cuda Developer jobs? Cities with the most Remote Cuda Developer job openings:
What are the most commonly searched types of Cuda Developer jobs? The most popular types of Cuda Developer jobs are:
What states have the most Remote Cuda Developer jobs? States with the most job openings for Remote Cuda Developer jobs include:
Infographic showing various Remote Cuda Developer job openings in the United States as of June 2026, with employment types broken down into 87% Full Time, and 13% Contract. Highlights an 87% Physical, 6% Hybrid, and 7% Remote job distribution, with an average salary of $102,500 per year, or $49.3 per hour.
Machine Learning Systems Engineer

Machine Learning Systems Engineer

Motional

Pittsburgh, PA โ€ข On-site, Remote

Other

Posted 3 days ago


Job description

Mission Summary:

We are looking for a Machine Learning Systems Engineer to join our ML Acceleration team. In this role, you will be responsible for the core systems that enable our researchers to train frontier models at scale, focusing obsessively on speed, cost, reliability, and throughput. You will work at the intersection of machine learning research and high-performance systems engineering. Your work will directly impact our ability to scale large-scale distributed model training and reduce the time-to-convergence for our next generation of models.

What you'll be doing:

  • Performance Profiling & Optimization: Utilize profiling tools (e.g., Nsight, PyTorch Profiler) to identify bottlenecks in data loading, gradient computation, and communication. Implement optimizations like kernel fusion, sharding, and tiling to improve step time.
  • Distributed Training: Optimize distributed training pipelines using frameworks such as PyTorch Distributed.
  • Kernel Development: Design and maintain high-performance GPU kernels in Triton or CUDA for state-of-the-art ML workloads.
  • Data Pipeline Engineering: Optimize robust data loading pipelines that maximize training throughput.

What we're looking for:

  • Education: Bachelor's, Master's degree, or PhD in Computer Science, Computer Engineering, or a related technical discipline.
  • Software Engineering: Strong proficiency in Python.
  • ML Frameworks: Extensive hands-on experience with PyTorch.
  • ML Knowledge: Experience optimizing machine learning model execution during training and inference, alongside a strong understanding of fundamental machine learning concepts, architectures, and processes.
  • Problem Solving: Exceptional analytical and problem-solving skills, with a bias for action and a data-driven approach to technical challenges.

We encourage a hybrid schedule with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, or this role can be fully remote.