OR · On-site
As a Principal Machine Learning Engineer, you will work at the intersection of applied ML and ... Practical experience optimizing ML workflows using CUDA/GPU acceleration. * Background in feature ...
OR · On-site
$139K/yr
Excellent hands-on C++ programming skills applied to industry standard C++ compilers and ... Developing CUDA, DirectX, OpenGL/Vulkan, OptiX applications. * You should have strong interpersonal ...
Background in parallel programming, e.g., CUDA, OpenMP, MPI, pthreads, etc. * Programming fluency in C/C++ with a deep understanding of algorithms and software development. * Knowledge of CPU and GPU ...
Experience with GPU / accelerator programming (Vulkan, CUDA, SYCL, Metal) or SIMD / CPU kernels * Familiarity with quantization formats and their quality trade-offs * Open-source contributions to ...
Hillsboro, OR · On-site
$122K - $232K/yr
Experience with scripting languages such as Python/Perl/PowerShell/shell and programming languages ... Experience with AI/ML frameworks and libraries (e.g., PyTorch, CUDA, vLLM, Triton, NCCL, oneCCL ...
Hillsboro, OR · On-site
$188K - $223K/yr
Ability to quickly ramp up on parallel programming models like CUDA to accelerate computational methods. * Ability to solve complex algorithmic challenges and technical questions with minimal ...
Hillsboro, OR · On-site
$128K - $181K/yr
... CUDA, or GPU programming. - Knowledge of performance analysis, optimization, and debugging techniques. Seize the opportunity to be part of Intel's mission to create world-changing technology that ...
$388K - $619K/yr
We work to provide Netflix developers with the best support, solutions, and approaches to leverage ... CUDA-aware Python, TensorRT, torch.compile, ONNX) Expertise in performance optimization for low ...
Hillsboro, OR · On-site +1
$195K - $275K/yr
GPU optimizations (OpenCL, CUDA, SYCL/DPC++, C for Metal or similar) * Parallel programming (OpenMP, TBB, or MPI) Job Type: Experienced Hire Shift: Shift 1 (United States of America) Primary Location:
OR · On-site +1
$466K - $750K/yr
Strong systems programming skills with the ability to work across multiple layers of the stack ... tuning (CUDA, NCCL, Nsight, PyTorch profiler) Experience with multimodal or diffusion model ...
OR · On-site
We're looking for a Principal Engineer to join our CSP Engagements team as the technical focal ... CUDA, NCCL, driver, and firmware teams * Ensure key open-source performance and stress tools (e.g ...
... GPU programming, and performance optimization. Contributes to the design, development, and ... Graphics experience (GPU / CUDA) Job Type: Student / Intern Shift: Shift 1 (United States of America ...
OR · On-site
$122K - $161K/yr
Deep hands-on experience with NCCL, CUDA-aware distributed execution, and debugging multi-GPU and ... Expert-level Python and C/C++ programming skills. * Experience operating workloads in scheduled ...
$141K - $191K/yr
Our diverse team of engineers and researchers have pioneered sparse, event-based, neuromorphic ... CUDA, LLVM, oneAPI, SYCL, ONNX, IREE, OpenVINO, TVM. * 5+ years of experience leading software ...
$170K - $315K/yr
Performance engineering and software performance optimizations * Floating point arithmetic and numerical stability * Software development on Linux * Low-level performance optimizations using CUDA ...
Hillsboro, OR · On-site
$141K - $191K/yr
Our diverse team of engineers and researchers have pioneered sparse, event-based, neuromorphic ... CUDA, LLVM, oneAPI, SYCL, ONNX, IREE, OpenVINO, TVM. * 5+ years of experience leading software ...
Hillsboro, OR · On-site
$170K - $315K/yr
Performance engineering and software performance optimizations * Floating point arithmetic and numerical stability * Software development on Linux * Low-level performance optimizations using CUDA ...
| Hourly wage | Share of jobs |
|---|---|
| $29.48 - $34.68 | 5% |
| $34.68 - $39.88 | 10% |
| $39.88 - $45.08 | 9% |
| $45.08 - $50.28 | 7% |
| $50.28 - $55.48 | 15% |
| $55.48 - $60.67 | 14% |
| $60.67 - $65.87 | 17% |
| $65.87 - $71.07 | 14% |
| $71.07 - $76.27 | 6% |
| $76.27 - $81.47 | 3% |
| $81.47 - $86.67 | 0% |

The 25th percentile is $46.19 / hr, the median wage is $57.08 / hr, and the 75th percentile is $65.39 / hr. Wages below the 25th percentile or above the 75th percentile are outliers.
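The percentile figures in the wage distribution above can be sanity-checked against the bin shares. The following is a minimal sketch, assuming wages are spread evenly within each bin (an assumption, since the underlying per-job data is not shown); the name `percentile_from_histogram` is hypothetical and introduced only for illustration.

```python
# Bins copied from the wage distribution: (low $/hr, high $/hr, percent of jobs).
BINS = [
    (29.48, 34.68, 5),
    (34.68, 39.88, 10),
    (39.88, 45.08, 9),
    (45.08, 50.28, 7),
    (50.28, 55.48, 15),
    (55.48, 60.67, 14),
    (60.67, 65.87, 17),
    (65.87, 71.07, 14),
    (71.07, 76.27, 6),
    (76.27, 81.47, 3),
    (81.47, 86.67, 0),
]


def percentile_from_histogram(bins, q):
    """Estimate the q-th percentile (0 < q <= 100) from binned shares.

    Walks the bins in order, finds the one where the cumulative share
    first reaches q, and interpolates linearly inside that bin.
    Assumes wages are uniformly distributed within each bin.
    """
    cumulative = 0.0
    for low, high, pct in bins:
        if pct > 0 and cumulative + pct >= q:
            fraction = (q - cumulative) / pct
            return low + fraction * (high - low)
        cumulative += pct
    raise ValueError("q exceeds the total share covered by the bins")


for q in (25, 50, 75):
    print(f"{q}th percentile estimate: ${percentile_from_histogram(BINS, q):.2f} / hr")
```

Under the uniform-within-bin assumption, the estimates land within about 40 cents of the listed 25th percentile, median, and 75th percentile. The small gaps are expected, because the listed figures presumably come from the individual postings rather than from the rounded bin shares.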
| Aspect | CUDA Programming | GPU Developer |
|---|---|---|
| Required Credentials | Knowledge of CUDA, C/C++, parallel computing | Knowledge of GPU architecture, CUDA, OpenCL, C/C++ |
| Work Environment | High-performance computing, scientific research, AI | Graphics, gaming, scientific visualization, AI |
| Industry Usage | Tech companies, research labs, AI firms | Gaming, entertainment, tech, research |
CUDA Programming roles focus specifically on writing code for NVIDIA's CUDA platform for parallel processing, while GPU Developer roles are broader: they include designing, optimizing, and implementing GPU-based solutions across various platforms and technologies. Both require knowledge of GPU architecture and of programming languages like C/C++, but GPU Developers often work on a wider range of applications beyond CUDA-specific projects.
The Team
The Machine Learning Platform team builds the foundational technology that scales machine learning innovation across Upstart. As a Principal Machine Learning Engineer, you will work at the intersection of applied ML and platform engineering, collaborating closely with Research Scientists, Data Scientists, and ML Platform Engineers to design tools and systems that accelerate model development and ultimately improve predictive accuracy. Success in this role requires deep knowledge of ML throughout the entire modeling lifecycle, from data preparation through training and deployment to production.
In this role, you will lead engineering initiatives that turn high-impact modeling needs into scalable, reusable infrastructure. This includes building a unified embeddings platform for training, serving, and managing representations at scale; streamlining feature engineering pipelines to reduce manual steps and deliver new signals quickly; developing automated continuous-learning systems that handle data refresh, retraining, evaluation, and drift monitoring with minimal manual effort; and scaling our training pipelines to support larger datasets, more complex architectures, and faster experimentation.
Across all of these efforts, you will work backward from applied ML projects that meaningfully improve accuracy, using those real-world scenarios to harden the platform capabilities that enable ML teams across Upstart to innovate with greater speed, reliability, and impact.
How You'll Make an Impact
Your work will multiply the effectiveness of every ML team at Upstart, accelerating innovation and advancing our mission to make credit more accurate, accessible, and fair.
This is a high-influence role suited to those who enjoy combining scientific innovation with cross-functional collaboration and advisory work.
Minimum Qualifications
Preferred Qualifications