Cuda Machine Learning Performance Engineer Jobs

Machine Learning Performance Engineer

New York, NY · On-site

Optiver is a seeking a Machine Learning Performance Engineer to join our team, focusing on a ... Strong knowledge of low-level GPU programming with CUDA, including Tensor Cores, cooperative groups ...

Machine Learning Performance Engineer

New York, NY · On-site

Machine Learning Performance Engineer

$154.30K/yr

... level systems programming and optimisation to join our growing ML team. Machine learning is a ... Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems ...

Machine Learning Performance Engineer

$154.30K/yr

Machine Learning Performance Engineer

New York, NY · On-site

$200K/yr

Machine Learning Performance Engineer

New York, NY · On-site

$200K/yr

Machine Learning Performance Engineer

$200K/yr

Machine Learning Performance Engineer

$200K/yr

Machine Learning Performance Engineer

New York, NY · On-site

$153.20K/yr

... level systems programming and optimization to join our growing ML team. Machine learning is a ... Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems ...

Machine Learning Performance Engineer

New York, NY · On-site

$153.20K/yr

Machine Learning Performance Engineer

New York, NY · On-site

$153.20K/yr

Machine Learning Performance Engineer

New York, NY · On-site

$153.20K/yr

Machine Learning Performance Engineer

New York, NY

$153.20K/yr

Machine Learning Performance Engineer

New York, NY

$153.20K/yr

Machine Learning Performance Engineer

New York, NY · On-site

$153.20K/yr

Machine Learning Performance Engineer

New York, NY · On-site

$153.20K/yr

Machine Learning Performance Engineer

Loveland, CO · On-site

$160.16K - $266.93K/yr

Write custom CUDA kernels and LibTorch (PyTorch C++) extensions to accelerate hot paths in both ... in ML engineering, performance engineering, or HPC, with substantial production ML experience

Machine Learning Performance Engineer

Loveland, CO · On-site

$160.16K - $266.93K/yr

Machine Learning Performance Engineer

Santa Rosa, CA · On-site

$160.16K - $266.93K/yr

Machine Learning Performance Engineer

Santa Rosa, CA · On-site

$160.16K - $266.93K/yr

Machine Learning Performance Engineer

Santa Rosa, CA · On-site

$160.16K - $266.93K/yr

Machine Learning Performance Engineer

Santa Rosa, CA · On-site

$160.16K - $266.93K/yr

SmartIPlace

W2 Role- Machine Learning Performance Engineer - CUDA Python[50% Travel-remote]

Austin, TX · Remote

$143.30K/yr

Machine Learning Performance Engineer - CUDA Python Work Authorization - USC / GC only Interview: Video Duration: 6-month contract maybe extensions Location: 50% travel Duration: 6 month contract

Quick apply

SmartIPlace

W2 Role- Machine Learning Performance Engineer - CUDA Python[50% Travel-remote]

Austin, TX · Remote

$143.30K/yr

Machine Learning Performance Engineer - CUDA Python Work Authorization - USC / GC only Interview: Video Duration: 6-month contract maybe extensions Location: 50% travel Duration: 6 month contract

Staff Machine Learning Performance Engineer, Siri Runtime Systems and Interaction

Cupertino, CA

$212K - $318.40K/yr

Staff Machine Learning Performance Engineer, Siri Runtime Systems And Interaction Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new ...

Staff Machine Learning Performance Engineer, Siri Runtime Systems and Interaction

Cupertino, CA

$212K - $318.40K/yr

Senior Deep Learning Performance Architect

Santa Clara, CA

$196.10K/yr

Solid foundation in machine learning and deep learning * Strong programming skills in Python, C, C ... Triton, CUDA, OpenCL * Experience with the architecture of or workload analysis on other DL ...

Senior Deep Learning Performance Architect

Santa Clara, CA

$196.10K/yr

Senior Deep Learning Performance Architect

Redmond, WA

$187K/yr

Senior Deep Learning Performance Architect

Redmond, WA

$187K/yr

Nvidia Corporation

Senior Deep Learning Performance Architect

Santa Clara, CA · On-site

$196.10K/yr

Nvidia Corporation

Senior Deep Learning Performance Architect

Santa Clara, CA · On-site

$196.10K/yr

NVIDIA

Deep Learning Kernel Software Performance Architect - New College Grad 2026

Santa Clara, CA · On-site

$196.10K/yr

... with the CUDA and AI Compiler teams to pinpoint and resolve performance issues • Engage AI/ML ... in programming languages such as Python, C, C++. Preferred : • Strong foundation in machine ...

NVIDIA

Deep Learning Kernel Software Performance Architect - New College Grad 2026

Santa Clara, CA · On-site

$196.10K/yr

Exaways Corporation

Machine Learning Engineer

Berkeley Heights, NJ

... Performance Computing (HPC) machines, Data Science tools, products & services in cloud and on ... CUDA • Preferred experience with configuration Management tools like Ansible, puppet • ...

Exaways Corporation

Machine Learning Engineer

Berkeley Heights, NJ

Staff Machine Learning Performance Engineer, Siri Runtime Systems and Interaction

Cupertino, CA · On-site

As a Machine Learning Performance Engineer, you will play a critical role in ensuring the efficiency and scalability of Siri's machine learning models. You will work closely with diverse teams to ...