... level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures ... Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial ...
... level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures ... Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial ...
Lead AI Engineer
Toronto, ON ยท On-site +1
Title and Summary Lead AI Engineer Overview We are looking for a talented Lead AI Engineer to work ... Strong foundational experience with PySpark and a solid understanding of distributed and parallel ...
Lead AI Engineer
Toronto, ON ยท On-site +1
Title and Summary Lead AI Engineer Overview We are looking for a talented Lead AI Engineer to work ... Strong foundational experience with PySpark and a solid understanding of distributed and parallel ...
... in parallel with the Utility. ยท Thorough working knowledge of airfield lighting, system safety ... and engineering to program and construction management. AECOM is a Fortune 500 firm that had ...
... in parallel with the Utility. ยท Thorough working knowledge of airfield lighting, system safety ... and engineering to program and construction management. AECOM is a Fortune 500 firm that had ...
This Delivery Manager role exists to keep that parallel delivery coherent: ceremonies running tight ... Candidates who have led delivery for AI-assisted engineering teams and built operational frameworks ...
This Delivery Manager role exists to keep that parallel delivery coherent: ceremonies running tight ... Candidates who have led delivery for AI-assisted engineering teams and built operational frameworks ...
This Delivery Manager role exists to keep that parallel delivery coherent: ceremonies running tight ... Candidates who have led delivery for AI-assisted engineering teams and built operational frameworks ...
This Delivery Manager role exists to keep that parallel delivery coherent: ceremonies running tight ... Candidates who have led delivery for AI-assisted engineering teams and built operational frameworks ...
Strong programming background in C/C++ and Python, with a concrete understanding of bare-metal or ... parallel compute pipelines, high-bandwidth memory (HBM) controllers, and matrix math engines.
New
Strong programming background in C/C++ and Python, with a concrete understanding of bare-metal or ... parallel compute pipelines, high-bandwidth memory (HBM) controllers, and matrix math engines.
New
You have a good understanding of design patterns and modern user interface paradigms (MVC, MVVM), and experience with parallel and asynchronous programming. You have full-stack experience, having ...
You have a good understanding of design patterns and modern user interface paradigms (MVC, MVVM), and experience with parallel and asynchronous programming. You have full-stack experience, having ...
Strong programming background in C/C++ and Python, with a concrete understanding of bare-metal or ... parallel compute pipelines, high-bandwidth memory (HBM) controllers, and matrix math engines.
New
Strong programming background in C/C++ and Python, with a concrete understanding of bare-metal or ... parallel compute pipelines, high-bandwidth memory (HBM) controllers, and matrix math engines.
New
The Role We're looking for an experienced QA Automation Engineer to help us scale a high-quality ... Experience integrating automated tests into CI/CD pipelines (parallel runs, retries, artifacts ...
New
Quick apply
The Role We're looking for an experienced QA Automation Engineer to help us scale a high-quality ... Experience integrating automated tests into CI/CD pipelines (parallel runs, retries, artifacts ...
New
Research Crawling Engineer
Toronto, ON ยท Remote
$80K - $175K/yr
Strong programming experience in one or more of: Go, Rust, Python, Java, or C++ * Experience ... Familiarity with distributed systems and parallel processing * Experience working with large ...
Quick apply
Research Crawling Engineer
Toronto, ON ยท Remote
$80K - $175K/yr
Strong programming experience in one or more of: Go, Rust, Python, Java, or C++ * Experience ... Familiarity with distributed systems and parallel processing * Experience working with large ...
Data Engineer, Mortgage Servicing
Toronto, ON ยท Remote
CA$140K - CA$240K/yr
In parallel, this individual will help enable downstream analytics, reporting, product capabilities ... Experience with modern programming languages commonly used in data engineering, such as Python ...
Data Engineer, Mortgage Servicing
Toronto, ON ยท Remote
CA$140K - CA$240K/yr
In parallel, this individual will help enable downstream analytics, reporting, product capabilities ... Experience with modern programming languages commonly used in data engineering, such as Python ...
Research Engineer, Calibration
Toronto, ON ยท On-site +1
CA$158K - CA$269K/yr
... parallel, and distributed computing techniques for efficient computation. - Publications in top-tier conferences or journals related to high-performance computing, image processing, computer graphics ...
Research Engineer, Calibration
Toronto, ON ยท On-site +1
CA$158K - CA$269K/yr
... parallel, and distributed computing techniques for efficient computation. - Publications in top-tier conferences or journals related to high-performance computing, image processing, computer graphics ...
Research Crawling Engineer
Toronto, ON ยท Remote
$80K - $175K/yr
Strong programming experience in one or more of: Go, Rust, Python, Java, or C++ * Experience ... Familiarity with distributed systems and parallel processing * Experience working with large ...
Quick apply
Research Crawling Engineer
Toronto, ON ยท Remote
$80K - $175K/yr
Strong programming experience in one or more of: Go, Rust, Python, Java, or C++ * Experience ... Familiarity with distributed systems and parallel processing * Experience working with large ...
RTL Design Engineer
Thornhill, ON ยท Hybrid
... parallel. Be a part of a team that delivers Industry leading IP and help our experts in RTL, FW ... Digital design engineering experience * Excellent knowledge of Verilog, System Verilog, C and a ...
RTL Design Engineer
Thornhill, ON ยท Hybrid
... parallel. Be a part of a team that delivers Industry leading IP and help our experts in RTL, FW ... Digital design engineering experience * Excellent knowledge of Verilog, System Verilog, C and a ...
Champion modern, agile quality engineering- shift-left testing, test pyramids, contract testing ... HPC/ parallel runtimes andcommunication libraries (MPI, RCCL/NCCL, UCX, Libfabric) * Linux kernel ...
Champion modern, agile quality engineering- shift-left testing, test pyramids, contract testing ... HPC/ parallel runtimes andcommunication libraries (MPI, RCCL/NCCL, UCX, Libfabric) * Linux kernel ...
The ideal candidate should be passionate about AI/ML engineering and possess leadership skills to ... Familiar with GPU/TPU/CPU SOC architectures including SIMD/SIMT/Parallel Processing models, Cache ...
The ideal candidate should be passionate about AI/ML engineering and possess leadership skills to ... Familiar with GPU/TPU/CPU SOC architectures including SIMD/SIMT/Parallel Processing models, Cache ...
Research Engineer, Sensor Signal Processing
Toronto, ON ยท On-site +1
CA$155K - CA$269K/yr
... on parallel computing architectures (e.g., CPU, GPU, DSP, and specialized accelerators ... BLAS, CHOLMOD, Gauss-Newton, L-BFGS, linear programming. - Experience with real-time methods ...
Research Engineer, Sensor Signal Processing
Toronto, ON ยท On-site +1
CA$155K - CA$269K/yr
... on parallel computing architectures (e.g., CPU, GPU, DSP, and specialized accelerators ... BLAS, CHOLMOD, Gauss-Newton, L-BFGS, linear programming. - Experience with real-time methods ...
The Principal Developer partners closely with business, product, and technology stakeholders to ... parallel to product development * Gain a wholistic view of the Moneris sales process for the ...
The Principal Developer partners closely with business, product, and technology stakeholders to ... parallel to product development * Gain a wholistic view of the Moneris sales process for the ...
Champion modern, agile quality engineering-- shift-left testing, test pyramids, contract testing ... HPC/ parallel runtimes andcommunication libraries (MPI, RCCL/NCCL, UCX, Libfabric) * Linux kernel ...
Champion modern, agile quality engineering-- shift-left testing, test pyramids, contract testing ... HPC/ parallel runtimes andcommunication libraries (MPI, RCCL/NCCL, UCX, Libfabric) * Linux kernel ...
Use tools like Claude Code / Codex to accelerate implementation, debugging, and refactoring across multiple sessions in parallel * Create internal tooling, prompts, and "engineering systems" that ...
Quick apply
Use tools like Claude Code / Codex to accelerate implementation, debugging, and refactoring across multiple sessions in parallel * Create internal tooling, prompts, and "engineering systems" that ...
Parallel Programming information
See Toronto, ON salary details
$22.4K - $33.4K
5% of jobs
$33.4K - $44.4K
4% of jobs
$44.4K - $55.4K
5% of jobs
$55.4K - $66.3K
4% of jobs
$66.3K - $77.3K
3% of jobs
$87.4K is the 25th percentile. Wages below this are outliers.
$77.3K - $88.3K
3% of jobs
$88.3K - $99.3K
2% of jobs
$99.3K - $110.2K
3% of jobs
The median wage is $116.4K / yr.
$110.2K - $121.2K
35% of jobs
$130.4K is the 75th percentile. Wages above this are outliers.
$121.2K - $132.2K
12% of jobs
$132.2K - $143.2K
23% of jobs
$22.4K
$105.1K
$143.2K
How much do parallel programming jobs pay per year?
What is a Parallel Programming job?
What are the key skills and qualifications needed to thrive in the Parallel Programming position, and why are they important?
What are some typical challenges encountered in a Parallel Programming role?
What job makes $10,000 a month without a degree?

Full-time
Posted 23 days ago
Job description
We are now looking for a Senior Machine Learning Applications and Compiler Engineer!
NVIDIA is seeking engineers to develop algorithms and optimizations for our LPX inference and compiler stack. You will work at the intersection of large-scale systems, compilers, and deep learning, crafting how neural network workloads map onto future NVIDIA platforms. This is your chance to be part of something outstandingly innovative!
What you'll be doing:
Build, develop, and maintain high-performance runtime and compiler components, focusing on end-to-end inference optimization.
Define and implement mappings of large-scale inference workloads onto NVIDIA's systems.
Extend and integrate with NVIDIA's SW ecosystem, contributing to libraries, tooling, and interfaces that enable seamless deployment of models across platforms.
Benchmark, profile, and monitor key performance and efficiency metrics to ensure the compiler generates efficient mappings of neural network graphs to our inference hardware.
Collaborate closely with hardware architects and design teams to feedback software observations, influence future architectures, and codesign features that unlock new performance and efficiency points.
Prototype and evaluate new compilation and runtime techniques, including graph transformations, scheduling strategies, and memory/layout optimizations tailored to spatial processors.
Publish and present technical work on novel compilation approaches for inference and related spatial accelerators at top tier ML, compiler, and computer architecture venues.
What we need to see:
MS or PhD in Computer Science, Electrical/Computer Engineering, or related field, or equivalent experience, with 5 years of relevant experience.
Strong software engineering background with proficiency in systems level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures, algorithms, and concurrency.
Hands on experience with compiler or runtime development, including IR design, optimization passes, or code generation.
Experience with LLVM and/or MLIR, including building custom passes, dialects, or integrations.
Familiarity with deep learning frameworks such as TensorFlow and PyTorch, and experience working with portable graph formats such as ONNX.
Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial accelerators, or other domain specific processors.
Strong analytical and debugging skills, with experience using profiling, tracing, and benchmarking tools to drive performance improvements.
Excellent communication and collaboration skills, with the ability to work across hardware, systems, and software teams.
Ideal candidates will have direct experience with MLIR based compilers or other multilevel IR stacks, especially in the context of graph based deep learning workloads.
Ways to stand out from the crowd:
Prior work on spatial or dataflow architectures, including static scheduling, pipeline parallelism, or tensor parallelism at scale.
Contributions to opensource ML frameworks, compilers, or runtime systems, particularly in areas related to performance or scalability.
Demonstrated research impact, such as publications or presentations at conferences like PLDI, CGO, ASPLOS, ISCA, MICRO, MLSys, NeurIPS, or similar.
Experience with large-scale AI distributed inference or training systems, including performance modeling and capacity planning for multi rack deployments.
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 135,000 CAD - 185,000 CAD for Level 3, and 170,000 CAD - 220,000 CAD for Level 4.You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
About Nvidia
Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Santa Clara, CA, US
Year founded
1993