1

Deep Learning Performance Architect Jobs (NOW HIRING)

Senior Deep Learning Performance Architect

Austin, TX · On-site

$165.50K/yr

... deep learning workloads, especially LLM inference/training in real-world deployments. • Build and use performance and power models (Python/C++) to drive architecture and product decisions. • ...

OR · Hybrid

We are now looking for a Senior Deep Learning Performance Architect! NVIDIA seeks a Senior DL Performance Architect to join our group of pioneers who enjoy pushing AI Inference performance boundaries.

next page

Showing results 1-20

Deep Learning Performance Architect information

See salary details

$156.5K

$168K

How much do deep learning performance architect jobs pay per year?

As of May 30, 2026, the average yearly pay for deep learning performance architect in the United States is $167,842.00, according to ZipRecruiter salary data. Most workers in this role earn between $167,000.00 and $167,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Deep Learning Performance Architect, and why are they important?

To thrive as a Deep Learning Performance Architect, you need a strong background in computer science, deep learning frameworks, parallel computing, and optimization techniques, typically supported by a relevant degree and experience in AI or high-performance computing. Familiarity with tools such as TensorFlow, PyTorch, CUDA, and profiling or benchmarking systems is essential. Analytical problem-solving, effective communication, and a collaborative mindset help professionals excel in cross-functional teams and resolve complex performance bottlenecks. These skills are vital for optimizing AI workloads, ensuring scalability, and maximizing the efficiency of deep learning models in production environments.

What are some common challenges faced by Deep Learning Performance Architects when optimizing large-scale neural network models?

Deep Learning Performance Architects often encounter challenges such as balancing model accuracy with computational efficiency, managing memory constraints on specialized hardware, and optimizing inference or training speed across different platforms. They frequently need to profile and analyze bottlenecks at both the algorithmic and hardware levels, often requiring close collaboration with software engineers and hardware designers. Staying current with rapidly evolving deep learning frameworks and hardware accelerators is also essential to ensure optimal performance and scalability.

What is a Deep Learning Performance Architect?

A Deep Learning Performance Architect is a specialized professional who designs, analyzes, and optimizes the performance of deep learning systems and models. They work to improve the efficiency, speed, and scalability of machine learning algorithms on various hardware platforms such as GPUs, TPUs, and CPUs. Their role often involves collaborating with software engineers and data scientists to identify bottlenecks and implement solutions that enhance computational capabilities for AI workloads. By doing so, they ensure that deep learning applications run faster and more efficiently, making the best use of available resources.

What is the difference between Deep Learning Performance Architect vs Machine Learning Engineer?

AspectDeep Learning Performance ArchitectMachine Learning Engineer
CredentialsAdvanced degrees in AI, deep learning, or related fields; certifications in deep learning frameworksDegrees in computer science, data science, or related fields; certifications in machine learning tools
Work EnvironmentResearch labs, AI development teams, performance optimization settingsData-driven projects, model development, deployment environments
Industry UsageTech companies, AI research firms, organizations focusing on deep learning optimizationTech companies, startups, enterprises applying machine learning solutions

The Deep Learning Performance Architect specializes in optimizing deep learning models for efficiency and scalability, focusing on hardware and software performance. In contrast, Machine Learning Engineers develop, train, and deploy machine learning models across various applications. While both roles require strong technical skills, the Architect emphasizes performance tuning and system optimization, whereas the Engineer focuses on model development and implementation.

More about Deep Learning Performance Architect jobs
What job categories do people searching Deep Learning Performance Architect jobs look for? The top searched job categories for Deep Learning Performance Architect jobs are:
Senior Deep Learning Performance Architect

Senior Deep Learning Performance Architect

Nvidia

Santa Clara, CA

$196.10K/yr

Full-time

Posted 16 days ago


Job description

We are now seeking a Senior Deep Learning Performance Architect!

NVIDIA is looking for outstanding Performance Architects with a background in performance analysis, performance modeling, and AI/deep learning to help analyze and develop the next generation of architectures that accelerate AI and high-performance computing applications.

What you'll be doing:

  • Develop innovative architectures to extend the state of the art in deep learning performance and efficiency

  • Analyze performance, cost and power trade-offs by developing analytical models, simulators and test suites

  • Understand and analyze the interplay of hardware and software architectures on future algorithms, programming models and applications

  • Develop, analyze, and harness groundbreaking Deep Learning frameworks, libraries, and compilers

  • Actively collaborate with software, product and research teams to guide the direction of deep learning HW and SW

What we need to see:

  • MS or PhD in Computer Science, Computer Engineering, Electrical Engineering or equivalent experience

  • 6+ years of meaningful work experience

  • Strong background in GPU or Deep Learning ASIC architecture for training and/or inference

  • Experience with performance modeling, architecture simulation, profiling, and analysis

  • Solid foundation in machine learning and deep learning

  • Strong programming skills in Python, C, C++

Ways to stand out from the crowd:

  • Background with deep neural network training, inference and optimization in leading frameworks (e.g. Pytorch, JAX, TensorRT)

  • Experience with relevant libraries, compilers, and languages - CUDNN, CUBLAS, CUTLASS, MLIR, Triton, CUDA, OpenCL

  • Experience with the architecture of or workload analysis on other DL accelerators

  • Demonstration of self-motivation, with a knack for critical thinking and thinking outside the box

Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people are no longer science fiction. GPU Deep Learning has provided the foundation for machines to learn, perceive, reason and solve problems. NVIDIA's GPUs run AI algorithms, simulating human intelligence, and act as the brains of computers, robots and self-driving cars that can perceive and understand the world. Increasingly known as "the AI computing company", NVIDIA wants you! Come, join our Deep Learning Architecture team, where you can help build real-time, efficient computing platforms driving our success in this exciting and rapidly growing field.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until January 13, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Nvidia logo

About Nvidia

Sourced by ZipRecruiter

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Santa Clara, CA, US

Year founded

1993