We are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking ... GPU computing is the most productive and pervasive platform for deep learning and AI. It begins ...
We are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking ... GPU computing is the most productive and pervasive platform for deep learning and AI. It begins ...
We are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking ... GPU computing is the most productive and pervasive platform for deep learning and AI. It begins ...
We are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking ... GPU computing is the most productive and pervasive platform for deep learning and AI. It begins ...
... high-performance computing or systems-level optimization • Experience with infrastructure-as-code (Kubernetes, Docker, Terraform) • Contributions to open-source ML or systems projects Company
... high-performance computing or systems-level optimization • Experience with infrastructure-as-code (Kubernetes, Docker, Terraform) • Contributions to open-source ML or systems projects Company
NVIDIA has been transforming computer graphics and accelerated computing for over 25 years. They are looking for senior engineers obsessed with performance analysis and optimization to enhance AI ...
NVIDIA has been transforming computer graphics and accelerated computing for over 25 years. They are looking for senior engineers obsessed with performance analysis and optimization to enhance AI ...
Deep Learning Kernel Software Performance Architect - New College Grad 2026
Santa Clara, CA · On-site
$196K/yr
... high-performance computing, performance analysis and profiling to identify performance bottlenecks. • Experience and familiarity with GPU computing and parallel programming models. • Work ...
Deep Learning Kernel Software Performance Architect - New College Grad 2026
Santa Clara, CA · On-site
$196K/yr
... high-performance computing, performance analysis and profiling to identify performance bottlenecks. • Experience and familiarity with GPU computing and parallel programming models. • Work ...
HPC Engineer
Fremont, CA · On-site
... Engineering team ... This individual will design, implement, optimize, and support high-performance computing solutions ...
HPC Engineer
Fremont, CA · On-site
... Engineering team ... This individual will design, implement, optimize, and support high-performance computing solutions ...
Deep Learning Kernel Software Performance Architect - New College Grad 2026
Santa Clara, CA · On-site
$196K/yr
... high-performance computing, performance analysis and profiling to identify performance bottlenecks. * Experience and familiarity with GPU computing and parallel programming models. * Work experience ...
Deep Learning Kernel Software Performance Architect - New College Grad 2026
Santa Clara, CA · On-site
$196K/yr
... high-performance computing, performance analysis and profiling to identify performance bottlenecks. * Experience and familiarity with GPU computing and parallel programming models. * Work experience ...
Deep Learning Kernel Software Performance Architect - New College Grad 2026
Santa Clara, CA · On-site
$196K/yr
... high-performance computing, performance analysis and profiling to identify performance bottlenecks. * Experience and familiarity with GPU computing and parallel programming models. * Work experience ...
Deep Learning Kernel Software Performance Architect - New College Grad 2026
Santa Clara, CA · On-site
$196K/yr
... high-performance computing, performance analysis and profiling to identify performance bottlenecks. * Experience and familiarity with GPU computing and parallel programming models. * Work experience ...
... learning, high-performance computing, and distributed architectures You will architect and ... Key job responsibilities Our kernel engineers collaborate across compiler, runtime, framework, and ...
... learning, high-performance computing, and distributed architectures You will architect and ... Key job responsibilities Our kernel engineers collaborate across compiler, runtime, framework, and ...
... learning, high-performance computing, and distributed architectures You will architect and ... Key job responsibilities Our kernel engineers collaborate across compiler, runtime, framework, and ...
... learning, high-performance computing, and distributed architectures You will architect and ... Key job responsibilities Our kernel engineers collaborate across compiler, runtime, framework, and ...
Senior Deep Learning Compiler Engineer - XLA
Santa Clara, CA · On-site
$122K - $168K/yr
Knowledge of high-performance computing and distributed programming. • Strong interpersonal ... interns is a bonus. • Experience working deep learning frameworks such as JAX, PyTorch or ...
Senior Deep Learning Compiler Engineer - XLA
Santa Clara, CA · On-site
$122K - $168K/yr
Knowledge of high-performance computing and distributed programming. • Strong interpersonal ... interns is a bonus. • Experience working deep learning frameworks such as JAX, PyTorch or ...
Software Engineering Manager, ML Kernel Performance, AWS Neuron, Annapurna Labs
Cupertino, CA · On-site
$172K/yr
... high-performance computing, and distributed architectures. You will architect and implement ... Key job responsibilities Our kernel engineers collaborate across compiler, runtime, framework, and ...
Software Engineering Manager, ML Kernel Performance, AWS Neuron, Annapurna Labs
Cupertino, CA · On-site
$172K/yr
... high-performance computing, and distributed architectures. You will architect and implement ... Key job responsibilities Our kernel engineers collaborate across compiler, runtime, framework, and ...
Senior Fortran Compiler Engineer
Santa Clara, CA · On-site
$122K - $168K/yr
NVIDIA's HPC compiler group is seeking a Fortran compiler developer to contribute to the ... high-performance computing, while implementing and improving features in LLVM Flang, OpenACC, and ...
Senior Fortran Compiler Engineer
Santa Clara, CA · On-site
$122K - $168K/yr
NVIDIA's HPC compiler group is seeking a Fortran compiler developer to contribute to the ... high-performance computing, while implementing and improving features in LLVM Flang, OpenACC, and ...
Construction High School Internship Program Start date anticipated sometime in June Deacon ... Interns will learn how different roles work together, including superintendents, project engineers ...
Construction High School Internship Program Start date anticipated sometime in June Deacon ... Interns will learn how different roles work together, including superintendents, project engineers ...
... MBZUAI as a global hub for high-performance computing in deep learning, driving impactful ... engineers to; • Collect, prepare and processing training and validation datasets. • Assist in ...
Quick apply
... MBZUAI as a global hub for high-performance computing in deep learning, driving impactful ... engineers to; • Collect, prepare and processing training and validation datasets. • Assist in ...
... high-impact research projects that intersect with our engineering roadmap. Organization and ... performance computing - Experience driving collaborative projects from conception to delivery ...
... high-impact research projects that intersect with our engineering roadmap. Organization and ... performance computing - Experience driving collaborative projects from conception to delivery ...
Senior / Staff Software Engineer, High-Performance Onboard Algorithms
San Francisco, CA · On-site +1
$148K - $260K/yr
To learn more visit: www.waabi.ai As a Software Engineer in High-Performance Onboard Algorithms ... Bonus: - Experience with accelerated computing like CUDA, Vulkan, and OpenCL. - Experience with ...
Senior / Staff Software Engineer, High-Performance Onboard Algorithms
San Francisco, CA · On-site +1
$148K - $260K/yr
To learn more visit: www.waabi.ai As a Software Engineer in High-Performance Onboard Algorithms ... Bonus: - Experience with accelerated computing like CUDA, Vulkan, and OpenCL. - Experience with ...
AI Research Internship - LLM
Sunnyvale, CA · On-site
$100K - $140K/yr
... high-performance computing in deep learning, driving impactful discoveries that inspire the next ... engineers to; • Collect, prepare and processing training and validation datasets. • Assist in ...
AI Research Internship - LLM
Sunnyvale, CA · On-site
$100K - $140K/yr
... high-performance computing in deep learning, driving impactful discoveries that inspire the next ... engineers to; • Collect, prepare and processing training and validation datasets. • Assist in ...
Senior / Staff Software Engineer, High-Performance Onboard Algorithms
San Francisco, CA · On-site +1
$148K - $260K/yr
To learn more visit: www.waabi.ai As a Software Engineer in High-Performance Onboard Algorithms ... Bonus: - Experience with accelerated computing like CUDA, Vulkan, and OpenCL. - Experience with ...
Senior / Staff Software Engineer, High-Performance Onboard Algorithms
San Francisco, CA · On-site +1
$148K - $260K/yr
To learn more visit: www.waabi.ai As a Software Engineer in High-Performance Onboard Algorithms ... Bonus: - Experience with accelerated computing like CUDA, Vulkan, and OpenCL. - Experience with ...
... AI and high-performance computing within financial markets, focusing on optimizing complex ... Preferred : • Prior internship experience in a related field. • Experience with inference ...
... AI and high-performance computing within financial markets, focusing on optimizing complex ... Preferred : • Prior internship experience in a related field. • Experience with inference ...
Internship High Performance Computing Engineer information
What are the key skills and qualifications needed to thrive as an Internship High Performance Computing (HPC) Engineer, and why are they important?
What is the difference between Internship High Performance Computing Engineer vs Internship Data Scientist?
| Aspect | Internship High Performance Computing Engineer | Internship Data Scientist |
|---|---|---|
| Required Skills | Programming (C++, Python), parallel computing, HPC systems | Statistics, machine learning, data analysis, Python/R |
| Work Environment | Research labs, tech companies, academia with focus on HPC systems | Tech firms, finance, healthcare, research institutions |
| Industry Usage | High-performance computing projects, scientific simulations | Data analysis, predictive modeling, business insights |
Internship High Performance Computing Engineers focus on developing and optimizing computational systems for large-scale scientific and engineering problems, requiring skills in parallel programming and HPC environments. In contrast, Internship Data Scientists analyze data to extract insights, using statistical and machine learning techniques. Both roles are valuable in tech and research sectors but differ in technical focus and daily tasks.
What is an Internship High Performance Computing Engineer?
What types of projects can I expect to work on as an Internship High Performance Computing Engineer?
Full-time
Posted 17 days ago
Job description
NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIA's high-performance LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.
What you will be doing:
- Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.
- Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.
- Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.
- Build and support NVIDIA submissions to the MLPerf Training benchmark suite.
- Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.
- Build tools to automate workload analysis, workload optimization, and other critical workflows.
What we want to see:
- PhD in Computer Science, Electrical Engineering or Computer Engineering and 5+ years; or MS (or equivalent experience) and 8+ years of meaningful work experience.
- Strong background in deep learning and neural networks, in particular training.
- A deep background in computer architecture and familiarity with the fundamentals of GPU architecture.
- Proven experience analyzing and tuning application performance & processor and system-level performance modelling.
- Programming skills in C++, Python, and CUDA.
GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.
Widely considered to be one of tech's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. If you're excited to work across the full hardware & software stack-from GPU architecture to application code-to achieve optimal performance, we want to hear from you!
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until April 12, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
About Nvidia
Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Santa Clara, CA, US
Year founded
1993