1

Fpga Deep Learning Jobs (NOW HIRING)

... FPGA or other DL accelerators; GPU programming experience (CUDA). * Track record of leading ambiguous, high-impact technical programs across multiple teams under tight deadlines. GPU deep learning ...

Profile and analyze deep learning workloads on Sohu to identify micro-architectural bottlenecks and ... Exposure to ASIC, FPGA, or CGRA-based accelerator development and hardware/software co-design ...

Profile and analyze deep learning workloads on Sohu to identify micro-architectural bottlenecks and ... Exposure to ASIC, FPGA, or CGRA-based accelerator development and hardware/software co-design ...

Senior DSP Engineer

Costa Mesa, CA · On-site

$153K - $179K/yr

... FPGA, Nvidia Jetson, and Software Defined Radios. * Skilled with Modeling and Simulation of RF systems including Radar and SAR * Familiar with deep learning algorithms. Experience with ML frameworks ...

Lead DSP Engineer, EW

Costa Mesa, CA · On-site

$153K - $179K/yr

... FPGA, Nvidia Jetson, and Software Defined Radios. * Skilled with Modeling and Simulation of RF systems including Radar and SAR * Familiar with deep learning algorithms. Experience with ML frameworks ...

next page

Showing results 1-20

Fpga Deep Learning information

See salary details

$70K

$147.1K

$210.5K

How much do fpga deep learning jobs pay per year?

As of Jun 7, 2026, the average yearly pay for fpga deep learning in the United States is $147,056.00, according to ZipRecruiter salary data. Most workers in this role earn between $123,000.00 and $169,500.00 per year, depending on experience, location, and employer.

What are FPGA Deep Learning engineers?

FPGA Deep Learning engineers are professionals who design, implement, and optimize deep learning models to run efficiently on Field-Programmable Gate Arrays (FPGAs). FPGAs are specialized hardware chips that can be programmed to perform specific computational tasks at high speeds and low power consumption. These engineers bridge the gap between artificial intelligence algorithms and hardware, ensuring that neural networks and AI applications can leverage FPGA advantages such as parallelism and flexibility. Their work is crucial in industries requiring real-time data processing, like autonomous vehicles, robotics, and edge computing.

What is the difference between Fpga Deep Learning vs Machine Learning Engineer?

AspectFpga Deep LearningMachine Learning Engineer
Required CredentialsBachelor's or higher in CS, EE, or related; knowledge of FPGA programming and deep learning frameworksBachelor's or higher in CS, Data Science, or related; expertise in ML algorithms and software development
Work EnvironmentHardware-focused, embedded systems, FPGA development labsSoftware-focused, data centers, cloud platforms, or research labs
Industry UsageEmbedded AI, edge computing, specialized hardware accelerationData analysis, predictive modeling, software solutions across industries

While both roles involve AI and machine learning, Fpga Deep Learning specialists focus on hardware acceleration using FPGAs to optimize deep learning models, whereas Machine Learning Engineers develop and deploy ML algorithms primarily in software environments. The roles often overlap in AI projects but differ in technical focus and work environment.

What are the key skills and qualifications needed to thrive as an FPGA Deep Learning Engineer, and why are they important?

To thrive as an FPGA Deep Learning Engineer, you need a solid background in digital design, hardware description languages (such as VHDL or Verilog), deep learning frameworks, and a relevant degree in electrical engineering, computer engineering, or a similar field. Familiarity with FPGA development tools (like Xilinx Vivado or Intel Quartus), hardware accelerators, and experience with deploying neural networks on embedded systems are typically required. Problem-solving ability, attention to detail, and strong collaboration skills are key soft skills that make a candidate stand out. These skills and qualities are essential for efficiently bridging the gap between AI algorithms and hardware implementations, ensuring high-performance, reliable solutions.

How do professionals in FPGA Deep Learning roles typically collaborate with software and data science teams?

FPGA Deep Learning professionals often work closely with software engineers and data scientists to optimize deep learning models for hardware acceleration. This collaboration involves translating neural network architectures from high-level frameworks (like TensorFlow or PyTorch) into efficient hardware implementations, communicating constraints or opportunities for parallelization, and iteratively refining models for performance. Regular meetings and code reviews are common to ensure alignment between hardware and software development. Effective communication and understanding of both domains are essential for successfully deploying deep learning solutions on FPGA platforms.
More about Fpga Deep Learning jobs
What cities are hiring for Fpga Deep Learning jobs? Cities with the most Fpga Deep Learning job openings:
What states have the most Fpga Deep Learning jobs? States with the most job openings for Fpga Deep Learning jobs include:
What job categories do people searching Fpga Deep Learning jobs look for? The top searched job categories for Fpga Deep Learning jobs are:
Infographic showing various Fpga Deep Learning job openings in the United States as of May 2026, with employment types broken down into 100% Full Time. Highlights an 89% In-person, and 11% Remote job distribution, with an average salary of $147,056 per year, or $70.7 per hour.
AI Inference Performance Engineer

AI Inference Performance Engineer

Nvidia

Santa Clara, CA • Hybrid

$164K/yr

Full-time

Posted 28 days ago


Job description

We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry's performance standards across language models, video generation, and speech workloads. We work directly within TensorRT-LLM, SGLang, and vLLM, building the tools that evaluate serving performance at scale. This team sits at the intersection of GPU performance engineering and public accountability.

What You Will Be Doing:

  • Drive industry benchmark results: own the end-to-end optimization pipeline, implement and integrate optimizations in quantization, scheduling, memory management, and distributed inference across TensorRT-LLM, SGLang, and vLLM.

  • Define and optimize cutting-edge workloads: identify and shape next-generation inference benchmarks, multi-turn coding, agentic workflows, and other emerging AI use cases. Collaborate with framework and kernel teams to push performance to its extreme on large-scale LLM-MoE models, vision-language models, video diffusion models, recommendation, and speech workloads.

  • Architect distributed inference: Design and optimize execution from single-GPU to rack-scale clusters, managing performance across clusters of GPUs.

  • Establish performance methodology: Apply roofline analysis and systematic profiling to decompose bottlenecks across CUDA kernels, frameworks, and serving layers.

  • Influence the ecosystem: contribute to TensorRT-LLM, vLLM, SGLang, and other open-source projects. Partner with architecture, kernel, and compiler teams to shape GPU roadmaps based on real workload data.

  • Technical Leadership: Raise the technical bar for the team, drive cross-functional execution on tight benchmark timelines, and lead a world-class team.

What We Need To See:

  • BS, MS, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience.

  • 5+ years of relevant software development experience.

  • Strong Python or C++ programming, software design, and software engineering skills.

  • Expertise with a DL framework such as PyTorch or JAX.

  • Proven track record of delivering measurable performance improvements in deep learning inference or high-performance systems.

  • Deep understanding of LLM/VLM architectures and inference mechanics: attention, KV caching, batching strategies, decode-phase bottlenecks, speculative decoding, disaggregated serving etc.

Ways To Stand Out From The Crowd:

  • Prior experience with an LLM framework (TensorRT-LLM, vLLM, SGLang, etc) or a DL compiler in inference, deployment, algorithms, or implementation.

  • Prior experience with performance modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application.

  • Experience with scale-out inference orchestration (MPI, NCCL, K8S) on large GPU clusters.

  • Expertise in kernel development (CUTLASS, cuteDSL, tilelang, OpenAI Triton) or compiler/runtime paths (torch.compile, graph lowering, operator fusion). Architectural knowledge of CPU, GPU, FPGA or other DL accelerators; GPU programming experience (CUDA).

  • Track record of leading ambiguous, high-impact technical programs across multiple teams under tight deadlines.

GPU deep learning has provided the foundation for machines to learn, perceive, reason and solve problems posed using human language. The GPU started out as the engine for simulating human imagination, conjuring up the outstanding virtual worlds of video games and Hollywood films. Now, NVIDIA's GPU runs deep learning algorithms, simulating human intelligence, and acts as the brain of computers, robots and self-driving cars that can perceive and understand the world. Just as human imagination and intelligence are linked, computer graphics and artificial intelligence come together in our architecture. Two modes of the human brain, two modes of the GPU. This may explain why NVIDIA GPUs are used broadly for deep learning, and NVIDIA is increasingly known as "the AI computing company." Come, join our DL Architecture team, where you can help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 13, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Nvidia logo

About Nvidia

Sourced by ZipRecruiter

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Santa Clara, CA, US

Year founded

1993