Job Summary:
NVIDIA AI is building the software foundation for scalable, high-performance vehicle computing platforms that power autonomous driving and centralized vehicle architectures. They are seeking a Senior Software Engineer to lead architecture and optimization efforts across the autonomous driving software stack, focusing on deep neural network optimization and deployment on NVIDIA automotive compute platforms.
Responsibilities:
• Lead architecture and technical strategy for optimizing inference workloads in autonomous driving applications.
• Drive end-to-end performance analysis across DNN models, TensorRT/compiler flows, CUDA kernels, memory behavior, scheduling, runtime services, and automotive platform constraints.
• Develop and guide model optimization techniques such as quantization, pruning, distillation, graph optimization, operator fusion, kernel selection, and layout/memory optimization.
• Collaborate with TensorRT, CUDA, compiler, silicon architecture, perception, planning, DriveOS and safety platform teams.
• Build tools, methodologies, and metrics for profiling, benchmarking, debugging, and validating model and platform performance.
Qualifications:
Required:
• BS, MS, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or related field (or equivalent experience).
• 12+ years of software engineering experience in systems software, AI/ML infrastructure, deep learning inference, compiler/runtime technology, or platform performance.
• Strong C/C++ and practical Python experience.
• Deep familiarity with TensorRT, TensorRT-LLM, ONNX, PyTorch, CUDA, Triton, or related frameworks.
• Experience optimizing DNN models for latency, throughput, memory footprint, and power.
Preferred:
• Hands-on experience with TensorRT internals, CUDA kernels, Triton kernels, or other compiler/runtime technologies.
• Experience deploying optimized DNNs, LLMs, VLMs, or perception models on embedded, edge, robotics, or automotive platforms.
• Background in autonomous driving, ADAS, robotics, real-time systems, safety-aware software, or deterministic low-latency systems.
• Experience with ISO 26262, QNX, Safe RTOS, DriveOS, Linux, hypervisors, or virtualization.
Company:
Explore the latest breakthroughs made possible with AI. Founded in , the company is headquartered in Santa Clara, CA, US, , with a team of 10001+ employees. The company is currently Late Stage.