Job Summary:
NVIDIA has been transforming computer graphics and accelerated computing for more than 25 years and is seeking a Senior Deep Learning Engineer to develop systems and algorithms for autonomous vehicles. The role involves collaborating with researchers and software engineers to take AI models from training to production, with a focus on large language models (LLMs) and vision-language models (VLMs).
Responsibilities:
• Explore state-of-the-art LLMs and VLMs for search and classification of autonomous vehicle (AV) scenarios.
• Hands-on model development, such as fine-tuning LLMs and VLMs for internal use cases.
• Collaborate with software engineers and researchers to ensure seamless integration of models from training to deployment.
Qualifications:
Required:
• Master’s or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience)
• 10+ years of professional experience in deep learning or applied machine learning.
• Strong foundation in deep learning algorithms, including hands-on experience with LLMs and VLMs.
• Deep understanding of general transformer architectures, inference bottlenecks, and popular model architectures such as the Qwen family.
• Proficient in building and deploying models using PyTorch in production-grade environments.
• Solid programming skills in Python.
Preferred:
• Proven experience deploying LLMs or VLMs at scale in real-world applications using vLLM or SGLang.
• Hands-on experience with fine-tuning techniques such as SFT, DPO, and GRPO.
• Proven experience in developing image and video search solutions at scale.
Company:
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. Founded in 1993, the company is headquartered in Santa Clara, USA, and has more than 10,000 employees. It is currently a late-stage company.