Job Summary:
NVIDIA is a world-class leader in AI workload optimization, and they are seeking a senior software engineer to automate and optimize performance analysis workflows for AI training and inference workloads. The role involves designing and building performance analysis tools, collaborating with engineers, and identifying performance bottlenecks in AI workloads.
Responsibilities:
โข Design and build performance analysis tools and workflows for AI training and inference workloads.
โข Understand how AI performance engineers work and translate their needs into scalable, intuitive tooling.
โข Develop integrations between profiling infrastructure and AI frameworks and workflows.
โข Collaborate with performance engineers, hardware architects, and software teams to ensure profiling capabilities align with real-world AI workloads.
โข Identify performance bottlenecks in AI workloads and develop automated approaches to detect and diagnose them.
Qualifications:
Required:
โข M.S., or PhD in Computer Science, Computer Engineering, or a related field (or equivalent experience). 6+ years of relevant work experience
โข Deep knowledge of AI workloads, frameworks, and performance characteristics.
โข Experience building tools, workflows, or infrastructure used by other engineers.
โข Strong software development skills (Python, C++ preferred).
โข Ability to translate user requirements into scalable tooling solutions.
โข Up to date with AI-enabled tooling for software development and performance analysis.
โข Strong interpersonal skills for understanding engineer difficulties and working across multi-functional teams.
Preferred:
โข Experience profiling or optimizing AI training or inference pipelines at scale
โข Background building developer tools or platforms for ML engineers
โข Contributions to open-source AI tooling or frameworks
Company:
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. Founded in 1993, the company is headquartered in Santa Clara, USA, with a team of 10001+ employees. The company is currently Late Stage.