1

Contract Computer Vision Deep Learning Engineer Jobs in Texas

... Deep Learning Engineer, NLP Engineer, Computer Vision Engineer, AI Research Scientist, Robotics ... Most contracts allow additional experience (4-5 years) in lieu of a Bachelor's Degree. Some ...

Senior Deep Learning Compiler Engineer

Austin, TX · On-site

$103.60K - $142.20K/yr

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than ... Doing what's never been done before takes vision, innovation, and the world's best talent. As an ...

... Deep Learning Engineer, NLP Engineer, Computer Vision Engineer, AI Research Scientist, Robotics ... Most contracts allow additional experience (4-5 years) in lieu of a Bachelor's Degree. Some ...

We're looking for a Deep Learning Researcher to help us push the state-of-the-art in image and ... Significant prior experience with CNNs and other common computer vision network architectures * 2 ...

Senior ML Engineer

Addison, TX

$101.20K - $138.90K/yr

... deep learning, and reinforcement learning. Experience with cloud platforms such as Google Cloud ... Experience with natural language processing (NLP) or computer vision (CV) techniques. Experience ...

Senior ML Engineer

Addison, TX · On-site

$101.20K - $138.90K/yr

... unsupervised learning, deep learning, and reinforcement learning. • Experience with cloud ... computer vision (CV) techniques. • Experience with continuous integration and continuous ...

next page

Showing results 1-20

Contract Computer Vision Deep Learning Engineer information

What are the most commonly searched types of Computer Vision Deep Learning Engineer jobs in Texas? The most popular types of Computer Vision Deep Learning Engineer jobs in Texas are:
What are popular job titles related to Contract Computer Vision Deep Learning Engineer jobs in Texas? For Contract Computer Vision Deep Learning Engineer jobs in Texas, the most frequently searched job titles are:
What job categories do people searching Contract Computer Vision Deep Learning Engineer jobs in Texas look for? The top searched job categories for Contract Computer Vision Deep Learning Engineer jobs in Texas are:
What cities in Texas are hiring for Contract Computer Vision Deep Learning Engineer jobs? Cities in Texas with the most Contract Computer Vision Deep Learning Engineer job openings:
Senior Deep Learning Frameworks CUDA Software Engineer

Senior Deep Learning Frameworks CUDA Software Engineer

Nvidia

Austin, TX

$121.40K - $160.10K/yr

Full-time

Posted 15 days ago


Job description

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

We are looking for a motivated Deep Learning engineer to bring advanced CUDA features and Distributed Runtime technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX, etc. You will be working with the team that created core CUDA features and runtimes for scaling Deep Learning and HPC applications. Your customers will have diverse multi-GPU demands, ranging from training on scales up to 100K GPUs to inference down at microsecond latency. CUDA features improve both productivity and performance of AI applications. Your work in AI toolkits will accelerate enabling those for the community. This is an outstanding opportunity for someone with an AI background to advance the state of the art in this space. Are you ready to contribute to the development of innovative technologies and help realize NVIDIA's vision?

What you will be doing:

  • Integrate new CUDA features and Runtime abstractions in AI frameworks: from PoC to performance analysis to production

  • Perform deep analysis of AI workloads and frameworks to identify requirements and opportunities to innovate in the lower layers of the stack. Collaborate hands-on with teams working on the latest AI models.

  • Own and drive improvements in the AI Compiler-Runtime interface to build speed-of-light multi-GPU multi-node solutions.

  • Design fault-tolerant and elastic solutions for large-scale or dynamic AI workloads.

  • Influence the roadmap of core CUDA to facilitate building next-gen DL frameworks.

  • Collaborate with a very dynamic team across multiple time zones.

  • Collaborate closely with AI researchers, HW and SW architects, kernel and compiler authors and CUDA driver experts to co-design systems and frameworks that enhance performance and programmability.

  • Develop exploratory tools and runtime systems to profile and accelerate new paradigms in deep learning.

  • Write clean, effective, and maintainable code, ensuring exploratory prototypes can smoothly transition into open-source releases, upstream framework integrations, internal tools, or closed-source commercial products.

What we need to see:

  • BS, MS, or PhD degree in Computer Science, Computer Engineering, Electrical Engineering, or related field (or equivalent experience).

  • 8+ years of relevant industry experience or equivalent academic experience after completed degree.

  • Development experience with Deep Learning Frameworks such PyTorch, JAX, and Inference Engines such as TRT-LLM, vLLM, SGLang

  • Rapid prototyping and development with Python, C++, CUDA or related DSLs

  • Solid grasp of AI models, parallelisms, and/or compiler technologies (e.g. torch.compile)

  • Experience conducting performance benchmarking on AI clusters. Familiarity with at least one performance profiler toolchain (PyTorch profiler, NVIDIA Nsight Systems)

  • Understanding of HPC/AI communication concepts

  • Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals)

  • Adaptability and passion to learn new frameworks and tools

  • Flexibility to work and communicate effectively across different teams and timezones

Ways to stand out from the crowd:

  • Deep expertise in the performance internals and execution graphs of major deep learning autograd, training and inference frameworks (e.g., PyTorch, JAX, TensorRT, vLLM, sgLang, Nemo, Megatron, MaxText, etc.).

  • Hands-on experience with CUDA, specific communication libraries (e.g., NCCL, MPI, UCX) and distributed machine learning techniques (e.g., pipeline parallelism, tensor parallelism).

  • Expertise in one or more of these areas: Training, Distributed inference, MoE, Reinforcement Learning, kernel authoring (on CUDA, Triton, cuTe, etc).

  • Background in deep learning compilers, both graph-level and codegen (e.g., Triton, XLA, torch compile)

  • Experience with programming for compute & communication overlap in distributed runtime

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 18, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Nvidia logo

About Nvidia

Sourced by ZipRecruiter

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Santa Clara, CA, US

Year founded

1993