SambaNova

11 jobs near Columbus, OH

ML Features Solutions Engineer

Austin, TX · On-site

$81.80K - $109K/yr

SambaNova is a leading company in the generative AI space, providing a full-stack AI platform optimized for enterprises. They are seeking an ML Features Solutions Engineer to drive the development ...

ML Features Solutions Engineer

Palo Alto, CA · On-site

$97.10K - $129.50K/yr

SambaNova is a leading company in the AI sector, specializing in generative AI solutions for enterprise and government organizations. They are seeking an ML Features Solutions Engineer to drive the ...

Sr Product Manager - AI Cloud

Palo Alto, CA · On-site

$148.90K - $196.50K/yr

SambaNova is a company focused on generative AI and its applications in enterprise and government organizations. They are seeking a Principal Product Manager to define the AI Cloud strategy, drive ...

SambaNova is at the forefront of AI computing, building a transformative generative AI platform for enterprise and government organizations. The Senior Cloud Platform Engineer will focus on the ...

Principal Cloud Backend Engineer

$128.50K - $177.10K/yr

SambaNova is at the forefront of the AI revolution, providing a generative AI platform optimized for enterprise and government organizations. They are seeking a Principal Cloud Backend Engineer to ...

Senior AI Systems Performance Engineer

SambaNova

Palo Alto, CA • On-site

Full-time

Posted 14 days ago


Job description

Job Summary:
SambaNova is a leading company in the generative AI space, providing a full-stack platform optimized for enterprise and government organizations. The role involves optimizing and scaling advanced foundation models on SambaNova's dataflow platform, collaborating with various teams to enhance performance and deliver high-performance AI applications.
Responsibilities:
• Bring up and optimize cutting-edge foundation models (e.g., DeepSeek, Llama, Qwen, and others) on the SambaNova platform through the SambaNova software stack.
• Profile and enhance model performance across compiler, runtime, and hardware layers to achieve SOTA throughput and latency.
• Collaborate with machine learning, compiler, runtime, and hardware teams to deliver co-designed, high-performance AI applications.
• Integrate the latest advances in model architecture, quantization, scheduling, and memory optimization from both academia and industry.
• Develop robust, scalable, and efficient end-to-end inference solutions aligned with customer needs.
• Identify performance bottlenecks and propose dataflow or scheduling optimizations for both single-node and distributed systems.
Qualifications:
Required:
• Bachelor's or higher degree in computer science, electrical engineering, or a related field (e.g., applied mathematics, physics, or statistics).
• 3+ years of experience in one or more of the following areas: Deep learning model development and performance optimization, Compiler, runtime, or kernel-level optimization, Software–hardware co-design or systems performance tuning.
• Proficiency in Python or C++, with strong foundations in algorithms, data structures, and numerical computing.
• Experience with at least one major ML framework — PyTorch, TensorFlow, or JAX.
• Demonstrated ability to analyze and optimize performance in real-world ML pipelines.
Preferred:
• Hands-on experience with LLM or multimodal model training and inference.
• Background in large-scale distributed training, continuous batching, and high-throughput inference systems.
• Familiarity with quantization, graph optimization, kernel fusion, and model partitioning.
• Experience with frameworks such as DeepSpeed, Megatron, vLLM, or TensorRT.
• Strong GPU programming skills (CUDA, Triton, or OpenCL); experience with cuDNN, cuBLAS, or similar libraries is a plus.
• Knowledge of memory hierarchy optimization, caching, and scheduling for large-scale model execution.
• Publication record or open-source contributions in ML systems or performance optimization is a plus.
Company:
SambaNova is an AI hardware and software company that specializes in providing infrastructure for AI and machine learning applications. Founded in 2017, the company is headquartered in Palo Alto, USA, with a team of 201-500 employees. The company is currently Growth Stage.