1

Sambanova Jobs (NOW HIRING)

next page

Showing results 1-20

Sambanova information

See salary details

$8

$26

$61

How much do sambanova jobs pay per hour?

As of Jun 29, 2026, the average hourly pay for sambanova in the United States is $26.34, according to ZipRecruiter salary data. Most workers in this role earn between $15.14 and $30.77 per hour, depending on experience, location, and employer.

What is Sambanova and what do they do?

SambaNova Systems is a technology company that specializes in developing advanced artificial intelligence (AI) hardware and software solutions. Their primary focus is on providing high-performance computing platforms, including AI chips and systems, to help organizations accelerate AI workloads such as machine learning and deep learning. SambaNova's products are used in various industries, including research, finance, and healthcare, to power complex AI models and improve efficiency. The company is known for its innovative approach to AI infrastructure, enabling faster and more flexible deployment of AI applications.

Is SambaNova being acquired?

SambaNova is a technology company specializing in AI hardware and software solutions. As of now, there are no publicly announced plans for SambaNova to be acquired. Job seekers should monitor official company communications for any updates on corporate changes.

What opportunities for professional growth are available for engineers working at SambaNova Systems?

Engineers at SambaNova Systems often have the chance to work on cutting-edge AI hardware and software projects alongside industry experts, which accelerates professional development. The company encourages continuous learning through internal knowledge-sharing sessions, access to advanced technical resources, and support for attending industry conferences. Additionally, the fast-paced, collaborative environment allows engineers to take ownership of impactful projects, fostering skill expansion in both technical and leadership areas. Career advancement is supported through mentorship and opportunities to move into specialized or managerial roles as the company grows.

What does SambaNova actually do?

SambaNova is a company that develops advanced hardware and software solutions for artificial intelligence and machine learning workloads. Their products include AI hardware accelerators and software platforms designed to optimize data processing and model training, often used in data centers and enterprise environments.

What are the key skills and qualifications needed to thrive as a SambaNova Systems Engineer, and why are they important?

To excel as a SambaNova Systems Engineer, you need a strong background in computer science, machine learning, and hardware/software integration, typically supported by a relevant degree. Familiarity with AI/ML frameworks, dataflow architectures, and programming languages like Python or C++, as well as experience with SambaNova’s DataScale platform, are essential. Strong problem-solving, collaboration, and communication skills help in navigating complex projects and working effectively with cross-functional teams. These competencies enable engineers to deliver cutting-edge AI solutions and maximize the performance of advanced hardware systems.

What is the difference between Sambanova vs Data Scientist?

AspectSambanovaData Scientist
Required CredentialsTypically requires a background in computer science, machine learning, or AI, often with a master's or PhDRequires a degree in computer science, statistics, or related fields; often with advanced degrees
Work EnvironmentWorks in AI hardware and software development, often in tech companies or research labsWorks in data analysis, modeling, and insights generation across various industries
Employer & Industry UsageUsed by tech companies developing AI hardware/software solutionsEmployed across finance, healthcare, tech, and consulting industries

While Sambanova focuses on AI hardware and software development, Data Scientists analyze data to inform business decisions. Both roles require strong technical skills and advanced education, but they serve different functions within the tech ecosystem.

How many employees are at SambaNova?

As of 2023, SambaNova Systems has approximately 1,000 employees. The company specializes in AI hardware and software solutions, and its workforce includes engineers, researchers, and technical staff working in a collaborative environment.

Who are SambaNova customers?

SambaNova customers typically include large enterprises, research institutions, and government agencies seeking advanced AI and data processing solutions. They often require expertise in machine learning, data infrastructure, and may utilize SambaNova's hardware and software platforms for high-performance computing tasks.
More about Sambanova jobs
What cities are hiring for Sambanova jobs? Cities with the most Sambanova job openings:
What states have the most Sambanova jobs? States with the most job openings for Sambanova jobs include:
Cloud Site Reliability Engineer

Cloud Site Reliability Engineer

SambaNova Systems

San Jose, CA

$66.75 - $88.75/hr

Other

Posted 17 hours ago


Key responsibilities

  • Take shared ownership of the production inferencing service, including availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning across multiple regions.

  • Participate in a balanced on-call rotation to provide 24/7 support for the AI inferencing service.

  • Develop and maintain advanced monitoring, alerting, and dashboarding to ensure service health and actionable alerts with a low false-positive rate.


Job description

About SambaNova Systems

Join the company that's building the future of AI computing. At SambaNova, we are disrupting the AI and high-performance computing space with our integrated hardware and software platform. Our DataScale systems and SambaFlow software are pushing the boundaries of what's possible with generative AI and large language models. We are a team of passionate innovators tackling some of the world's most challenging computational problems.

The Role

As a Cloud Site Reliability Engineer (SRE) specializing in our AI Inferencing Service, you will be the guardian of its reliability, performance, and scalability. You will bridge the gap between software development and operations, applying an engineering mindset to solve operational challenges. Your primary focus will be ensuring our inference endpoints have exceptional uptime, low-latency response times, and efficient resource utilization, directly impacting the experience of our customers and the success of our AI products. This role includes participating in a shared on-call rotation to maintain 24/7 service reliability. 

What You'll Do

Service Ownership & On-Call: Take shared ownership of the production inferencing service, including its availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning across multiple regions. This includes implementing and supporting AI infrastructure in new regions, such as Asia, Europe, and Latin America, to support the growth of our business.  Participate in a balanced on-call rotation to provide 24/7 support for the service.

On-Call & Work-Life Balance

We believe a sustainable on-call schedule is critical for long-term success and team health. Our on-call philosophy is built on the following principles:

  • Balanced Rotation: The on-call rotation is shared equally across the team, typically following a primary/secondary (follow-the-sun) model to ensure no single person bears a disproportionate burden.
  • Focus on Prevention: We invest heavily in automation, robust testing, and system design to prevent pages before they happen. The goal of on-call is not to heroically fight fires, but to manage rare, complex failures and use those learnings to make the system more resilient.
  • Actionable Alerts: We have a strict policy against alert fatigue. Alerts must be actionable and require immediate human intervention.
  • Incident Management: Lead the response to incidents affecting the inferencing service, driving blameless post-mortems and implementing corrective actions to prevent recurrence.
  • Monitoring & Alerting: Develop and maintain advanced monitoring, alerting, and dashboarding (using tools like Prometheus, Grafana, Datadog) to gain deep insights into service health, model performance (e.g., latency, throughput, error rates), and accelerator utilization. A key responsibility is ensuring alerts are actionable and have a low false-positive rate, minimizing on-call fatigue.
  • Performance & Scalability: Proactively identify and eliminate performance bottlenecks. Design and implement auto-scaling policies to handle variable inference loads cost-effectively. Use insights from on-call incidents to drive improvements that enhance system stability and scalability.
  • Infrastructure as Code (IaC): Manage and evolve our cloud infrastructure (on AWS, GCP, and/or Azure along with on-prem) using tools like Terraform and Ansible, ensuring it is secure, repeatable, and scalable.
  • CI/CD & Automation: Champion automation by building and improving CI/CD pipelines for the seamless and safe deployment of new model versions and service updates. A core goal is to automate manual toil identified during on-call shifts, reducing future operational overhead.
  • Capacity Planning: Forecast infrastructure needs based on product roadmaps and usage trends. Work with finance and engineering teams to manage cloud costs and optimize spending.
  • SLOs & SLIs: Define, measure, and report on Service Level Objectives (SLOs) and Indicators (SLIs) for the inferencing platform, using data to drive prioritization and reliability investments.


What We're Looking For (Must-Haves)
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • 3-5+ years of experience in a Site Reliability Engineer, DevOps, or related role supporting a large-scale, customer-facing service in a public cloud environment (AWS, GCP, Azure).
  • Strong programming/scripting skills in languages like Python, Go, or Java.
  • Proven experience with containerization and orchestration technologies (Docker, Kubernetes).
  • Deep understanding of monitoring and observability principles and tools (e.g., Prometheus, Grafana, ELK Stack, Datadog).
  • Solid experience with Infrastructure as Code (e.g., Terraform, CloudFormation).
  • Familiarity with CI/CD principles and tools (e.g., Jenkins, GitHub Actions, ArgoCD).
  • Excellent problem-solving skills and a systematic approach to troubleshooting complex distributed systems.
What Will Make You Stand Out (Nice-to-Haves)
  • Experience in a hybrid environment bridging cloud and on-premise/data center infrastructure.
  • Direct experience supporting ML/AI inferencing services in production.
  • Familiarity with GPU-accelerated computing and optimizing workloads for NVIDIA GPUs for purposes of mapping to RDUs.
  • Knowledge of model serving frameworks like vLLM, SGLang or Ray.
  • Understanding of MLOps principles and practices.
  • Experience with managing and tuning databases (SQL or NoSQL) and caching systems (Redis, Memcached).
  • Strong Linux/Unix system administration fundamentals.
Why SambaNova?
  • Massive Impact: You will be a key part of a critical platform with high visibility and direct impact on our product and engineers.
  • Cutting-Edge Technology: Work with a world-class team on one of the most advanced AI stacks in the industry.
  • Autonomy and Growth: We trust you to make technical decisions. This is a greenfield opportunity to build something remarkable from the ground up.
  • Competitive Compensation: Including equity, excellent benefits, and a flexible work environment.