Software Operations Jobs in California (NOW HIRING)

Conduct basic software operations to support real-time data collection. * Accurately document observations, issues, and anomalies encountered during test runs. * Collaborate with engineers, providing ...

Sr. Software Engineer

San Jose, CA · On-site

$144K - $189K/yr

Knowledge of Beckhoff TwinCAT/EtherCAT software operation is a plus * Knowledge of factory automation and SEMI standards (SECS/GEM) is a plus * Knowledge of Windows GUI development is a plus * Strong ...

Software Operations information

Is operations a high-paying job?

Software operations roles can offer competitive salaries, especially with experience and specialized skills such as automation, cloud platforms, or scripting. Compensation varies by industry, location, and company size, but these positions often include benefits and opportunities for advancement.

What is the difference between Software Operations and Software Development?

Aspect | Software Operations | Software Development
Primary Focus | Maintaining, deploying, and supporting software systems | Designing, coding, and creating software applications
Required Skills | System administration, scripting, troubleshooting | Programming, software design, problem-solving
Work Environment | IT departments, production environments | Development teams, coding labs
Certifications | ITIL, Linux, Cloud certifications | Java, Python, Agile certifications

Software Operations focuses on maintaining and supporting existing software systems, ensuring stability and performance. In contrast, Software Development involves creating new software applications through coding and design. While both roles require technical skills, their daily tasks and objectives differ significantly, making them distinct career paths within the tech industry.

What are the key skills and qualifications needed to thrive as a Software Operations professional, and why are they important?

To excel in Software Operations, you need a strong background in systems administration, software deployment, and IT infrastructure management, often supported by a degree in computer science or related fields. Familiarity with tools such as CI/CD pipelines, cloud platforms (like AWS or Azure), and monitoring systems, as well as certifications in areas like DevOps or cloud technologies, is highly beneficial. Excellent problem-solving, communication, and collaboration skills help you manage incidents efficiently and coordinate with development and support teams. These abilities are crucial for maintaining software reliability, minimizing downtime, and ensuring seamless operations within technology-driven organizations.

What jobs pay 200,000 a year in the USA?

In the field of Software Operations, roles such as senior software engineers, solutions architects, and engineering managers can earn $200,000 or more annually, especially with extensive experience, advanced skills in cloud platforms, and leadership responsibilities. High-paying positions often require specialized knowledge, certifications, and a track record of managing complex projects or teams.

What is Software Operations?

Software Operations refers to the management, monitoring, and maintenance of software applications and systems throughout their lifecycle. This role ensures that software runs smoothly, efficiently, and securely in production environments. Responsibilities often include deploying updates, troubleshooting issues, optimizing performance, and collaborating with development and IT teams. The goal is to maximize software reliability and minimize downtime for end users. Software Operations professionals play a critical part in supporting business continuity and user satisfaction.

What are the typical daily responsibilities of someone working in Software Operations?

In a Software Operations role, your daily tasks often involve monitoring software systems for performance and reliability, managing deployments and updates, and responding to incidents or outages. You may collaborate closely with development, QA, and IT teams to ensure smooth releases and quick issue resolution. Additionally, documenting procedures, optimizing workflows, and automating recurring tasks are common aspects, helping to maintain efficient and stable software environments.

What are software operations?

Software operations involve managing and maintaining software systems to ensure their reliable performance, availability, and security. This includes tasks such as deployment, monitoring, troubleshooting, and updates, often using tools like automation scripts and monitoring platforms. Professionals in this field focus on optimizing software workflows and minimizing downtime.

What jobs in the US pay 300,000 a year?

In software operations, senior roles such as Software Engineering Managers, Directors of Software Development, and Principal Software Engineers can earn $300,000 or more annually, especially with extensive experience, advanced skills, and leadership responsibilities. These positions often require strong technical expertise, project management skills, and sometimes certifications or advanced degrees.

What are the most commonly searched types of Software Operations jobs in California?

The most popular types of Software Operations jobs in California are:

[Infographic: Software Operations job openings in California as of June 2026. Employment types: 87% Full Time, 9% Part Time, 2% Temporary, 2% Contract. Work location: 96% Physical, 1% Hybrid, 3% Remote.]

Senior Technical Program Manager, DGX Cloud Software Products and Services

Nvidia Corporation

Santa Clara, CA • On-site

Full-time

Posted 23 days ago


Job description

NVIDIA's DGX Cloud (DGXC) powers AI for strategic research and product workloads. The company seeks an expert Technical Program Manager (IC5) to lead strategic programs emphasizing resilience, reliability, and goodput. This role requires collaboration across multiple teams. It involves driving improvements in resilience, service stability, and operational scale. The TPM also guides architectural decisions related to resilience reference architecture. The TPM leads programs spanning DGXC infrastructure, Resilience Tools, and core platform services to deliver fault-tolerant, high-availability training and inference environments at scale.
We are looking for a TPM who is analytical, technically skilled, and comfortable working with cloud infrastructure, software, operations, and environments driven by data and research. You will work closely with engineering, SRE, operations, and researchers to develop scalable resilience strategies, improve operational performance, and assist in building open, modular software components and reference stacks for DGX Cloud at scale.
What You'll Be Doing:
  • Lead cross-functional programs that improve resilience, reliability, operational scale, and fleet-wide goodput across DGX Cloud.
  • Partner across infrastructure, platform, site reliability, operational, and tenant teams to identify systemic risks, resolve cross-stack dependencies, and improve end-to-end service stability.
  • Drive the definition and adoption of resilience reference stacks, operational standards, and scalable guidelines that strengthen service readiness and recovery.
  • Partner with engineering teams and researchers to support the development and delivery of open, modular software components for resilience, facilitating reusable and extensible capabilities across the platform.
  • Build and scale resilience tooling and operational mechanisms that improve observability, failure detection and attribution, root cause analysis, recovery orchestration, and operational readiness.
  • Define, measure, and improve goodput, using data-driven insights to increase usable fleet capacity, workload efficiency, and customer outcomes at scale.
  • Establish clear metrics, dashboards, and operating cadences to track program health, reliability posture, operational maturity, and performance.

What we need to see:
  • MS EE or CS degree, or equivalent experience.
  • 8+ years of experience in program management of large-scale software or infrastructure projects.
  • Proven track record of leading complex cross-functional programs in cloud, infrastructure, distributed systems, or platform environments.
  • Strong analytical skills with the ability to assess issues across infrastructure, software, and operational layers.
  • Excellent organizational skills and ability to use project management tools (e.g. Jira, Aha!, Confluence) and distributed version control systems (e.g. Git).
  • Solid understanding of reliability engineering, resilience development, and service performance metrics, including goodput, efficiency, and utilization.
  • Experience working alongside engineering, SRE, operations, and technical collaborators to advance projects in ambiguous, high-complexity environments.
  • Outstanding communication and presentation skills for diverse technical and non-technical audiences with strong problem-solving and conflict management skills.

Ways To Stand Out From The Crowd:
  • Background in computer science, machine learning, deep learning, open-source software, GPU technology, AI infrastructure, or large-scale compute platforms.
  • Experience with large-scale AI training environments (e.g., distributed training frameworks, checkpointing, NCCL, Slurm or other schedulers).
  • Prior experience in the management of customer workflows using large scale distributed computing and working with AI researchers or directly training and evaluating AI models.
  • Proven ability to harness AI-enabled workflows and tools to improve program management efficiency, decision-making, execution visibility, and operational efficiency.

Widely considered to be one of the technology world's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer you and your family at www.nvidiabenefits.com/
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 258,750 USD for Level 4, and 200,000 USD - 322,000 USD for Level 5.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until May 8, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About Nvidia

Sourced by ZipRecruiter

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Santa Clara, CA, US

Year founded

1993