Director Site Reliability Engineering Jobs (NOW HIRING)

The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response ...

Director, Site Reliability Engineering

Bethpage, NY · On-site

$58.25 - $77.50/hr

The Director, Site Reliability Engineering, will lead a team which is responsible for operating, architecting and building key infrastructure that runs the high availability services that our ...

Director, Site Reliability Engineering

Denver, CO · On-site

$58.75 - $78/hr

The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response ...

Director, Site Reliability Engineering

Phoenix, AZ · On-site

$56.50 - $75.25/hr

The Director, Site Reliability Engineering plays a critical role in ensuring the continuous and reliable operation of SmartRent services at the property level. This role drives system reliability ...

Site Reliability Engineering

Los Angeles, CA · On-site

$61.50 - $81.50/hr

Site Reliability Engineering (SRE). Location: Los Angeles, CA. Remote position. Full-time position. Job description: Site Reliability Engineer. Experience in cloud platforms (AWS, Azure, Google Cloud) and hybrid ...

Director Site Reliability Engineering information

How much do director site reliability engineering jobs pay per hour?

As of Jun 22, 2026, the average hourly pay for director of site reliability engineering roles in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.
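
As a rough cross-check, the hourly figures above can be converted to annual pay. The sketch below assumes a 2,080-hour work year (40 hours a week for 52 weeks), which is a common convention and not something the salary data states; the function name `annual_from_hourly` is purely illustrative.

```python
# Assumption: a full-time year is 40 hours/week * 52 weeks = 2,080 paid hours.
HOURS_PER_YEAR = 40 * 52


def annual_from_hourly(hourly_rate: float, hours_per_year: int = HOURS_PER_YEAR) -> float:
    """Convert an hourly rate to an approximate annual figure, rounded to cents."""
    return round(hourly_rate * hours_per_year, 2)


if __name__ == "__main__":
    # Hourly figures quoted in the answer above.
    for label, rate in [("low", 54.81), ("average", 63.74), ("high", 72.84)]:
        print(f"{label}: ${rate:.2f}/hr is about ${annual_from_hourly(rate):,.2f}/yr")
```

Under that assumption, the $63.74 average works out to roughly $132,579 per year, and the typical range to roughly $114,005 to $151,507. Different hours-per-year assumptions would shift these numbers.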

What is a Director Site Reliability Engineering job?

A Director of Site Reliability Engineering (SRE) leads teams responsible for ensuring the availability, performance, and scalability of software systems. They define reliability best practices, drive automation, and collaborate with engineering and product teams to improve system resilience. This role requires strong leadership, technical expertise, and a focus on balancing innovation with operational stability.

What are the main challenges faced by a Director of Site Reliability Engineering, and how can I prepare for them?

A Director of Site Reliability Engineering often encounters challenges such as balancing rapid feature delivery with system stability, managing complex incident responses, and fostering a culture of continuous improvement. Additionally, aligning reliability goals with business objectives and securing cross-functional buy-in can be demanding. To prepare, it is helpful to gain experience in high-scale system management, develop strong leadership and communication abilities, and cultivate a proactive approach to risk management and automation. Staying up to date with the latest SRE practices and building relationships with both engineering and business teams will also support your success in this pivotal role.

What are the key skills and qualifications needed to thrive in the Director Site Reliability Engineering position, and why are they important?

To thrive as a Director of Site Reliability Engineering, you need extensive experience in software engineering, infrastructure management, incident response, and people leadership, often supported by a degree in computer science or a related field. Familiarity with cloud platforms (such as AWS, GCP, or Azure), automation tools (Terraform, Ansible), and monitoring systems (Prometheus, Datadog) is valued, as are relevant certifications such as CKA or AWS Solutions Architect. Outstanding communication, stakeholder management, and strategic vision are key soft skills that set leaders apart in this role. These abilities ensure the reliability, scalability, and efficiency of critical systems while effectively guiding and motivating technical teams.

Infographic showing various Director Site Reliability Engineering job openings in the United States as of June 2026, with employment types broken down into 2% As Needed, 86% Full Time, 8% Part Time, 2% Temporary, and 2% Contract. Highlights a 95% Physical, 2% Hybrid, and 3% Remote job distribution, with an average salary of $132,583 per year, or $63.7 per hour.

Director, Site Reliability Engineering

eGain Corporation

Sunnyvale, CA • On-site

$250K/yr

Full-time

Posted 13 days ago


Job description

eGain is the leader in AI knowledge management solutions for enterprises. As organizations recognize the critical value of trusted knowledge and content feeding AI systems, eGain provides the single source of truth (explainable, reliable, and maintainable) that serves as the repository for all enterprise know-how including SOPs, policy documents, troubleshooting guides, and product information. This foundation enables scalable and effective AI automation of business operations, with customer service as the primary point of ROI. Our solutions power leading companies including JP Morgan, Liberty Mutual, Florida Blue, and Bosch.
The Opportunity
Join us in reimagining knowledge management as mission-critical infrastructure for the AI-powered enterprise. We're seeking talented, hungry, and bold leaders to shape the future of how enterprises leverage AI and knowledge at scale.
Position Overview
As Director of Site Reliability Engineering, you will ensure that eGain's AI knowledge management platform operates with the reliability, performance, and resilience that enterprise customers demand. You'll lead the strategy and execution for observability, incident management, capacity planning, and continuous improvement of our production systems. This role is critical as our platform becomes mission-critical infrastructure for the world's leading enterprises.
Key Responsibilities
  • Build and lead a world-class SRE organization that ensures exceptional reliability and performance of eGain's cloud services
  • Define and achieve ambitious SLOs/SLAs that meet the demands of enterprise customers operating 24/7 customer service operations
  • Establish comprehensive observability across the platform including monitoring, logging, tracing, and alerting
  • Drive incident response processes, post-mortems, and continuous improvement to prevent recurring issues
  • Lead capacity planning and performance optimization to ensure the platform scales efficiently with customer growth
  • Implement automation for deployment, operations, and remediation to reduce toil and improve reliability
  • Partner with platform and application engineering teams to build reliability into the system from the ground up
  • Champion a culture of reliability engineering across the organization, educating teams on best practices
  • Manage disaster recovery planning and business continuity to protect customer operations
  • Own the technical relationship with customers on reliability and performance topics
What We're Looking For
  • 10+ years of experience in software engineering, operations, or SRE roles with 5+ years in SRE leadership
  • Deep expertise in observability tools, monitoring systems, and incident management practices
  • Strong background in distributed systems, cloud infrastructure, and production operations at scale
  • Experience establishing and achieving SLOs/SLAs for enterprise SaaS or mission-critical systems
  • Proficiency with automation, infrastructure-as-code, and modern DevOps/SRE tooling
  • Track record of improving system reliability through data-driven approaches and systematic problem-solving
  • Excellent incident management and crisis leadership skills
  • Strong collaboration abilities and experience partnering with engineering teams to improve reliability
  • Passion for operational excellence and continuous improvement
  • Bold thinking about what's possible in system reliability combined with pragmatic execution
Why eGain
  • Ensure the reliability of systems that power customer service for the world's leading enterprises
  • Build SRE practices from the ground up with significant impact and visibility
  • Work with modern cloud technologies and solve complex reliability challenges
  • Lead a team focused on operational excellence and engineering rigor
Our Hiring Process is "Easy with eGain"
Step 1
Written test
  • Aptitude section - this is a GRE style test (60 minutes or less)
  • Functional section - this is a take-home test

Step 2
Panel interview (in-person at eGain Sunnyvale office)
Next step
Email your résumé to [email protected] with the position title "Director, Site Reliability Engineering" in the email subject.
Compensation
  • Base salary is $250,000 per year.
  • Stock options.

Please note that the compensation package can vary based on the candidate's qualifications and experience level.