Site Reliability Engineers Jobs (NOW HIRING)

Site Reliability Engineer (SRE)

Englewood, CO · On-site

$56.25 - $74.75/hr

The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and ...

Site Reliability Engineering

Charlotte, NC · On-site

$55.75 - $74/hr

Site Reliability Engineer Location: Charlotte, NC (Onsite) Experience: 10+ Years Job Summary We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in Java application ...

Cloud Site Reliability Engineer

Santa Clara, CA · On-site

$67 - $89/hr

Our cloud operations team is where the rubber meets the road and needs innovative Site Reliability Engineers. Join a professional team of smart and hard-working professionals building enterprise ...

SRE

Austin, TX · On-site

$56.50 - $75/hr

ABOUT THIS FEATURED OPPORTUNITY This organization is seeking a Site Reliability Engineer (SRE) to support Clarity, a business planning tool, and Halo, an internal IT inventory platform used to track ...

Site Reliability Engineer (SRE)

Downingtown, PA · On-site

$59 - $78.50/hr

The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and ...

SRE

Charlotte, NC · On-site

$55.75 - $74/hr

Role: SRE Location: Charlotte, NC Skills: Grafana, Python, Splunk, Linux, Scripting. Microsoft 360 ... engineers to provide exceptional service. (1.) Key Responsibilities 1. Lead and manage a team of ...

SRE Manager / SRE Architect

New York, NY

$62.25 - $82.75/hr

- SRE Manager / SRE Architect (Hands-on) Location: New York City, NY / Fort Mill, SC (Hybrid) Employment Type: Full-Time / Contract Industry: Financial Services Position Overview We are seeking a ...

$53 - $70.50/hr

- SRE Manager / SRE Architect (Hands-on) Location: New York City, NY / Fort Mill, SC (Hybrid) Employment Type: Full-Time / Contract Industry: Financial Services Position Overview We are seeking a ...

Principal Site Reliability Engineers are key to building resilient systems that scale efficiently while minimizing downtime and risk. This opportunity will support the modernization of a large-scale ...

SRE Manager / SRE Architect

Fort Mill, SC · On-site

$50 - $66.50/hr

- SRE Manager / SRE Architect (Hands-on) Location: New York City, NY / Fort Mill, SC (Hybrid) Employment Type: Full-Time / Contract Industry: Financial Services Position Overview We are seeking a ...

Site Reliability Engineers information

How much do site reliability engineers jobs pay per hour?

As of Jun 9, 2026, the average hourly pay for site reliability engineers in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What is the difference between Site Reliability Engineers and DevOps Engineers?

Aspect | Site Reliability Engineers | DevOps Engineers
Primary Focus | Ensuring system reliability, uptime, and performance | Automating deployment, integration, and continuous delivery
Skills & Certifications | Cloud platforms, scripting, monitoring tools | CI/CD tools, scripting, cloud services
Work Environment | Operations, infrastructure, and incident response | Development pipelines, automation, collaboration
Industry Usage | Tech companies, cloud providers, large enterprises | Startups, tech firms, software development teams

While both roles focus on improving software systems, Site Reliability Engineers primarily ensure system stability and reliability, often working closely with operations teams. DevOps Engineers concentrate on automating and streamlining development and deployment processes. Both roles overlap in skills and tools but serve different core objectives within the software lifecycle.

What cities are hiring for Site Reliability Engineers jobs? Cities with the most Site Reliability Engineers job openings:
Senior Manager, Site Reliability Engineering

Tubi

San Francisco, CA

$67.25 - $89.25/hr

Other

Posted 11 days ago


Job description

About the Role:

Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization that applies a developer's mindset and toolkit to the challenges of building and running large-scale, distributed systems. Our mission is to engineer resilience from the ground up, enabling our product teams to innovate rapidly while ensuring our users have a stellar experience. We own the availability, latency, performance, and capacity of our platform, and we achieve our goals through a culture of data-driven decision-making, blameless learning, and relentless automation.

We are seeking an experienced and visionary Senior SRE Manager to lead and grow our newly built Site Reliability Engineering team. You are more than a people manager or a tech lead; you are the strategic leader responsible for architecting our reliability roadmap. You will build and mentor a team of talented engineers, foster a culture of blameless learning and continuous improvement, and champion the engineering practices that allow us to balance rapid innovation with rock-solid stability. You will be a key influencer in our engineering leadership, partnering with peers across the organization to ensure reliability is a shared responsibility and a core tenet of our engineering culture.

What You'll Do:

  • Team Leadership & Mentorship:
    • Lead, mentor, and grow a team of Site Reliability Engineers. Foster a culture of innovation and technical excellence where engineers feel empowered to do their best work. Provide personalized coaching, create professional development plans, and guide the careers of senior and emerging talent within the team.
    • Establish equitable, sustainable on-call practices (including global coverage where applicable) that protect focus time and avoid burnout.
    • Define team rituals - runbook reviews, game days, and incident retros - that reinforce quality and learning.
  • Strategic Planning & Vision: Define and drive the multi-year technical strategy and vision for Tubi's observability and automation platforms. Partner with the infra lead to align Tubi's infrastructure and SRE roadmaps, and with tech leaders to align the SRE roadmap with business objectives. Champion a data-driven approach to reliability, using Service Level Objectives (SLOs) and error budgets to facilitate productive conversations about risk and feature velocity.
  • Operational Excellence & Incident Management: 
    • Own the end-to-end availability, performance, and efficiency of our critical user-facing services. Evolve our incident response practice to reduce Mean Time to Resolution (MTTR) and increase Mean Time Between Failures (MTBF). Champion a rigorous, blameless, and data-driven post-mortem culture to ensure we learn from both successes and failures, driving engineering teams toward systemic fixes and automation that prevent incidents from recurring.
    • Streamline our existing processes and practices, and collaborate with other teams to raise our production release standards.
    • Define and tune a 24/7 on-call rotation for low noise and fast response; act as executive escalation partner during major incidents.
    • Own disaster-recovery strategy (playbooks, failover drills, recovery simulations) and track SLO gaps with time-bound remediations.
  • Financial & Vendor Management: Own the SRE budget, tooling, and headcount. Manage relationships with key third-party vendors for our observability and SRE-related AI platforms. Work with the infra lead and finance team on contract negotiations, and ensure we derive maximum value from our investments.
  • Cross-Functional Collaboration: Act as a key influencer and strategic partner to leaders in Software Engineering, Product Management, and Infra/Sec. Drive the adoption of SRE best practices and principles throughout the organization, ensuring new services are designed for reliability, scalability, and observability from day one.
  • The AI Mandate: Building the Future of Observability with AI. You will not just manage a team that uses AI; you will lead the charge in building an AI-native SRE function. This is a strategic mandate that requires a forward-thinking leader who understands both the potential and the pitfalls of integrating intelligent systems into critical operations. This includes:
    • AIOps Strategy Development: Developing and executing the strategy for integrating AIOps and machine learning into our observability stack. Your goal will be to move the team from a reactive monitoring posture to one of predictive maintenance and automated anomaly detection, fundamentally changing how we ensure reliability.
    • Accelerating Automation with AI: Championing the effective and responsible use of AI-assisted coding tools (e.g., Claude Code, Cursor) within the SRE team. You will set the standards and practices to leverage these tools to accelerate the development of automation, operational tooling, and infrastructure code.
    • Building the Business Case: Building the techno-economic case for new AI tooling, managing vendor relationships, and ensuring the cost-effective and secure implementation of these powerful systems. You must be able to articulate the ROI of these investments in terms of reduced downtime, improved operational efficiency, and faster incident resolution.
    • Fostering Critical AI Literacy: Fostering a culture that can critically evaluate, debug, and learn from the outputs of AI systems. This involves extending our blameless post-mortem philosophy to AI-driven actions and recommendations, ensuring that the team remains in control and understands the "why" behind automated decisions.

Your Background:

  • 8+ years of experience in a technical field, with at least a year in an engineering leadership position managing SRE, DevOps, or Production Engineering teams.
  • A deep, principled understanding of SRE tenets, including Service Level Indicators (SLIs), SLOs, error budgets, toil reduction, and capacity planning.
  • Exceptional communication, negotiation, and influencing skills, with the ability to articulate complex technical concepts and strategies to both technical and non-technical stakeholders at all levels of the organization.
  • A strong technical background as a hands-on software engineer or site reliability engineer prior to moving into management. Deep knowledge of AWS services (especially networking, IAM, EKS, ALBs/NLBs, Route 53, CloudWatch). Proven experience with Kubernetes in production (EKS preferred), including service exposure, networking, and availability engineering.
  • Hands-on familiarity with modern SRE tools and technologies, including Infrastructure as Code (e.g., Terraform, Ansible), container orchestration (Kubernetes), observability platforms (e.g., Prometheus, Grafana, Datadog, Splunk), incident tooling (e.g., PagerDuty, FireHydrant), deployment-safety tooling (e.g., Argo Rollouts, LaunchDarkly), and observability standards (e.g., OpenTelemetry).

#LI-BT1

#LI-Hybrid