1

Observability Site Reliability Engineer Jobs in Colorado

Site Reliability Engineer

Denver, CO ยท On-site

$58.75 - $78/hr

... in Platform Engineering, Site Reliability Engineering, DevOps, or Systems Engineering roles ... Experience designing and operating observability platforms (e.g., Splunk, Sumo Logic, or similar)

Director, Site Reliability Engineering

Denver, CO ยท On-site

$58.75 - $78/hr

... (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response, automation, and CI/CD practices for ...

Director, DevSecOps& SRE

Golden, CO

$58.75 - $78.25/hr

Create and own an Observability Platform to track and support application health * Hire, mentor, and guide SRE and cloud infrastructure engineers. * Determinetechnicalobjectivesand manage software ...

Possess deep expertise in modern software engineering practices, SRE, Agile, DevSecOps, and CI/CD, Observability, deployment techniques like Blue-Green, Canary to minimize down-time and enable A/B ...

CO

$138.40K - $173K/yr

... observability patterns. Along with your team, you'll ensure all aspects of our shared product ... You'll collaborate or embed with engineering teams, helping them to improve the reliability and ...

DevOps Engineer - SRE and SaaS Support

Englewood, CO ยท On-site

$56.50 - $75.25/hr

Mid-Level DevOps Engineer With Site Reliability Engineering Experience We are seeking a mid-level ... Observability implementation: Assist in configuring and maintaining monitoring solutions using ...

Site Reliability Engineer The Opportunity: Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network ...

next page

Showing results 1-20

Observability Site Reliability Engineer information

What is the difference between Observability Site Reliability Engineer vs Monitoring Engineer?

AspectObservability Site Reliability EngineerMonitoring Engineer
FocusEnsuring system reliability through observability, automation, and incident responseImplementing and managing monitoring tools and dashboards
SkillsCloud platforms, scripting, incident management, observability toolsMonitoring tools, alerting systems, data analysis
Work EnvironmentDevOps teams, cloud infrastructure, large-scale systemsOperations teams, infrastructure monitoring

While both roles involve system health, the Observability Site Reliability Engineer focuses on comprehensive system reliability using observability practices, whereas Monitoring Engineers primarily manage monitoring tools and alerts. The SRE role emphasizes automation, incident response, and system resilience, making it broader in scope.

What job categories do people searching Observability Site Reliability Engineer jobs in Colorado look for? The top searched job categories for Observability Site Reliability Engineer jobs in Colorado are:
What cities in Colorado are hiring for Observability Site Reliability Engineer jobs? Cities in Colorado with the most Observability Site Reliability Engineer job openings:

Site Reliability Engineer (SRE)

1 point system

Greenwood Village, CO โ€ข On-site

$57.75 - $76.75/hr

Contractor

Posted 18 days ago


Job description

Job Role

Site Reliability Engineer (SRE)

Duration

Long-term Contract (through 2026)

Location

Greenwood Village, CO 80111

Work Model

Hybrid

Onsite Expectations

  • Contractors:
    • Flexible onsite schedule
    • Expected to come onsite a few times per week during ramp-up
    • Once ramped up: 1โ€“2 times per month
  • Full-Time Employees:
    • 4 days per week onsite

Project Scope & Platform Overview

The Configuration Management (CMX) Team experimentation and configuration management platforms used across all customer-facing products.

Platform Responsibilities

  • Internal A/B testing and experimentation platform
  • Ensures:
    • Safe product releases
    • No negative customer impact
    • Product enhancements drive measurable improvements

TDCS (Targeted Delivery Client Services) Platform

  • Real-time configuration management tool
  • Used across:
    • Streaming applications
    • Customer web portals
    • Client-facing platforms
  • Enables:
    • Market-specific targeting
    • Customer-specific experiences
    • Controlled experiments and testing variants

Role Focus

  • Primary support for TDCS platform
  • Migrating infrastructure into a dedicated AWS account
  • Maintain:
    • High availability
    • Low latency
  • Platform is business-critical:
    • Called multiple times daily by every application
    • Directly impacts customer access to streaming/video services

Top Skills Required

  • 6+ years of DevOps / SRE experience in large, complex environments
  • Strong development background (ability to read and understand code)
  • AWS
  • Terraform (Infrastructure as Code)
  • Kubernetes
  • GitLab or similar CI/CD tools
  • Datadog or similar monitoring tools

Job Description

The Applied AI and Data Science Program brings together data scientists, data engineers, and software engineers to empower Spectrum teams to safely release, test, and evaluate product changes. The mission is to deliver targeted, dynamic customer experiences while providing leaders with data-driven insights.

As a Senior Site Reliability Engineer, you will deploy, monitor, support, and optimize Charterโ€™s experimentation and configuration management platforms hosted on AWS. Youโ€™ll work closely with software engineers, test engineers, and DevOps teams to ensure these mission-critical systems remain highly available and performant.

Key Responsibilities

Release Management

  • Build and deploy application, service, and infrastructure releases
  • Validate system integrity post-deployment
  • Document release notes

Production Support

  • Maintain 99.999% availability of critical systems
  • Ensure smooth operation of infrastructure and applications
  • Keep infrastructure resources up to date
  • Participate in on-call rotation for incidents and outages
  • Perform root cause analysis for production issues

Monitoring & Alerting

  • Implement monitoring and alerting policies across systems
  • Build and enhance dashboards
  • Monitor:
    • Errors and unexpected behavior
    • Latency and resource consumption
    • System degradation
  • Proactively mitigate issues
  • Alert stakeholders when SLAs are at risk

Optimization

  • Manage scaling strategies aligned with project goals
  • Optimize system behavior and resource utilization

Team Collaboration

  • Assist with user support
  • Act as the system architecture and deployment expert
  • Coordinate with onshore and offshore teams
  • Develop bug fixes as needed

Primary Qualifications

  • Expertise with monitoring tools such as Datadog and/or Splunk
  • Strong experience with AWS services (EKS, S3, DocumentDB, etc.)
  • Experience supporting containerized microservices
  • Proficient with Terraform and AWS Console
  • Experience with performance benchmarking and testing
  • Hands-on deployment of cloud-based applications
  • Git-based source control experience (GitLab preferred)
  • Bachelorโ€™s degree or equivalent experience

Secondary / Nice-to-Have Qualifications

  • 6+ years of SDLC experience
  • Familiarity with:
    • Python, Node.js, React, TypeScript, GraphQL
  • Experience with:
    • SQL and NoSQL databases
    • Docker, Kubernetes, Redis
    • ORMs over relational databases
  • Exposure to experimentation platforms and statistical testing
  • Masterโ€™s degree or higher

Thanks and Regards

Monu Singh Chauhan |ย 1Point System LLC
Technical Recruiter
ย ย 
monu.singh@1pointsys.comย 

LinkedIn: linkedin.com/in/monu-singh-chauhan-610857204
115 Stone Village Driveย โ€ขย Suite Cย โ€ขย Fort Mill, SCย โ€ขย 29708

ย ย ย ย ย ย ย ย  An E-Verified company | An Equal Opportunity Employerย 

DISCLAIMER:ย If you have received this email in error or prefer not to receive such emails in the future, please notify by replying with a ''REMOVE'' in the subject line and your email address shall be removed immediately from the mailer list.