Observability Jobs in Colorado (NOW HIRING)

Sr Platform Engineer-1

Denver, CO · On-site

$150K - $165K/yr

Develop, and manage the Observability OpenTelemetry Central Backend Stack: Grafana Enterprise, Mimir, Loki, Tempo, and Alertmanager on Kubernetes/RKE2 via Helm and GitLab CI-CD. * Build and manage ...

Senior Backend Software Engineer

Boulder, CO · Hybrid

$127K - $167K/yr

This is a hands-on engineering role focused on backend systems, platform engineering, observability, and infrastructure-aware software development. You'll work closely with software, infrastructure ...

Director, Site Reliability Engineering

Denver, CO · On-site

$58.75 - $78/hr

The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response ...

... observability, security, and deployment workflows • Partner with Product, Operations, and Engineering leadership on roadmap execution • Mentor engineers and establish engineering best practices ...

ArgoCD or Flux preferred • AIOps / Observability Engineering - 2+ years, Alertmanager rule authoring, anomaly detection integration, event correlation, noise reduction techniques • Working ...

Design, build, ship, and maintain the core observability libraries, tools, and patterns used by all of Checkr's engineering teams * Troubleshoot complex production issues across the stack, with ...

... observability tools. • Bonus: Exposure to HIPAA compliance, secure data exchange, and healthcare interoperability. • Proficient in enterprise relational databases (MS SQL Server, Oracle, or DB2 ...

Lead Cloud & DevOps Engineer

Denver, CO

$54.25 - $74.25/hr

The ideal candidate will have strong expertise in AWS-native architecture, infrastructure-as-code (Terraform), release engineering, observability, and secure platform operations in regulated ...

Software Engineer Principal

Arvada, CO · On-site

$135K - $181K/yr

Linux, Windows Server, PowerShell, Python, automation, scripting, platform engineering, configuration management, drift management, drift detection, observability, monitoring, Elastic, Dynatrace ...

Lead Cloud & DevOps Engineer

Denver, CO · On-site +1

$54.25 - $74.25/hr

The ideal candidate will have strong expertise in AWS-native architecture, infrastructure-as-code (Terraform), release engineering, observability, and secure platform operations in regulated ...

Observability information

Colorado hourly pay scale: $17 · $63 · $90

How much do observability jobs pay per hour?

As of Jun 15, 2026, the average hourly pay for observability in Colorado is $63.65, according to ZipRecruiter salary data. Most workers in this role earn between $53.32 and $73.03 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in the Observability position, and why are they important?

To thrive in an Observability role, you need a strong background in monitoring, alerting, logging, and analyzing system performance, often supported by a degree in computer science or a related field. Familiarity with tools such as Prometheus, Grafana, Datadog, and Splunk, along with experience with cloud platforms and scripting languages, is crucial. Excellent problem-solving, communication, and collaboration skills help you work effectively with cross-functional engineering and operations teams. These capabilities are essential to ensure system reliability, quickly detect issues, and maintain seamless digital experiences.

What is an Observability job?

An Observability job focuses on ensuring the performance, reliability, and health of software systems by collecting, analyzing, and visualizing telemetry data such as logs, metrics, and traces. Professionals in this field work with monitoring tools, distributed tracing, and alerting systems to detect and troubleshoot issues proactively. They collaborate with engineering and operations teams to improve system visibility, reduce downtime, and enhance overall system performance.

What are the typical day-to-day responsibilities of someone in an Observability role?

In an Observability role, your daily tasks often include designing and maintaining monitoring dashboards, configuring alerts, analyzing system logs, and working closely with development and operations teams to troubleshoot issues. You'll proactively identify areas of improvement to increase system reliability, document monitoring strategies, and support incident response efforts. Collaboration is key, as you may participate in post-incident reviews and help drive architectural improvements based on the data you collect. The role is dynamic and requires a proactive approach to ensure systems stay healthy and downtime is minimized.

Infographic showing Observability job openings in Colorado as of June 2026: 100% full-time, 100% in-person, with an average salary of $132,395 per year, or $63.7 per hour.
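The annual and hourly averages quoted on this page are consistent with a standard full-time year. As a minimal sketch, assuming 2,080 paid hours per year (40 hours × 52 weeks; this constant and the function names are illustrative assumptions, not something ZipRecruiter states), the conversion can be checked like this:

```python
# Assumption: a full-time year is 40 hours/week x 52 weeks = 2,080 paid hours.
# The page does not state which convention it uses; this is only a sanity check.
HOURS_PER_YEAR = 40 * 52


def annual_to_hourly(annual_salary: float) -> float:
    """Convert an annual salary to an hourly rate, rounded to cents."""
    return round(annual_salary / HOURS_PER_YEAR, 2)


def hourly_to_annual(hourly_rate: float) -> float:
    """Convert an hourly rate to an annual salary, rounded to cents."""
    return round(hourly_rate * HOURS_PER_YEAR, 2)


if __name__ == "__main__":
    # Average annual salary quoted in the infographic.
    print(annual_to_hourly(132_395))
    # Posted range for the Sr Platform Engineer-1 listing ($150K - $165K/yr).
    print(annual_to_hourly(150_000), annual_to_hourly(165_000))
    # Posted hourly range for the Director, SRE listing ($58.75 - $78/hr).
    print(hourly_to_annual(58.75), hourly_to_annual(78))
```

Under that assumption, $132,395 per year works out to about $63.65 per hour, which matches the average hourly figure quoted in the FAQ. The same helper makes it easier to compare listings posted as annual salaries with those posted as hourly rates.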
Sr Platform Engineer-1

Flexential

Denver, CO • On-site

$150K - $165K/yr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 3 days ago


Job description

The Senior Platform Engineer is a hands-on engineering role on a platform development team responsible for building and operating Flexential's IT platforms, including observability, DevOps, ITSM incident and release management, and integration technologies. This role develops and manages critical platform subsystems for high availability, operational resiliency, security, and scalability, utilizing native-AI enablement for all outcomes. This is an individual contributor role with significant technical ownership and direct impact on Flexential's critical technology roadmap.
You will work across infrastructure, automation, and application layers - deploying Kubernetes workloads, authoring Terraform modules, building Ansible playbooks, and building GitLab pipelines that other engineers depend on daily.
Key Responsibilities and Essential Job Functions:
  • Design, develop, and operationally manage automated, resilient, highly available, self-healing, secure platforms with native-AI capabilities for IT needs, serving both internal and customer business capabilities

  • Develop and manage the Observability OpenTelemetry Central Backend Stack: Grafana Enterprise, Mimir, Loki, Tempo, and Alertmanager on Kubernetes/RKE2 via Helm and GitLab CI/CD.

  • Build and manage IaC and CI/CD for automated provisioning and deployment, including Terraform modules for infra/VM/storage provisioning, Ansible AWX playbooks for OS/app bootstrap, and ArgoCD and Helm for Kubernetes configuration.

  • Develop and manage an OpenTelemetry Prometheus scrape profile library, including SNMP exporters, REST API exporters, and cloud provider exporters (CloudWatch, Azure Monitor, GCP) for multiple device classes.

  • Develop AIOps capabilities on platforms, e.g. for observability use cases: anomaly detection integrations, event correlation rules in Alertmanager, and synthetic monitoring patterns to reduce alert noise.

  • Configure and maintain Zabbix auto-discovery: network range scanning, device classification, and Prometheus service discovery integration.

  • Build and harden Edge Stack deployments (Prometheus + OTel collector) per data center site using GitOps templates.

  • Integrate Alertmanager with ServiceNow: webhook routing, ticket enrichment, auto-close logic, and escalation policy configuration.

  • Maintain platform security: Conjur/CyberArk secret injection at runtime, mTLS between stack components, RBAC in Grafana Enterprise.

  • Author and maintain Grafana dashboards in JSON/GitLab - facility overview, network health, RED metrics, application telemetry.

  • Mentor mid-level engineers, lead code reviews, and establish engineering standards for the team. Represent platform engineering in cross-functional architecture reviews and executive-level program updates.

  • Perform other duties as required and assigned

Required Qualifications:
  • DevOps / Automation - 5+ years in a production environment, Kubernetes (RKE2/k3s), Helm chart deployment, system services, Docker/container

  • LGTM Stack Development and Configuration - 4+ years: Grafana, Mimir, Loki, Tempo configuration, tuning, dashboarding, and production operations; Prometheus required

  • Senior-level Python / Scripting frameworks - 5+ years, Automation scripts, exporter development, GitLab pipeline scripting, REST API integrations

  • GitOps / CI/CD - 5+ years, GitLab CI/CD pipeline authoring; Terraform and Ansible as primary IaC tools; ArgoCD or Flux preferred

  • AIOps / Observability Engineering - 2+ years, Alertmanager rule authoring, anomaly detection integration, event correlation, noise reduction techniques

  • Working infrastructure (Linux/VM) management knowledge - 5+ years, Linux administration, VMware vCenter/VCF experience, NetApp storage management, network fundamentals (SNMP, TCP/IP)

  • Secrets Management - 2+ years, CyberArk/Conjur, HashiCorp Vault, or equivalent - runtime secret injection patterns

  • Minimal travel may be required

Preferred Skills:
  • Experience with and/or knowledge of ITSM processes and workflow automation, e.g. Incident & Response Management (IRM), release management, ServiceNow ITSM integration, alert routing, escalation policy design, SLA-driven on-call workflows

  • Hands-on experience with or working knowledge of Boomi integration platform as a service (iPaaS) technologies

  • Experience working with BAS / BMS systems in a data center / OT environment.

  • Hands-on experience working with AWS products under the Well-Architected Framework and a multi-account model to develop compute, storage, and network IaaS and PaaS services for IT applications.

Base Pay Range: Annualized/Hourly salary range offered for this position is estimated to be $150,000 - $165,000. However, the actual pay range depends on each candidate's experience, location, and qualifications.
Not meeting every single requirement? No problem! We are looking for candidates who possess unique skills that set them apart from the rest. If you're enthusiastic about this role and believe you have the skills and abilities that would make you successful, don't hesitate to apply today!
Benefits of working at Flexential:
• Medical, Telehealth, Dental and Vision
• 401(k)
• Health Savings Accounts (HSA) and Flexible Spending Accounts (FSA)
• Life and AD&D
• Short Term and Long-Term disability
• Flex Paid Time Off (PTO)
• Leave of Absence
• Employee Assistance Program
• Wellness Program
• Rewards and Recognition Program
Benefits are subject to change at the Company's discretion.
Flexential participates in the E-Verify program.
EEOC Statement: Flexential is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law.