Principal Engineer
Denver, CO · Remote
Develop agent infrastructure, tool interfaces, evaluation frameworks, observability standards, and operational guardrails. * Establish engineering standards for integrating AI systems safely into ...
Denver, CO · Remote
Develop agent infrastructure, tool interfaces, evaluation frameworks, observability standards, and operational guardrails. * Establish engineering standards for integrating AI systems safely into ...
Denver, CO · Remote
Develop agent infrastructure, tool interfaces, evaluation frameworks, observability standards, and operational guardrails. * Establish engineering standards for integrating AI systems safely into ...
$53.25 - $73/hr
I mplement monitoring, logging, and observability solutions (Prometheus, Grafana, ELK, Datadog). * E mbed DevSecOps practices into pipelines and infrastructure. * E nsure high availability, disaster ...
$53.25 - $73/hr
I mplement monitoring, logging, and observability solutions (Prometheus, Grafana, ELK, Datadog). * E mbed DevSecOps practices into pipelines and infrastructure. * E nsure high availability, disaster ...
Denver, CO · On-site
$53.25 - $73/hr
Implement monitoring, logging, and observability solutions (Prometheus, Grafana, ELK, Datadog). * Embed DevSecOps practices into pipelines and infrastructure. * Ensure high availability, disaster ...
Denver, CO · On-site
$53.25 - $73/hr
Implement monitoring, logging, and observability solutions (Prometheus, Grafana, ELK, Datadog). * Embed DevSecOps practices into pipelines and infrastructure. * Ensure high availability, disaster ...
Denver, CO · On-site +1
$149K - $157K/yr
LLM observability, tool-use tracking, failure detection, and graceful degradation * Architect for blast radius containment - agent failures must have bounded customer impact through isolation ...
Denver, CO · On-site +1
$149K - $157K/yr
LLM observability, tool-use tracking, failure detection, and graceful degradation * Architect for blast radius containment - agent failures must have bounded customer impact through isolation ...
Colorado Springs, CO · Hybrid
$164K - $190K/yr
Ifyou'veworked in cloud provisioning, infrastructure operations, observability, or platform engineering and love explaining how thingsactually work , this role was built for you. WhatYou'llDo Own the ...
Colorado Springs, CO · Hybrid
$164K - $190K/yr
Ifyou'veworked in cloud provisioning, infrastructure operations, observability, or platform engineering and love explaining how thingsactually work , this role was built for you. WhatYou'llDo Own the ...
Colorado Springs, CO · Hybrid
$164K - $190K/yr
Ifyou'veworked in cloud provisioning, infrastructure operations, observability, or platform engineering and love explaining how thingsactually work , this role was built for you. WhatYou'llDo Own the ...
Colorado Springs, CO · Hybrid
$164K - $190K/yr
Ifyou'veworked in cloud provisioning, infrastructure operations, observability, or platform engineering and love explaining how thingsactually work , this role was built for you. WhatYou'llDo Own the ...
Colorado Springs, CO · Hybrid
$164K - $190K/yr
Ifyou'veworked in cloud provisioning, infrastructure operations, observability, or platform engineering and love explaining how thingsactually work , this role was built for you. WhatYou'llDo Own the ...
Colorado Springs, CO · Hybrid
$164K - $190K/yr
Ifyou'veworked in cloud provisioning, infrastructure operations, observability, or platform engineering and love explaining how thingsactually work , this role was built for you. WhatYou'llDo Own the ...
Denver, CO · On-site
$90K - $105K/yr
Run and improve our deploy pipelines, observability stack, and infrastructure as code. Triage and respond to production alerts and customer-reported issues. Collaborate with engineering on the ...
Quick apply
Denver, CO · On-site
$90K - $105K/yr
Run and improve our deploy pipelines, observability stack, and infrastructure as code. Triage and respond to production alerts and customer-reported issues. Collaborate with engineering on the ...
Denver, CO · On-site
$90K - $105K/yr
Run and improve our deploy pipelines, observability stack, and infrastructure as code. Triage and respond to production alerts and customer-reported issues. Collaborate with engineering on the ...
Quick apply
Denver, CO · On-site
$90K - $105K/yr
Run and improve our deploy pipelines, observability stack, and infrastructure as code. Triage and respond to production alerts and customer-reported issues. Collaborate with engineering on the ...
Denver, CO · On-site
$58.75 - $78/hr
The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response ...
Denver, CO · On-site
$58.75 - $78/hr
The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response ...
Westminster, CO · On-site
$105K - $144K/yr
What You'll Do Observability, Monitoring & Alerting Leadership * Own and improve monitoring, alerting, and operational visibility patterns across critical enterprise systems. * Partner with MOE, TSOC ...
Westminster, CO · On-site
$105K - $144K/yr
What You'll Do Observability, Monitoring & Alerting Leadership * Own and improve monitoring, alerting, and operational visibility patterns across critical enterprise systems. * Partner with MOE, TSOC ...
Denver, CO · On-site +1
Improve system observability through metrics, structured logging, dashboards, and alerting * Participate in code and design reviews with a strong emphasis on security, correctness, and failure modes
Denver, CO · On-site +1
Improve system observability through metrics, structured logging, dashboards, and alerting * Participate in code and design reviews with a strong emphasis on security, correctness, and failure modes
Denver, CO · On-site
$58.75 - $78/hr
The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response ...
Denver, CO · On-site
$58.75 - $78/hr
The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response ...
Denver, CO · Hybrid
$58.75 - $78/hr
Influence of service and system design to improve fault tolerance, observability and operational sustainability. * Debug complex production issues across application code, services and infrastructure ...
Denver, CO · Hybrid
$58.75 - $78/hr
Influence of service and system design to improve fault tolerance, observability and operational sustainability. * Debug complex production issues across application code, services and infrastructure ...
Denver, CO · Hybrid
$58.75 - $78/hr
Influence of service and system design to improve fault tolerance, observability and operational sustainability. * Debug complex production issues across application code, services and infrastructure ...
Denver, CO · Hybrid
$58.75 - $78/hr
Influence of service and system design to improve fault tolerance, observability and operational sustainability. * Debug complex production issues across application code, services and infrastructure ...
$58.75 - $78/hr
Advanced Observability Vision : Dictate the enterprise strategy for observability frameworks, ensuring the Four Golden Signals (Latency, Traffic, Errors, and Saturation) provide actionable ...
$58.75 - $78/hr
Advanced Observability Vision : Dictate the enterprise strategy for observability frameworks, ensuring the Four Golden Signals (Latency, Traffic, Errors, and Saturation) provide actionable ...
Denver, CO · On-site
$58.75 - $78/hr
Observability Implementation: Assist in building and maintaining observability frameworks. Help track the Four Golden Signals (latency, traffic, errors, and saturation) to ensure service health is ...
Denver, CO · On-site
$58.75 - $78/hr
Observability Implementation: Assist in building and maintaining observability frameworks. Help track the Four Golden Signals (latency, traffic, errors, and saturation) to ensure service health is ...
Englewood, CO · On-site +1
Cross-cutting concerns: observability (OpenTelemetry, CloudWatch), security posture (Auth0 consolidation, IAM), and data architecture (DynamoDB single-table design, Aurora consolidation). * Mentoring ...
Englewood, CO · On-site +1
Cross-cutting concerns: observability (OpenTelemetry, CloudWatch), security posture (Auth0 consolidation, IAM), and data architecture (DynamoDB single-table design, Aurora consolidation). * Mentoring ...
$58.75 - $78/hr
Advanced Observability Vision : Dictate the enterprise strategy for observability frameworks, ensuring the Four Golden Signals (Latency, Traffic, Errors, and Saturation) provide actionable ...
Quick apply
$58.75 - $78/hr
Advanced Observability Vision : Dictate the enterprise strategy for observability frameworks, ensuring the Four Golden Signals (Latency, Traffic, Errors, and Saturation) provide actionable ...
Denver, CO · On-site
$58.75 - $78/hr
Advanced Observability Vision : Dictate the enterprise strategy for observability frameworks, ensuring the Four Golden Signals (Latency, Traffic, Errors, and Saturation) provide actionable ...
Denver, CO · On-site
$58.75 - $78/hr
Advanced Observability Vision : Dictate the enterprise strategy for observability frameworks, ensuring the Four Golden Signals (Latency, Traffic, Errors, and Saturation) provide actionable ...
$17.19 - $23.88
0% of jobs
$23.88 - $30.56
0% of jobs
$30.56 - $37.25
2% of jobs
$37.25 - $43.94
5% of jobs
$43.94 - $50.62
10% of jobs
$53.76 is the 25th percentile. Wages below this are outliers.
$50.62 - $57.31
17% of jobs
The median wage is $62.59 / hr.
$57.31 - $64
20% of jobs
$64 - $70.68
18% of jobs
$71.88 is the 75th percentile. Wages above this are outliers.
$70.68 - $77.37
15% of jobs
$77.37 - $84.06
9% of jobs
$84.06 - $90.74
4% of jobs
$17
$63
$90
To thrive in an Observability role, you need a strong background in monitoring, alerting, logging, and analyzing system performance, often supported by a degree in computer science or related field. Familiarity with tools such as Prometheus, Grafana, Datadog, Splunk, and experience with cloud platforms and scripting languages is crucial. Excellent problem-solving, communication, and collaboration skills help you work effectively with cross-functional engineering and operations teams. These capabilities are essential to ensure system reliability, quickly detect issues, and maintain seamless digital experiences.
An Observability job focuses on ensuring the performance, reliability, and health of software systems by collecting, analyzing, and visualizing telemetry data such as logs, metrics, and traces. Professionals in this field work with monitoring tools, distributed tracing, and alerting systems to detect and troubleshoot issues proactively. They collaborate with engineering and operations teams to improve system visibility, reduce downtime, and enhance overall system performance.
In an Observability role, your daily tasks often include designing and maintaining monitoring dashboards, configuring alerts, analyzing system logs, and working closely with development and operations teams to troubleshoot issues. You'll proactively identify areas of improvement to increase system reliability, document monitoring strategies, and support incident response efforts. Collaboration is key, as you may participate in post-incident reviews and help drive architectural improvements based on the data you collect. The role is dynamic and requires a proactive approach to ensure systems stay healthy and downtime is minimized.

Contractor
Medical, Dental, Vision, Life, Retirement, PTO
Posted 17 days ago
Sourced by ZipRecruiter
Recruiting and staffing services
201 - 500 Employees
Irvine, CA, US