Remote Observability Engineer Jobs in Colorado (NOW HIRING)

... observability best practices, product usability, and SRE standards. Your Impact * Experienced operational leader who understands incident response, customer critical issue dynamics, engineering ...

Platform Engineer

Aurora, CO · On-site +1

Denver, CO preferred (Hybrid) | Also open to San Antonio, TX, Brooklyn, NY, or Remote (US-based ... observability, security, and deployment workflows • Partner with Product, Operations, and ...

... remote. Why This Role Exists We operate a large-scale, multi-tenant SaaS platform supporting ... Develop agent infrastructure, tool interfaces, evaluation frameworks, observability standards, and ...

Principal Software Engineer

Boulder, CO · On-site +1

$140K - $187K/yr

Strong distributed systems fundamentals: messaging, caching, service architecture, observability ... For exceptional candidates we would consider remote locations in these states: AZ, CA, CO, GA, MD ...

ML Engineer

Denver, CO · On-site +1

Remote (Preferred U.S. Time Zones) Employment Type: Full-Time Company: Performacentric About ... Collaborate on model serving, monitoring, logging, and observability. * Assist with infrastructure ...

Site Reliability Engineer II

Denver, CO · On-site +1

$98K - $138K/yr

Enhance and evolve monitoring tools and platforms to improve observability. * Promote and apply ... Wellness initiatives #BI-Remote DYN365, Inc d/b/a Restaurant365 is an equal opportunity employer.

Sr. Software Engineer I

Denver, CO · Remote

$126K - $166K/yr

Observability, on-call/operational support, incident analysis * AI-assisted development (Cursor ... Working experience in a PAAS environment 📍 Location This is a remote-first role. We are ...

Remote Observability Engineer information

What are the typical collaboration patterns for a Remote Observability Engineer working with distributed teams?

Remote Observability Engineers frequently collaborate with software developers, DevOps teams, and IT operations to ensure systems are monitored effectively and issues are detected early. Working remotely, you'll often use communication tools like Slack, Jira, and video conferencing to coordinate incident response, discuss monitoring strategies, and review system health dashboards. Regular sync meetings and asynchronous updates are common, and you'll likely contribute to documentation and knowledge sharing to keep all stakeholders informed. Building strong communication habits is important, as much of the troubleshooting and improvement work hinges on clear coordination with multiple teams.

What are the key skills and qualifications needed to thrive as a Remote Observability Engineer, and why are they important?

To thrive as a Remote Observability Engineer, you need strong expertise in monitoring, logging, and tracing systems, along with a background in computer science or a related technical field. Familiarity with tools like Prometheus, Grafana, ELK Stack, Datadog, and cloud platforms is typically required, and relevant certifications such as AWS Certified Cloud Practitioner or Google Cloud Professional Cloud DevOps Engineer are often listed as well. Excellent problem-solving abilities, communication skills, and a proactive mindset help you detect and resolve issues before they impact users. These competencies ensure system reliability, enable rapid incident response, and support seamless collaboration in distributed environments.

What is the difference between a Remote Observability Engineer and a Site Reliability Engineer?

| Aspect | Remote Observability Engineer | Site Reliability Engineer |
| --- | --- | --- |
| Credentials | Knowledge of monitoring tools, scripting, cloud platforms | Same as Observability Engineer, plus SRE certifications often preferred |
| Work Environment | Focus on monitoring, logging, and tracing systems remotely | Broader scope including system reliability, incident response, and automation |
| Industry Usage | Primarily in tech, SaaS, cloud services | Widely in tech, finance, and large-scale online services |

The Remote Observability Engineer specializes in monitoring and analyzing system performance remotely, focusing on telemetry such as logs and metrics. In contrast, the Site Reliability Engineer has a broader role, ensuring overall system reliability, automation, and incident management. While both roles require similar technical skills, SREs often have additional responsibilities related to system resilience and scalability.

What is a Remote Observability Engineer?

A Remote Observability Engineer is a professional responsible for designing, implementing, and maintaining systems that monitor the health, performance, and reliability of software applications and infrastructure from a remote location. They use observability tools to collect and analyze logs, metrics, and traces, helping organizations quickly detect and resolve issues. Their work ensures that distributed systems are transparent, reliable, and efficient, often collaborating with development, operations, and security teams. Remote Observability Engineers often work from anywhere, leveraging cloud-based tools and platforms to manage complex IT environments.

Senior Scalability Engineer - Observability

Judi Health

Denver, CO • On-site, Remote

$126K - $166K/yr

Full-time

This job post has expired today. Applications are no longer accepted.


Job description

About Judi Health
Judi Health is an enterprise health technology company providing a comprehensive suite of solutions for employers and health plans, including:
  • Capital Rx, a public benefit corporation delivering full-service pharmacy benefit management (PBM) solutions to self-insured employers,
  • Judi Health™, which offers full-service health benefit management solutions to employers, TPAs, and health plans, and
  • Judi®, the industry's leading proprietary Enterprise Health Platform (EHP), which consolidates all claim administration-related workflows in one scalable, secure platform.

Together with our clients, we're rebuilding trust in healthcare in the U.S. and deploying the infrastructure we need for the care we deserve. To learn more, visit www.judi.health.
Location: Remote
Position Summary:
Join our Scalability team as a Senior Scalability Engineer focused on observability platform development and engineering productivity. In this role, you will define, own, and build Judi Health's organization-wide observability strategy, tooling, and platform products. Beyond maintaining infrastructure, you'll architect and develop a custom observability platform that gives engineering teams powerful, fast, and cost-effective visibility into every layer of our infrastructure, from application logs and metrics to distributed traces. You'll build production-grade internal products using React/TypeScript frontends with Python and Rust backends, creating tools that fundamentally improve how engineers at Judi Health debug, monitor, and optimize their systems. Working closely with leadership and cross-functional teams, your work will be foundational to platform stability, performance optimization, and developer productivity across our rapidly growing healthcare platform.
Position Responsibilities:
In this role, you'll own the observability infrastructure that powers our engineering organization. You will:
  • Architect observability platform: Design, implement, and maintain the LGTM stack (Loki, Grafana, Tempo, Mimir/Prometheus) as the primary observability platform across all engineering teams, making architectural decisions that balance cost, performance, and developer experience.
  • Build internal observability products: Design and develop production-grade internal platform products with React/TypeScript frontends and Python/Rust backends that provide engineers with powerful log search, metrics visualization, and trace analysis capabilities.
  • Develop custom log indexing systems: Architect and build high-performance log indexing solutions using Rust that process logs and provide sub-second search across billions of log lines at a fraction of the cost.
  • Integrate SQL analytics for logs: Design and implement solutions leveraging AWS Athena or similar SQL query engines (DuckDB, ClickHouse) for ad-hoc log analysis and historical queries, enabling engineers to run complex SQL queries over S3-based log data for deep investigations and trend analysis.
  • Create advanced query interfaces: Build sophisticated web interfaces that allow engineers to query logs, metrics, and traces with features like saved queries, query templates, correlation analysis, and pattern detection, supporting both full-text search and SQL-based analytics.
  • Balance cloud-native and open-source: Architect solutions that thoughtfully leverage both AWS-managed services (CloudWatch, Athena, Kinesis) and open-source tooling (LGTM stack, Quickwit) to optimize for cost, performance, and operational flexibility based on use case requirements.
  • Integrate AWS observability: Design seamless integration between AWS CloudWatch Logs/Metrics and our custom observability platform, providing unified visibility across managed and self-hosted infrastructure.
  • Build intelligent alerting: Develop smart dashboards, monitors, and alerting systems that reduce noise, detect anomalies, and help teams respond to incidents quickly.
  • Partner with engineering teams: Work directly with product teams to integrate observability into their services, establish logging and metrics standards, and instrument code effectively, serving as the observability subject matter expert.
  • Enable performance optimization: Provide the observability foundation that allows the Scalability team to identify performance bottlenecks, track optimization impact, and measure platform stability with data-driven insights.
  • Establish observability standards: Define and document comprehensive observability standards including structured logging patterns, metric naming conventions, trace instrumentation, dashboard design principles, and query best practices.
  • Drive platform adoption: Lead workshops, create documentation, and build self-service tooling that democratizes observability across engineering, making it easy for teams to adopt best practices.
  • Demonstrate technical leadership: Mentor engineers on observability practices, lead architecture reviews for instrumentation approaches, and represent the Scalability team in cross-functional planning.
  • Work in an Agile/Scrum environment to continually deliver value to stakeholders and clients.
  • Code of Conduct: Responsible for adherence to the Capital Rx Code of Conduct including reporting of noncompliance.

Required Qualifications:
  • 10+ years of software engineering or infrastructure engineering experience with demonstrated progression into technical leadership roles.
  • Several years of experience leading technical initiatives, building platform products, or serving as a subject matter expert on observability infrastructure.
  • Strong experience with React/TypeScript for frontend development and Python (Flask/SQLAlchemy) for backend services.
  • LGTM stack expertise: Deep production experience with Loki, Grafana, Tempo, and Prometheus/Mimir for logs, metrics, and distributed tracing at scale.
  • AWS observability: Extensive experience with AWS CloudWatch Logs and Metrics, including custom metrics, log insights, dashboard creation, and integration patterns.
  • SQL analytics for logs: Production experience with SQL-based log analytics using AWS Athena, DuckDB, or similar query engines for analyzing structured and semi-structured data at scale.
  • Cloud-native and open-source balance: Demonstrated ability to architect solutions leveraging both managed cloud services and open-source tooling, understanding trade-offs between operational overhead, cost, flexibility, and vendor lock-in.
  • Search and indexing experience: Hands-on experience building or operating search systems using OpenSearch, Elasticsearch, Lucene, Tantivy, or similar search and analytics engines.
  • Performance-critical systems: Experience building high-performance systems that process large volumes of data efficiently (millions of log lines, high-cardinality metrics).
  • Systems thinking: Deep understanding of distributed systems, microservices architectures, and the complex observability challenges they present.
  • Data at scale: Proven track record handling high-volume structured and unstructured logging data, identifying patterns, and building efficient search/query solutions that perform well under load.
  • Product mindset: Ability to build internal platform products that engineers love to use, with attention to UX, performance, and reliability.

Preferred Qualifications:
  • Rust development experience: Production experience with Rust for building high-performance data processing, indexing, or search systems. Strong interest in learning Rust is acceptable if combined with systems programming experience in C/C++/Go.
  • Infrastructure as code: Experience with Terraform for managing observability infrastructure and AWS resources.
  • Additional observability platforms: Experience architecting or operating Datadog, New Relic, Splunk, or other enterprise observability platforms.
  • Advanced query languages: Deep expertise with PromQL, LogQL, SQL optimization, and query optimization for high-cardinality data.
  • Columnar storage formats: Experience with Parquet, ORC, or other columnar storage formats for efficient log storage and analytics on S3.
  • Incident management: Experience designing incident response workflows, postmortem processes, and SLO/SLI frameworks that drive reliability improvements.
  • Cost optimization: Track record of reducing observability costs while maintaining or improving capabilities (e.g., CloudWatch → S3/custom indexing migration).
  • Data pipelines: Experience with streaming data pipelines, ETL processes, or real-time data processing.
  • Distributed tracing: Deep knowledge of OpenTelemetry, Jaeger, Zipkin, or distributed tracing architectures.
  • Git expertise and experience working in a mono repository.
  • Previous Pharmacy Benefits Manager (PBM) or healthcare technology experience.
  • Experience building developer tools or internal platforms that improve engineering productivity.

This range represents the low and high end of the anticipated base salary range for the NY-based position. The actual base salary will depend on several factors, such as experience, knowledge, and skills, and on whether the location of the job changes.
Nothing in this position description restricts management's right to assign or reassign duties and responsibilities to this job at any time.
This range represents the low and high end of the anticipated base salary range. The actual base salary will depend on several factors such as: experience, knowledge, skills, and location of the job.
Remote, US Salary Range
$110,400-$213,000 USD
All employees are responsible for adherence to the Capital Rx Code of Conduct including the reporting of non-compliance. This position description is designed to be flexible, allowing management the opportunity to assign or reassign duties and responsibilities as needed to best meet organizational goals.
Judi Health values a diverse workplace and celebrates the diversity that each employee brings to the table. We are proud to provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, medical condition, genetic information, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.
By submitting an application, you agree to the retention of your personal data for consideration for a future position at Judi Health. More details about Judi Health's privacy practices can be found at https://www.judi.health/legal/privacy-policy.