1

Chronosphere Jobs (NOW HIRING)

next page

Showing results 1-20

Chronosphere information

What are the key skills and qualifications needed to thrive as a Cloud Platform Engineer at Chronosphere, and why are they important?

To thrive as a Cloud Platform Engineer at Chronosphere, you need strong expertise in cloud computing, distributed systems, and programming languages like Go or Python, often supported by a degree in computer science or related experience. Familiarity with Kubernetes, CI/CD pipelines, observability tools, and cloud providers such as AWS or GCP is typically required. Strong problem-solving, communication skills, and the ability to collaborate in fast-paced, remote teams make candidates stand out. These skills ensure the reliable delivery, scalability, and performance of Chronosphere's cloud-native observability platform.

What are the typical challenges faced by engineers working on observability platforms like Chronosphere?

Engineers working on observability platforms such as Chronosphere often encounter challenges related to handling massive volumes of real-time telemetry data and ensuring low-latency performance. Balancing scalability with cost efficiency, maintaining reliability during rapid growth, and integrating seamlessly with customers' diverse cloud-native environments are also common hurdles. Team members regularly collaborate with product, customer success, and infrastructure teams to address these issues, making strong communication and problem-solving skills especially valuable in this role.

What is a Chronosphere?

A Chronosphere typically refers to a device or concept related to time manipulation, often seen in science fiction or certain video games. In the context of gaming, particularly in the game Dota 2, Chronosphere is an ultimate ability used by the hero Faceless Void that creates a time-freezing sphere, trapping all units inside except Faceless Void. In technology, Chronosphere is also the name of a cloud-native observability platform designed to monitor and troubleshoot large-scale, distributed systems. Understanding the context is important to determine which meaning is relevant to your needs.

What is the difference between Chronosphere vs Data Engineer?

AspectChronosphereData Engineer
Required credentialsCloud monitoring, observability tools, sometimes certifications in cloud platformsData management, SQL, programming languages, certifications like Google Cloud or AWS
Work environmentCloud-based, monitoring and observability platformsData warehouses, pipelines, cloud or on-premises data systems
Employer and industry usageTech companies, cloud service providers, SaaS firmsFinance, healthcare, tech, retail, any data-driven industry

Chronosphere focuses on cloud monitoring and observability solutions, helping companies track system performance. Data Engineers build and maintain data pipelines and infrastructure. While both roles work within tech environments, Chronosphere emphasizes system monitoring, whereas Data Engineers focus on data processing and management.

More about Chronosphere jobs
What cities are hiring for Chronosphere jobs? Cities with the most Chronosphere job openings:
What states have the most Chronosphere jobs? States with the most job openings for Chronosphere jobs include:
Infographic showing various Chronosphere job openings in the United States as of May 2026, with employment types broken down into 95% Full Time, and 5% Contract. Highlights an 77% Physical, and 23% Remote job distribution.

Senior Production Engineer (IC4) (Remote)

Ontrac Solutions

Chicago, IL โ€ข Remote

$90 - $100/hr

Full-time

Posted 2 days ago


Job description

About Ontrac Solutions

At Ontrac Solutions, we partner with elite engineering organizations to build systems that operate at planetary scale. Our team supports complex cloud, infrastructure, automation, and production engineering initiatives for organizations modernizing critical platforms and high-availability environments.

We are seeking a highly skilled Senior Production Engineer IC4 to support a critical customer engagement. This role is ideal for a hands-on engineering professional with deep experience in infrastructure modernization, Linux systems, Python automation, production support, and large-scale migration execution.

Role Overview

The Senior Production Engineer will work closely with Cloud Platform Engineering, CloudTech SRE, internal engineering teams, and customer stakeholders to support the modernization of legacy infrastructure into production-ready environments.

This individual will help lead complex operating system upgrades, packaging migrations, configuration management transitions, observability improvements, CI/CD hardening, and service onboarding efforts across a large-scale infrastructure footprint.

The ideal candidate is comfortable executing independently, owning technical workstreams, resolving complex production issues, and documenting repeatable processes for long-term operational success.

Key Responsibilities
  • Lead and execute large-scale OS modernization efforts, including migrations from RHEL7 to EL8/EL9 across approximately 1,700 systems and virtual machines.
  • Support configuration management transitions, including Chef to CINC and legacy package/configuration migration from yinst to RPM.
  • Build, maintain, and configure RPM packages to support infrastructure modernization and application migration efforts.
  • Develop, execute, and improve automated runbooks for OS upgrades, configuration changes, service onboarding, and production support.
  • Triage, own, and resolve complex production issues, including high-priority S-bugs and infrastructure-related incidents.
  • Harden CI/CD pipelines, observability frameworks, and rollout/rollback mechanisms for legacy-to-modern infrastructure transitions.
  • Partner closely with CloudTech SRE to provide follow-the-sun Tier-2 production support, including hands-on incident response and break/fix operations.
  • Onboard services to modern monitoring, logging, and observability stacks.
  • Support migrations from legacy monitoring tools such as Yamas to platforms such as Chronosphere, Prometheus, and Grafana.
  • Assist with log management and Splunk integration strategies.
  • Partner with application development teams during cloud cutovers, component migrations, and production readiness activities.
  • Automate repetitive operational tasks using Python and related tooling.
  • Document technical procedures, runbooks, migration steps, and operational standards.
Required Qualifications
  • 5+ years of professional software engineering, production engineering, SRE, DevOps, or infrastructure engineering experience.
  • Strong hands-on experience with Python for automation, tooling, scripting, and operational workflows.
  • Experience supporting Linux infrastructure in production environments, ideally including RHEL7, EL8, and EL9.
  • Experience with OS modernization, infrastructure migration, or large-scale systems upgrade initiatives.
  • Hands-on experience with package management and build processes, preferably including RPM packaging.
  • Experience with configuration management tools such as Chef, CINC, Ansible, Puppet, or similar platforms.
  • Strong understanding of production support, incident response, break/fix workflows, and Tier-2 operational support.
  • Experience hardening CI/CD pipelines and supporting safe rollout/rollback processes.
  • Familiarity with observability, monitoring, logging, and alerting frameworks.
  • Ability to work independently, manage technical tasks, and communicate clearly with engineering and stakeholder teams.
  • Strong documentation skills and the ability to create repeatable runbooks and operational procedures.
Preferred Qualifications
  • Experience with Chef to CINC migrations.
  • Experience with yinst to RPM migration or similar legacy packaging transitions.
  • Experience supporting monitoring migrations from Yamas to Chronosphere, Prometheus, or Grafana.
  • Experience with Splunk log management strategy and integration.
  • Experience supporting developers through cloud cutovers and application migration phases.
  • Experience working with Cloud Platform Engineering, SRE, or infrastructure modernization teams.
  • Familiarity with NetAuto or similar network automation / operational support tooling.
  • Experience operating in a follow-the-sun support model.
  • Prior experience supporting high-scale cloud, infrastructure, or platform engineering environments.
Scope of Work / Delivery Expectations

The contractor will help drive the technical transition of legacy systems to modern infrastructure environments. Expected workstreams include:

  • Migrating and updating configurations across approximately 1,700 systems and virtual machines from RHEL7 to EL8/EL9.
  • Developing and executing automated runbooks for OS upgrades and configuration management changes.
  • Building and maintaining RPM packages to replace legacy configuration and packaging processes.
  • Supporting the transition of monitoring infrastructure to a modern observability stack, including Chronosphere, Prometheus, and Grafana.
  • Supporting Splunk integration and logging strategies.
  • Providing Tier-2 operational support and incident response under a follow-the-sun model.
  • Partnering with application developers during cloud migration and cutover phases.
  • Improving CI/CD pipelines, deployment safety, and rollback readiness.
  • Creating documentation to support repeatable operational processes and long-term platform maintainability.