2

Remote Observability Engineer Jobs in Virginia (NOW HIRING)

Senior Infrastructure Engineer

Herndon, VA ยท On-site +1

$111K - $151K/yr

... open to remote candidates in certain states. Responsibilities : * Design and operate AWS ... Production monitoring and observability with Prometheus and Grafana (exporters, PromQL, alerting ...

Senior Infrastructure Engineer

Herndon, VA ยท On-site +1

$111K - $151K/yr

... open to remote candidates in certain states. Responsibilities : * Design and operate AWS ... Production monitoring and observability with Prometheus and Grafana (exporters, PromQL, alerting ...

Microservices Developer (Remote)

Alexandria, VA ยท Remote

$54.50 - $70.75/hr

... Developer to support our US Government client. * This will be a 100% remote contract-to-hire ... Add observability: structured logging, metrics, distributed tracing, dashboards, and alerting.

Microservices Developer (Remote)

Alexandria, VA ยท Remote

$54.50 - $70.75/hr

... Developer to support our US Government client. * This will be a 100% remote contract-to-hire ... Add observability: structured logging, metrics, distributed tracing, dashboards, and alerting.

Microservices Developer (Remote)

Alexandria, VA ยท On-site +1

$54.50 - $70.75/hr

... Developer to support our US Government client. * This will be a 100% remote contract-to-hire ... Add observability: structured logging, metrics, distributed tracing, dashboards, and alerting.

Evergreen: Senior AI Platform & Data Engineer

Mclean, VA ยท Remote

$107K - $145K/yr

AI Platform & Data Engineer This is a remote position. Ad Hoc is a technology company that empowers ... Expertise in AI/ML workflows, model deployment, and observability (latency, throughput). * Ability ...

Kafka Engineer

Arlington, VA ยท Remote

$100K - $130K/yr

Experience with monitoring tools and observability platforms for Kafka (e.g., Prometheus, Grafana ... Location: 100% Remote (US-based only) * Hours: 40 hours/week, availability during core business ...

Sr. Architect - Cloud Platforms

VA ยท On-site +1

$65.25 - $83/hr

... reliability, observability, and performance optimization. Platform Engineering & Developer ... Remote EEO Statement Maximus is an equal opportunity employer. We evaluate qualified applicants ...

$128K - $168K/yr

... observability. The ideal candidate will have a strong background in low-latency trading systems and ... REMOTE

next page

Showing results 1-20

Remote Observability Engineer information

What are the typical collaboration patterns for a Remote Observability Engineer working with distributed teams?

Remote Observability Engineers frequently collaborate with software developers, DevOps teams, and IT operations to ensure systems are monitored effectively and issues are detected early. Working remotely, you'll often use communication tools like Slack, Jira, and video conferencing to coordinate incident response, discuss monitoring strategies, and review system health dashboards. Regular sync meetings and asynchronous updates are common, and you'll likely contribute to documentation and knowledge sharing to keep all stakeholders informed. Building strong communication habits is important, as much of the troubleshooting and improvement work hinges on clear coordination with multiple teams.

What are the key skills and qualifications needed to thrive as a Remote Observability Engineer, and why are they important?

To thrive as a Remote Observability Engineer, you need strong expertise in monitoring, logging, and tracing systems, along with a background in computer science or related technical fields. Familiarity with tools like Prometheus, Grafana, ELK Stack, Datadog, and cloud platforms is typically required, as well as relevant certifications such as AWS Certified Cloud Practitioner or Google Cloud Professional DevOps Engineer. Excellent problem-solving abilities, communication skills, and a proactive mindset help you detect and resolve issues before they impact users. These competencies ensure system reliability, enable rapid incident response, and support seamless collaboration in distributed environments.

What is the difference between Remote Observability Engineer vs Site Reliability Engineer?

AspectRemote Observability EngineerSite Reliability Engineer
CredentialsKnowledge of monitoring tools, scripting, cloud platformsSame as Observability Engineer, plus SRE certifications often preferred
Work EnvironmentFocus on monitoring, logging, and tracing systems remotelyBroader scope including system reliability, incident response, and automation
Industry UsagePrimarily in tech, SaaS, cloud servicesWidely in tech, finance, and large-scale online services

The Remote Observability Engineer specializes in monitoring and analyzing system performance remotely, focusing on tools like logs and metrics. In contrast, the Site Reliability Engineer has a broader role, ensuring overall system reliability, automation, and incident management. While both roles require similar technical skills, SREs often have additional responsibilities related to system resilience and scalability.

What is a Remote Observability Engineer?

A Remote Observability Engineer is a professional responsible for designing, implementing, and maintaining systems that monitor the health, performance, and reliability of software applications and infrastructure from a remote location. They use observability tools to collect and analyze logs, metrics, and traces, helping organizations quickly detect and resolve issues. Their work ensures that distributed systems are transparent, reliable, and efficient, often collaborating with development, operations, and security teams. Remote Observability Engineers often work from anywhere, leveraging cloud-based tools and platforms to manage complex IT environments.
What are the most commonly searched types of Observability Engineer jobs in Virginia? The most popular types of Observability Engineer jobs in Virginia are:
What job categories do people searching Remote Observability Engineer jobs in Virginia look for? The top searched job categories for Remote Observability Engineer jobs in Virginia are:
What cities in Virginia are hiring for Remote Observability Engineer jobs? Cities in Virginia with the most Remote Observability Engineer job openings:
Infographic showing various Remote Observability Engineer job openings in Virginia as of June 2026, with employment types broken down into 75% Full Time, and 25% Contract. Highlights an 100% Remote job distribution.
Senior Infrastructure Engineer

Senior Infrastructure Engineer

BlackSky

Herndon, VA โ€ข On-site, Remote

$111K - $151K/yr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

This job post hasย expired today.ย Applications are no longer accepted.


Job description

Senior Infrastructure Engineer

About Us:

BlackSky is a real-time intelligence company. We own and operate the world\'s most advanced space-based intelligence platform and provide customers satellite imagery, automated analytics and high-frequency monitoring of strategic locations, economic assets and events from around the globe. BlackSky is trusted by the most demanding allied military and intelligence organizations and commercial companies to deliver foresight into critical matters that affect national security and the economy. BlackSky\'s data enables governments and businesses to see, understand and anticipate change as it happens, giving them the ultimate strategic advantage so they can act quickly. Our global team works with cutting-edge technology to make a difference around the world and prides itself on being people-first, customer-focused and fun.

The BlackSky Platform team is building the premier global intelligence platform to deliver timely, relevant, and actionable information to customers. As a Senior Infrastructure Engineer, you will design, build and operate platforms that run our customer workloads across public cloud, private data centers, and air-gapped/disconnected environments. ย This is a hands-on engineering role for someone who is equally comfortable writing Terraform against AWS, troubleshooting kubernetes deployments and packaging full application stacks for delivery and implementation in isolated network environments.

As one of the team, you will be engaged in all aspects of our our multi-environment delivery pipelines from development through production and business continuity (BCP) environments. ย You will directly be involved in the successful automated deployment of solutions that meet our customersโ€™ business objectives. Your contribution matters! The ideal candidate brings deep Kubernetes and GitOps operational experience implementing in public, private and isolated environments.ย  This position reports to the Manager, Infrastructure and while we have offices in Herndon, VA and Seattle, WA, we are open to remote candidates in certain states.

Responsibilities:

  • Design and operate AWS infrastructure (VPC, subnets, NLB/ALB, IAM, EKS, EC2, S3, Route 53) and the hybrid connectivity that ties cloud to on-premises and private/air-gapped networks.
  • Stand up and run production-grade Kubernetes clusters on EKS, Rancher (RKE2) and/or Red Hat OpenShift 4, including upgrades, capacity planning, networking, storage, and day-2 operations.
  • Implement and own GitOps workflows with Argo CD โ€” declarative cluster and application state, app-of-apps patterns, sync policies, drift detection, and progressive rollout strategies.
  • Author, version, and maintain Helm charts for internal and third-party workloads, including values management, chart dependencies, and templating standards across environments.
  • Build repeatable delivery into disconnected environments using Zarf (and equivalent packaging/mirroring tooling) โ€” bundling images, charts, and manifests for air-gapped installs and reproducible deployments.
  • Codify infrastructure and platform configuration as code (Terraform, Helm, Kustomize) with a clear build-once / promote-per-environment strategy.
  • Build and harden CI/CD pipelines that move artifacts safely from dev through to restricted production and BCP targets.
  • Integrate platform services โ€” certificate management (cert-manager), secrets management, container registries, storage, and observability โ€” as shared, reusable building blocks.
  • Establish operational standards: monitoring, alerting, logging, runbooks, incident response, and capacity/cost management.
  • Other responsibilities as assigned.ย 

Required Qualifications:

  • At least five years years in infrastructure, platform, DevOps, or SRE engineering, with at least 3 years running Kubernetes in production.
  • Bachelor\'s degree in a relevant field of study or equivalent experience (four years).
  • Strong hands-on AWS experience across networking, compute, storage, and IAM, including hybrid/on-prem connectivity patterns.
  • Production experience operating Kubernetes in one or more enterprise distributions โ€” Amazon EKS, Rancher/RKE2, or OpenShift 4.
  • Demonstrated GitOps experience with Argo CD (or Flux) as the primary deployment mechanism.
  • Proficiency authoring and maintaining Helm charts, and a solid grasp of Kubernetes primitives (workloads, networking, RBAC, storage, CRDs/operators).
  • Experience with the Kubernetes Operator deployment model โ€” deploying and managing workloads via operators and CRDs (OLM/OperatorHub).
  • Strong infrastructure-as-code skills, ideally with Terraform.
  • Comfort with Linux systems administration and scripting (Bash, plus Python or Go).
  • Experience building on hardened, non-CVE / zero-known-vulnerability base images (e.g., Chainguard, Iron Bank, or distroless/minimal baselines) and supply-chain security practices.
  • Production monitoring and observability with Prometheus and Grafana (exporters, PromQL, alerting, dashboards).
  • Clear written and verbal communication, and the ability to work independently across the full lifecycle of a platform component.

Preferred Qualifications:

  • ย  ย  Breadth across all three of EKS, Rancher/RKE2, and OpenShift 4, with the ability to move fluidly between them.
  • ย  ย  Experience running Kubernetes in edge / resource-constrained environments (e.g., k3s), including the operational tradeoffs of lightweight and disconnected deployments.
  • ย  ย  Direct experience packaging and deploying into air-gapped / disconnected environments using Zarf, image mirroring, and private registries.
  • ย  ย  Container and image scanning experience (Trivy, Grype, Clair, or equivalents) integrated into CI/CD and registry workflows.
  • ย  ย  Familiarity with secrets management (Vault, External Secrets Operator) and PKI/certificate automation.
  • ย  ย  Experience with persistent storage at scale (Ceph, EBS/EFS-backed storage classes).
  • ย  ย  Hands-on OpenTelemetry (OTEL) experience โ€” instrumenting services, running the OTEL Collector, and standardizing traces, metrics, and logs across the platform.
  • ย  ย  Centralized log aggregation and analysis with Elasticsearch / OpenSearch (and shippers such as Fluent Bit, Fluentd, or Logstash).
  • ย  ย  Background supporting regulated, government, or other compliance-driven programs.
  • ย  ย  Service mesh experience with Istio (traffic management, mTLS, ingress/egress gateways, and observability integration).
  • ย  ย  Relevant certifications (CKA/CKAD/CKS, AWS Solutions Architect / DevOps Engineer, Red Hat OpenShift, Rancher).

Life at BlackSky for full-time US benefits eligible employees includes:

  • Medical, dental, vision, disability, group term life and AD&D, voluntary life and AD&D insurance
    • BlackSky pays 100% of employee-only premiums for medical, dental and vision and contributes $100/month for out-of-pocket expenses!
  • 15 days of PTO, 11 Company holidays, four Floating Holidays (pro-rated based on hire date), one day of paid volunteerism leave per year, parental leave and more
  • 401(k) pre-tax and Roth deferral options with employer match
  • Flexible Spending Accounts
  • Employee Stock Purchase Program
  • Employee Assistance and Travel Assistance Programs
  • Employer matching donations
  • Professional development
  • Mac or PC? Your choice!
  • Awesome swag

The anticipated salary range for candidates in Seattle, WA is $135,000-$150,000 per year. The final compensation package offered to a successful candidate will be dependent on specific background and education. BlackSky is a multi-state employer, and this pay scale may not reflect salary ranges in other states or locations outside of Seattle, WA.

BlackSky is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action Employer All Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, national origin, sexual orientation, gender identity, disability, protected veteran status or any other characteristic protected by law.

To conform to U.S. Government space technology export regulations, including the International Traffic in Arms Regulations (ITAR) you must be a U.S. citizen, lawful permanent resident of the U.S., protected individual as defined by 8 U.S.C. 1324b(a)(3), or eligible to obtain the required authorizations from the U.S. Department of State.ย 

EEO/AAP/ Pay Transparency Statements:

https://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf

https://www.dol.gov/ofccp/regs/compliance/posters/pdf/OFCCP_EEO_Supplement_Final_JRF_QA_508c.pdf


BlackSky logo

About BlackSky

Sourced by ZipRecruiter

Industry

Guided missile and space vehicle manufacturing

Company size

11 - 50 Employees

Headquarters location

Herndon, VA, US

Year founded

2013