Everforth ECS is seeking a Senior ML Observability Engineer to work in the National Capital Region covering the Pentagon, Falls Church, and Fairfax . Please Note: This position is contingent upon ...
Everforth ECS is seeking a Senior ML Observability Engineer to work in the National Capital Region covering the Pentagon, Falls Church, and Fairfax . Please Note: This position is contingent upon ...
Position Overview Our Observability, Software (SW) Development & Automation Enablement department is in search of a Senior Tech Lead, Network Observability & Automation Enablement with excellent time ...
Position Overview Our Observability, Software (SW) Development & Automation Enablement department is in search of a Senior Tech Lead, Network Observability & Automation Enablement with excellent time ...
The Senior Kibana/Observability Engineer will act as the principal subject matter expert for designing, deploying, and maintaining the Elastic Observability solution (Logs, Metrics, APM, and Uptime ...
The Senior Kibana/Observability Engineer will act as the principal subject matter expert for designing, deploying, and maintaining the Elastic Observability solution (Logs, Metrics, APM, and Uptime ...
Dynatrace Observability Engineer - McLean, VA - $80/hr Location: McLean, VA Work Arrangement: Onsite Overview: We're seeking an experienced Dynatrace Observability Engineer to design, implement, and ...
Quick apply
Dynatrace Observability Engineer - McLean, VA - $80/hr Location: McLean, VA Work Arrangement: Onsite Overview: We're seeking an experienced Dynatrace Observability Engineer to design, implement, and ...
Senior Observability Engineer
Alexandria, VA · On-site
$111K - $153K/yr
Leidos Digital Modernization sector is seeking an experienced Senior Observability Engineer to support the delivery, enhancement, and adoption of enterprise data and analytics products used across ...
Senior Observability Engineer
Alexandria, VA · On-site
$111K - $153K/yr
Leidos Digital Modernization sector is seeking an experienced Senior Observability Engineer to support the delivery, enhancement, and adoption of enterprise data and analytics products used across ...
Elastic SRE - Security & Observability with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Implement and support observability frameworks including logging, metrics, tracing, and monitoring solutions. * Support CI/CD pipelines and infrastructure-as-code initiatives within DevOps ...
Elastic SRE - Security & Observability with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Implement and support observability frameworks including logging, metrics, tracing, and monitoring solutions. * Support CI/CD pipelines and infrastructure-as-code initiatives within DevOps ...
Elastic SRE - Security & Observability with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Zachary Piper Solutions is seeking an experienced Elastic Site Reliability Engineer (SRE) to support a high-visibility federal engagement focused on observability, platform reliability, and security ...
Elastic SRE - Security & Observability with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Zachary Piper Solutions is seeking an experienced Elastic Site Reliability Engineer (SRE) to support a high-visibility federal engagement focused on observability, platform reliability, and security ...
Manager, Product Management - Customer Assist Observability Product Management at Capital One is a booming, vibrant craft that requires reimagining the status quo, finding value creation ...
Manager, Product Management - Customer Assist Observability Product Management at Capital One is a booming, vibrant craft that requires reimagining the status quo, finding value creation ...
Manager, Product Management - Customer Assist Observability Product Management at Capital One is a booming, vibrant craft that requires reimagining the status quo, finding value creation ...
Manager, Product Management - Customer Assist Observability Product Management at Capital One is a booming, vibrant craft that requires reimagining the status quo, finding value creation ...
Manager, Product Management - Customer Assist Observability Product Management at Capital One is a booming, vibrant craft that requires reimagining the status quo, finding value creation ...
Manager, Product Management - Customer Assist Observability Product Management at Capital One is a booming, vibrant craft that requires reimagining the status quo, finding value creation ...
Infrastructure Observability and Monitoring Lead Job Category: Information Technology Time Type: Full time Minimum Clearance Required to Start: TS/SCI Employee Type: Regular Percentage of Travel ...
New
Infrastructure Observability and Monitoring Lead Job Category: Information Technology Time Type: Full time Minimum Clearance Required to Start: TS/SCI Employee Type: Regular Percentage of Travel ...
New
Infrastructure Observability and Monitoring Specialist Job Category: Information Technology Time Type: Full time Minimum Clearance Required to Start: TS/SCI Employee Type: Regular Percentage of ...
New
Infrastructure Observability and Monitoring Specialist Job Category: Information Technology Time Type: Full time Minimum Clearance Required to Start: TS/SCI Employee Type: Regular Percentage of ...
New
DevOps Engineer - Lead
Richmond, VA · On-site
$52.25 - $71.50/hr
Observability Tools: Proficiency in monitoring, logging, and tracing tools, including Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic, and cloud-native ...
Quick apply
DevOps Engineer - Lead
Richmond, VA · On-site
$52.25 - $71.50/hr
Observability Tools: Proficiency in monitoring, logging, and tracing tools, including Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic, and cloud-native ...
DevOps engineer
Richmond, VA · On-site
$52.25 - $71.50/hr
Implement and manage full-stack observability using Datadog, ensuring seamless monitoring across infrastructure, applications, and services. * Instrument agents for on-premise, cloud, and hybrid ...
Quick apply
DevOps engineer
Richmond, VA · On-site
$52.25 - $71.50/hr
Implement and manage full-stack observability using Datadog, ensuring seamless monitoring across infrastructure, applications, and services. * Instrument agents for on-premise, cloud, and hybrid ...
Elastic Site Reliability Engineer (SRE) with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Implement and support observability frameworks including logging, metrics, tracing, and monitoring solutions. * Support CI/CD pipelines and infrastructure-as-code initiatives within DevOps ...
Elastic Site Reliability Engineer (SRE) with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Implement and support observability frameworks including logging, metrics, tracing, and monitoring solutions. * Support CI/CD pipelines and infrastructure-as-code initiatives within DevOps ...
Elastic Site Reliability Engineer with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Zachary Piper Solutions is seeking an Elastic Site Reliability Engineer (SRE) to support a mission-focused organization delivering secure, scalable observability and reliability solutions across ...
Elastic Site Reliability Engineer with Security Clearance
Hampton, VA · On-site
$180K - $200K/yr
Zachary Piper Solutions is seeking an Elastic Site Reliability Engineer (SRE) to support a mission-focused organization delivering secure, scalable observability and reliability solutions across ...
... observability, and service-level indicator frameworks supporting AI and machine learning model-serving operations across all WDP classification enclaves, ensuring enterprise-wide operational ...
... observability, and service-level indicator frameworks supporting AI and machine learning model-serving operations across all WDP classification enclaves, ensuring enterprise-wide operational ...
Software Engineer 2 or 3 - Infrastructure
Richmond, VA · Hybrid
$171K - $202K/yr
The platform provides foundational infrastructure, container runtime environments, developer tooling, messaging systems, and observability capabilities required to run reliable and scalable ...
Software Engineer 2 or 3 - Infrastructure
Richmond, VA · Hybrid
$171K - $202K/yr
The platform provides foundational infrastructure, container runtime environments, developer tooling, messaging systems, and observability capabilities required to run reliable and scalable ...
... observability, and service-level indicator frameworks supporting AI and machine learning model-serving operations across all WDP classification enclaves, ensuring enterprise-wide operational ...
... observability, and service-level indicator frameworks supporting AI and machine learning model-serving operations across all WDP classification enclaves, ensuring enterprise-wide operational ...
We are seeking a Full Stack Developer to support our Infrastructure Monitoring and Observability platform development. The role requires strong experience in backend and frontend development ...
New
We are seeking a Full Stack Developer to support our Infrastructure Monitoring and Observability platform development. The role requires strong experience in backend and frontend development ...
New
Observability information
See Virginia salary details
$16.21 - $22.51
0% of jobs
$22.51 - $28.82
0% of jobs
$28.82 - $35.12
2% of jobs
$35.12 - $41.42
5% of jobs
$41.42 - $47.73
10% of jobs
$50.68 is the 25th percentile. Wages below this are outliers.
$47.73 - $54.03
17% of jobs
The median wage is $59.01 / hr.
$54.03 - $60.34
20% of jobs
$60.34 - $66.64
18% of jobs
$67.77 is the 75th percentile. Wages above this are outliers.
$66.64 - $72.95
15% of jobs
$72.95 - $79.25
9% of jobs
$79.25 - $85.56
4% of jobs
$16
$60
$85
How much do observability jobs pay per hour?
What are the key skills and qualifications needed to thrive in the Observability position, and why are they important?
To thrive in an Observability role, you need a strong background in monitoring, alerting, logging, and analyzing system performance, often supported by a degree in computer science or related field. Familiarity with tools such as Prometheus, Grafana, Datadog, Splunk, and experience with cloud platforms and scripting languages is crucial. Excellent problem-solving, communication, and collaboration skills help you work effectively with cross-functional engineering and operations teams. These capabilities are essential to ensure system reliability, quickly detect issues, and maintain seamless digital experiences.
What is an Observability job?
An Observability job focuses on ensuring the performance, reliability, and health of software systems by collecting, analyzing, and visualizing telemetry data such as logs, metrics, and traces. Professionals in this field work with monitoring tools, distributed tracing, and alerting systems to detect and troubleshoot issues proactively. They collaborate with engineering and operations teams to improve system visibility, reduce downtime, and enhance overall system performance.
What are the typical day-to-day responsibilities of someone in an Observability role?
In an Observability role, your daily tasks often include designing and maintaining monitoring dashboards, configuring alerts, analyzing system logs, and working closely with development and operations teams to troubleshoot issues. You'll proactively identify areas of improvement to increase system reliability, document monitoring strategies, and support incident response efforts. Collaboration is key, as you may participate in post-incident reviews and help drive architectural improvements based on the data you collect. The role is dynamic and requires a proactive approach to ensure systems stay healthy and downtime is minimized.

$103K - $142K/yr
Other
Posted 15 days ago
Job description
• Develops semantic conventions, runtime instrumentation patterns, and telemetry pipelines that generate latency metrics, error signatures, throughput indicators, model-specific performance signals, and operational readiness measurements for deployed models and serving surfaces.
• Integrates observability capabilities into existing data pipelines, model-deployment workflows, API access patterns, and serving runtime frameworks to provide mission-relevant monitoring aligned with Combatant Command and Joint Staff decision-support needs.
• Configures and validates instrumentation using platforms such as OpenTelemetry, Prometheus, Grafana, Elastic, Splunk, Amazon CloudWatch, and service mesh telemetry components to deliver real-time visibility into model behavior, cross-domain access interactions, and pipeline execution characteristics.
• Conducts observability readiness reviews, supports test and evaluation gates, and collaborates with cybersecurity personnel to embed anomaly-detection signals aligned with Zero Trust and DoW cyber standards.
• Works with serving engineers, pipeline engineers, platform teams, and external provider integration engineers to maintain observability consistency across enclaves and resolve domain-specific telemetry constraints.
• Produces observability standards, instrumentation specifications, dashboards, alerting configurations, and performance analysis reports that strengthen reliability, accelerate incident response, and reinforce mission assurance for production model access across all security networks.
• Performs other duties as assigned. Required Skills • Current Secret security clearance with the ability to obtain and maintain a Top Secret (TS) security clearance with Sensitive Compartmented Information (SCI).
• 10 or more years of progressive experience in systems engineering, platform operations, or ML/AI infrastructure roles, with a demonstrated focus on observability, telemetry, and monitoring in classified or federal government cloud environments.
• Hands-on experience designing and implementing observability pipelines using industry-standard tooling such as OpenTelemetry, Prometheus, Grafana, Elastic, Splunk, or Amazon CloudWatch, including instrumentation of AI/ML model-serving runtimes and data pipelines.
• Experience operating across multi-enclave environments, including NIPRNet, SIPRNet, and JWICS, with demonstrated ability to adapt telemetry and observability architectures to cross-domain constraints and multi-level security requirements.
• CompTIA Cloud+ certification or equivalent, demonstrating foundational knowledge of cloud infrastructure, security, and operational monitoring standards.
• Strong problem-solving and decision-making capabilities, with a proven ability to weigh the relative costs and benefits of potential actions and identify the most appropriate solution.
• Highly developed interpersonal and oral/written communication skills, with the ability to effectively and professionally interact with a diverse set of stakeholders (from peers to end-users to executive management). Desired Skills • Active Top Secret (TS) security clearance with Sensitive Compartmented Information (SCI) eligibility.
• Advanced cloud certification such as AWS Solutions Architect (Professional), AWS DevOps Engineer (Professional), or an equivalent credential demonstrating deep expertise in cloud-native observability and infrastructure-as-code practices in GovCloud or classified cloud environments.
• Practical experience with AI/ML model monitoring concepts including model drift detection, performance degradation alerting, and model validation pipelines using frameworks such as TensorFlow, PyTorch, or MLflow.
• Familiarity with Zero Trust Architecture principles and Risk Management Framework (RMF) requirements as they apply to telemetry data handling, anomaly detection, and continuous monitoring in DoW-compliant environments.
• Experience contributing to DevSecOps pipelines and CI/CD workflows in support of production AI/ML model deployment, including integration of observability gates at promotion checkpoints across development, test, and production environments. ECS Federal LLC is an equal opportunity employer and does not discriminate or allow discrimination on the basis any characteristic protected by law. All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, or local jurisdiction law. is the federal segment of , a $4B global organization with over 10,000 employees. Our nearly 3,500 professionals deliver advanced technology solutions in data and AI, cybersecurity, and enterprise transformation, serving defense, intelligence, and federal civilian agencies. Our work powers mission-critical outcomes, strengthens technology partnerships, and creates meaningful opportunities for our people. We are defined by a commitment to excellence in delivery, a culture of innovation, and an environment where talent can thrive and grow. We value: * Attracting and developing top talent and high-performing teams * Fostering a culture that is engaging, accountable, and mission-driven