1

Elasticsearch Observability Engineer Jobs (NOW HIRING)

Senior NDR & Platform Observability Engineer Senior NDR & Platform Observability Engineer will ... Exposure to data engineering platforms (Kafka, Elasticsearch, Loki). Knowledge of MITRE ATT&CK and ...

... s and Observability Engineer, on contract. The role covers infrastructure automation, CI/CD ... Splunk and/or ELK Stack (Elasticsearch, Logstash, Kibana) • Observability and alerting:

New

Sr. Elastic Engineer

Hampton, VA

$103K - $142K/yr

... Observability Engineer ECK/Kubernetes Knowledge of Kubernetes and able to create visualization ... Design, deploy, and maintain Elastic Stack environments, including Elasticsearch, Kibana, Logstash ...

Engineer

Chandler, AZ · On-site

$110K - $125K/yr

Must Have Technical/Functional Skills Job Summary Seeking an experienced Reporting & Observability ... Elasticsearch, Splunk, SQL databases, or equivalent. • Solid understanding of metrics, logs, and ...

This role sits at the intersection of API engineering, identity security, and observability ... Configure and manage ELK Stack (Elasticsearch, Logstash, Kibana) for log ingestion, monitoring, and ...

Senior Scalability Engineer - Observability

$125K - $165K/yr

They are seeking a Senior Scalability Engineer focused on observability platform development and ... Hands-on experience building or operating search systems using OpenSearch, Elasticsearch, Lucene ...

Senior Scalability Engineer - Observability

$125K - $165K/yr

They are seeking a Senior Scalability Engineer focused on observability platform development ... Hands-on experience building or operating search systems using OpenSearch, Elasticsearch, Lucene ...

next page

Showing results 1-20

Elasticsearch Observability Engineer information

How does an Elasticsearch Observability Engineer typically collaborate with development and operations teams?

As an Elasticsearch Observability Engineer, you frequently partner with both development and operations teams to design and implement monitoring solutions using the Elastic Stack. You'll help developers instrument applications for better traceability and support operations in troubleshooting and optimizing system performance. Regular communication is key, as you'll often lead workshops, create dashboards, and respond to incidents collaboratively. This cross-functional teamwork ensures observability solutions align with organizational goals and provide actionable insights.

What does an Elasticsearch Observability Engineer do?

An Elasticsearch Observability Engineer is responsible for designing, implementing, and maintaining observability solutions using the Elasticsearch stack (Elasticsearch, Logstash, Kibana, and Beats). Their primary role is to ensure that systems are monitored effectively, logs and metrics are collected and analyzed, and issues are detected and diagnosed quickly. They collaborate with development and operations teams to build dashboards, set up alerts, and optimize performance for monitoring infrastructure and applications. These engineers play a key role in improving system reliability and supporting incident response.

What are the key skills and qualifications needed to thrive as an Elasticsearch Observability Engineer, and why are they important?

To thrive as an Elasticsearch Observability Engineer, you need expertise in Elasticsearch, log management, and data analysis, often supported by a degree in computer science or a related field. Familiarity with observability tools such as Kibana, Logstash, Beats, and experience with cloud platforms and scripting languages like Python or Bash are typically required. Strong problem-solving abilities, attention to detail, and effective communication skills help you stand out in this role. These competencies are vital for ensuring system reliability, quickly detecting issues, and delivering actionable insights for continuous improvement.

What is the difference between Elasticsearch Observability Engineer vs Elasticsearch Developer?

AspectElasticsearch Observability EngineerElasticsearch Developer
Primary FocusMonitoring, logging, and observability of Elasticsearch clusters and related systemsDeveloping, customizing, and optimizing Elasticsearch applications and integrations
Skills & CertificationsKnowledge of Elasticsearch, Prometheus, Grafana, scripting, and monitoring toolsProficiency in Elasticsearch APIs, Java, REST, and development frameworks
Work EnvironmentOperations teams, DevOps, SREs, cloud environmentsDevelopment teams, software engineers, backend developers

While both roles require expertise in Elasticsearch, the Elasticsearch Observability Engineer focuses on system monitoring and ensuring Elasticsearch health, whereas the Elasticsearch Developer concentrates on building and customizing Elasticsearch-based applications. Their skills and daily tasks differ, but both are essential in Elasticsearch-centric environments.

More about Elasticsearch Observability Engineer jobs
What cities are hiring for Elasticsearch Observability Engineer jobs? Cities with the most Elasticsearch Observability Engineer job openings:
What states have the most Elasticsearch Observability Engineer jobs? States with the most job openings for Elasticsearch Observability Engineer jobs include:
What job categories do people searching Elasticsearch Observability Engineer jobs look for? The top searched job categories for Elasticsearch Observability Engineer jobs are:
Infographic showing various Elasticsearch Observability Engineer job openings in the United States as of May 2026, with employment types broken down into 66% Full Time, 10% Part Time, and 24% Contract. Highlights an 81% Physical, 7% Hybrid, and 12% Remote job distribution.
Network Detection Engineer

Network Detection Engineer

W3Global

Minneapolis, MN

Other

Posted 15 days ago


Job description

Senior NDR & Platform Observability Engineer

Senior NDR & Platform Observability Engineer will support the operational health, visibility, and performance of the enterprise Network Detection & Response (NDR) environment, with a primary focus on the Corelight platform and surrounding telemetry pipelines. This role combines security operations expertise with the ability to build a modern monitoring and observability framework leveraging APIs, time series databases, automation, and data visualization tools.

The engineer will design and implement a comprehensive health monitoring architecture that ensures accurate, timely detection of platform degradation, enhanced visibility into sensor and pipeline performance, and operational insights that support Security Operations, Incident Response, and Network Engineering teams.

Role Overview

This role is responsible for:

Operating and maintaining the NDR ecosystem.

Developing automated collection of health and performance metrics using Python and REST APIs.

Building a production ready observability stack using Grafana, Prometheus, InfluxDB, and Telegraf.

Ensuring platform reliability, data quality, and visibility through dashboards, alerts, and automation workflows.

Providing advanced troubleshooting support to ensure uninterrupted NDR coverage across the enterprise.

The individual will play a critical role in improving detection efficacy, reducing noise, optimizing sensor uptime, and delivering insights that enhance the organization's overall security posture.

Key Responsibilities

NDR Operations

Oversee daily operations of NDR sensors, appliances, and Zeek based detection pipelines.

Monitor sensor health, data ingestion, packet throughput, and drop rates.

Perform triage of NDR alerts and work with SOC/IR teams on escalations.

Support tuning of Zeek scripts, Suricata rules, and Corelight detection packs.

Identify data gaps, ingest delays, or coverage issues and drive resolution.

Troubleshoot packet broker connections, SPAN/TAP feeds, and network visibility paths.

Observability & Monitoring Architecture

Design an enterprise grade observability solution for NDR platform and related telemetry systems.

Build metrics collectors using Python to ingest REST API data into monitoring platforms.

Integrate metrics into Prometheus, InfluxDB, or similar time series databases.

Configure Telegraf pipelines for data collection, parsing, tagging, and forwarding.

Develop dashboards and visualizations in Grafana for real time and historical performance analysis.

Establish SLIs/SLOs related to NDR reliability, sensor uptime, ingest freshness, and data pipeline availability.

Automation & API Integration

Develop Python automation scripts to standardize health checks, data validation, and system reporting.

Integrate with SIEM, and packet broker APIs to extract key operational metrics.

Build custom Prometheus exporters or collectors when native solutions are not available.

Automate repetitive tasks such as sensor status checks, alert validation, and data integrity verification.

Documentation & Knowledge Transfer

Create and maintain runbooks, playbooks, architecture diagrams, and troubleshooting guides.

Produce regular reports on platform status, performance, alert trends, and risk areas.

Train SOC, IR, and engineering teams on dashboards, alerting workflows, and monitoring best practices.

Stakeholder Coordination

Work closely with Security Operations to improve triage precision and reduce alert noise.

Partner with the Incident Response team to enhance detection and correlation capabilities.

Coordinate with Network Engineering to resolve sensor visibility or traffic path issues.

Collaborate with platform owners to support upgrades, tuning cycles, and architectural enhancements.

Required Qualifications

5+ years in security operations, NDR, network engineering, or observability engineering.

Hands-on experience with Corelight, Endace, cpacket, Zeek, Suricata, or related NDR technologies.

Strong Python development skills, especially for API integrations and automation.

Experience with monitoring and visualization platforms (Grafana, Prometheus, InfluxDB, Telegraf).

Solid understanding of network traffic, packet capture, and troubleshooting.

Ability to create dashboards, alerts, and metrics pipelines for large-scale environments.

Experience supporting security operations teams or incident response workflows.

Preferred Qualifications

Experience developing custom Prometheus exporters (Python/Go).

Prior exposure to Corelight APIs and Zeek script customization.

Familiarity with Docker, Kubernetes, or containerized exporters.

Experience with SIEM platforms and log ingestion pipelines.

Exposure to data engineering platforms (Kafka, Elasticsearch, Loki).

Knowledge of MITRE ATT&CK and NDR detection engineering.

Required AI Skills:

- All contractor resources are expected to demonstrate baseline proficiency in enterprise-approved AI tools as part of their day-to-day responsibilities.

This includes, but is not limited to:

-Consistent Use: Maintain a minimum of 90% weekly usage of AI tools such as GitHub Copilot, Microsoft 365 Copilot, and other GenAI platforms approved by the enterprise.

-Applied Productivity: Leverage AI tools to enhance coding, documentation, data analysis, and decision-making workflows.

-Continuous Learning: Stay current with evolving AI capabilities and features, and apply them to improve delivery quality and velocity.


W3Global logo

About W3Global

Sourced by ZipRecruiter

W3Global has been delivering staffing solutions for nearly two decades; we know which recruiting strategies work best. Our expert team is committed to developing a customized solution to fit your company’s unique needs. As a W3Global client, you’ll also receive personalized assistance from a seasoned team of staffing specialists. We are committed to providing both technical support and industry expertise to simplify the hiring process. We know that your time matters. W3Global will help you streamline the hiring process, getting it done and getting it right.

Industry

Recruiting and staffing services

Company size

501 - 1,000 Employees

Headquarters location

Frisco, TX, US

Year founded

2006