... • Mission-Critical Observability: Architect and maintain Splunk AIOps solutions across ... Engineer secure data ingestion pipelines for telemetry data from cross-domain solutions and ...
... • Mission-Critical Observability: Architect and maintain Splunk AIOps solutions across ... Engineer secure data ingestion pipelines for telemetry data from cross-domain solutions and ...
We are looking for a Staff AIOps Engineer - Generative AI Platform to join our AI team. In this ... Establish and evolve observability, tracing, and telemetry frameworks for GenAI systems across ...
We are looking for a Staff AIOps Engineer - Generative AI Platform to join our AI team. In this ... Establish and evolve observability, tracing, and telemetry frameworks for GenAI systems across ...
Technology Monitoring and Observability Engineer
Lisle, IL · On-site
$137.60K - $206.40K/yr
Event Management & AIOps Engineering * Engineer advanced event correlation, deduplication, and ... Integrate observability tooling with CMDB platforms such as ServiceNow. * Improve configuration ...
Technology Monitoring and Observability Engineer
Lisle, IL · On-site
$137.60K - $206.40K/yr
Event Management & AIOps Engineering * Engineer advanced event correlation, deduplication, and ... Integrate observability tooling with CMDB platforms such as ServiceNow. * Improve configuration ...
Senior DevOps Engineer, AIOPs
$151.50K - $194.60K/yr
... E/DevOps/Platform Ops ... Proven ownership of reliability for an observability/AIOps platform: SLOs/SLIs, on-call, addressing ...
Senior DevOps Engineer, AIOPs
$151.50K - $194.60K/yr
... E/DevOps/Platform Ops ... Proven ownership of reliability for an observability/AIOps platform: SLOs/SLIs, on-call, addressing ...
Mentor engineers, review designs, lead incident reviews, and ensure platform scalability and cost efficiency. Required Skills AIOps Observability Deep expertise in Open Telemetry, distributed tracing ...
Mentor engineers, review designs, lead incident reviews, and ensure platform scalability and cost efficiency. Required Skills AIOps Observability Deep expertise in Open Telemetry, distributed tracing ...
We are looking for a Staff AIOps Engineer - Generative AI Platform to join our AI team. In this ... Establish and evolve observability, tracing, and telemetry frameworks for GenAI systems across ...
We are looking for a Staff AIOps Engineer - Generative AI Platform to join our AI team. In this ... Establish and evolve observability, tracing, and telemetry frameworks for GenAI systems across ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Virginia Beach, VA · On-site
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Virginia Beach, VA · On-site
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Jacksonville, FL · On-site
... across engineering, operations, and leadership stakeholders. • Own the end-to-end Dynatrace ... observability and AIOps capabilities. Qualifications : Required : • Bachelor's degree in ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Jacksonville, FL · On-site
... across engineering, operations, and leadership stakeholders. • Own the end-to-end Dynatrace ... observability and AIOps capabilities. Qualifications : Required : • Bachelor's degree in ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Virginia Beach, VA · On-site
... across engineering, operations, and leadership stakeholders. • Own the end-to-end Dynatrace ... observability and AIOps capabilities. Qualifications : Required : • Bachelor's degree in ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Virginia Beach, VA · On-site
... across engineering, operations, and leadership stakeholders. • Own the end-to-end Dynatrace ... observability and AIOps capabilities. Qualifications : Required : • Bachelor's degree in ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Proven ownership of reliability for an observability/AIOps platform: SLOs/SLIs, on-call, addressing ... Proven programming experience building automation tools or services - ideally in Python, or similar ...
Proven ownership of reliability for an observability/AIOps platform: SLOs/SLIs, on-call, addressing ... Proven programming experience building automation tools or services - ideally in Python, or similar ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Winchester, VA · On-site
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Winchester, VA · On-site
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
San Diego, CA · On-site
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
San Diego, CA · On-site
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Influence and partner with engineering, platform, and operations leaders to embed observability ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Observability Aiops Engineer information
What are the key skills and qualifications needed to thrive as an Observability AIOps Engineer, and why are they important?
What are some common challenges faced by Observability AIOps Engineers in integrating monitoring solutions across diverse technology stacks?
What is an Observability Aiops Engineer?
What is the difference between Observability Aiops Engineer vs Site Reliability Engineer?
| Aspect | Observability Aiops Engineer | Site Reliability Engineer |
|---|---|---|
| Primary Focus | Monitoring, analyzing, and improving system observability using AI and automation | Ensuring system reliability, scalability, and performance of services |
| Skills & Certifications | Knowledge of AI/ML, monitoring tools, scripting, cloud platforms | Systems engineering, scripting, cloud infrastructure, incident management |
| Work Environment | DevOps teams, monitoring platforms, AI tools | Operations, development teams, cloud environments |
| Industry Usage | Tech companies, cloud providers, organizations focusing on AI-driven monitoring | Large-scale tech firms, SaaS providers, internet services |
While both roles focus on system performance and reliability, the Observability Aiops Engineer specializes in leveraging AI and automation to enhance system observability, whereas the Site Reliability Engineer concentrates on maintaining overall system stability and scalability. Both roles often collaborate but have distinct core responsibilities.

Other
Posted 5 days ago
Job description
GES is seeking a Senior AIOps Engineer to support critical mission operations within a secure environment and lead the transformation of our IT Service Management (ITSM) capabilities. This role is responsible for the design, deployment, and management of AIOps solutions that enhance the reliability and security of Department of War (DoW) networks and systems. Acting as the technical lead for this initiative, you will orchestrate integrations across existing Network Engineering, ServiceNow, and SolarWinds teams. You will utilize Splunk and the Machine Learning Toolkit (MLTK) to provide descriptive and predictive analytics and establish closed-loop automated incident response, ensuring the high availability of mission-essential infrastructure. Primary Responsibilities • Cross-Functional Leadership: Lead the AIOps platform initiative by acting as the primary technical liaison to existing Network Engineering, ServiceNow, and SolarWinds administration teams to establish unified telemetry pipelines.
ITSM Orchestration & Automation: Architect closed-loop remediation workflows by deeply integrating Splunk ITSI alerts with ServiceNow Event Management and Incident Management modules. • Mission-Critical Observability: Architect and maintain Splunk AIOps solutions across unclassified and classified enclaves to provide real-time situational awareness. • Infrastructure Telemetry Integration: Normalize and correlate network performance and fault data from SolarWinds with server and application logs to provide a holistic view of enterprise health. • Advanced ML Development: Deploy custom machine learning models via Splunk MLTK to identify anomalous behavior, potential cyber threats, and infrastructure degradations. • Secure Data Integration: Engineer secure data ingestion pipelines for telemetry data from cross-domain solutions and tactical edge devices. • Incident Reduction: Utilize IT Service Intelligence (ITSI) to correlate multi-source events, reducing noise and prioritizing high-impact mission alerts. • Cyber Defense Support: Collaborate with the Cyber Security Service Provider (CSSP) to integrate AIOps insights into defensive cyber operations (DCO). • Compliance & Documentation: Ensure all observability tools comply with DoW STIGs and IL5/IL6 protocols; develop and maintain architectural documentation and compliance traceability. • Mission Alignment: Stay current on AIOps and related capabilities relevant to DoD, federal, and intelligence mission systems.
Required Qualifications • Security Clearance: Active Top Secret / Sensitive Compartmented Information (TS/SCI) required at time of hire. • Certification: Active IAT Level II certification (e.g., Security+ CE, CySA+, GSEC, or SSCP) required. • Citizenship: United States Citizenship is required. • Platform Experience: 7+ years of experience with Splunk Enterprise, including architectural design, cluster management, and advanced Search Processing Language (SPL). • AIOps & ITSM: 3+ years of experience implementing AIOps workflows, including integration with enterprise ITSM solutions (ServiceNow) for automated root cause analysis and remediation. • Machine Learning: Proven track record of building, testing, and tuning supervised and unsupervised models within the Splunk MLTK. • Scripting & Automation: Advanced scripting skills for developing custom search commands, API integrations, and automating remediation tasks (e.g., Python).
Leadership: Experience leading technical working groups and directing the efforts of adjacent infrastructure and development teams. • Operational Experience: Prior experience working within a DoW/DoD Operations Center (NOC/SOC) or supporting mission-critical systems and networks. • Communication: Must be able to present designs, plans, and analyses of alternatives to technical leadership boards for approvals.
Desired Qualifications • Enterprise Aggregation: Experience aggregating and correlating telemetry from diverse tools, specifically SolarWinds, ServiceNow, and VMware vCenter. • Expert Certification: Splunk Enterprise Certified Architect or Splunk ITSI Certified Admin. • Cloud Observability: Experience with Cloud Native Computing Foundation (CNCF) observability tools in secure hybrid multi-cloud environments (Azure/AWS). • RMF/ATO Knowledge: Understanding of the Risk Management Framework (RMF) and the Authorization to Operate (ATO) process for AI/ML workloads.
About Global Enterprise Services
Sourced by ZipRecruiter
Industry
It services
Company size
11 - 50 Employees
Headquarters location
St. Louis, MO, US
Year founded
2018