... , and incident workflows Define monitoring-to-incident processes and governance frameworks ... engineering, AIOps, or production operations Proven experience in enterprise observability ...
... , and incident workflows Define monitoring-to-incident processes and governance frameworks ... engineering, AIOps, or production operations Proven experience in enterprise observability ...
Job Title: Principal Engineer I - AIOps ServiceNow Location: Block 23 What you'll do: As a ... Lead build-out and optimization of Event Management , Cloud Observability , Metric Intelligence ...
Job Title: Principal Engineer I - AIOps ServiceNow Location: Block 23 What you'll do: As a ... Lead build-out and optimization of Event Management , Cloud Observability , Metric Intelligence ...
Senior Engineer, AIOps
$100K - $137.30K/yr
The Royal Caribbean Group's AI & Analytics Team has an exciting career opportunity for a full time Senior Engineer, AIOps reporting to the Senior Manager, Data Intelligence Operations. The position ...
Senior Engineer, AIOps
$100K - $137.30K/yr
The Royal Caribbean Group's AI & Analytics Team has an exciting career opportunity for a full time Senior Engineer, AIOps reporting to the Senior Manager, Data Intelligence Operations. The position ...
Agentic AIOps Technical Lead
$112.80K - $257K/yr
... SREs, product managers, and security teams and creating reference architectures and creating ... Experience with AIOps tools and observability platforms such as Elastic, Splunk, Datadog ...
Agentic AIOps Technical Lead
$112.80K - $257K/yr
... SREs, product managers, and security teams and creating reference architectures and creating ... Experience with AIOps tools and observability platforms such as Elastic, Splunk, Datadog ...
Senior Engineer, AIOps
Miami, FL · On-site
$100K - $137.30K/yr
The Royal Caribbean Group's AI & Analytics Team has an exciting career opportunity for a full time Senior Engineer, AIOps reporting to the Senior Manager, Data Intelligence Operations. The position ...
Senior Engineer, AIOps
Miami, FL · On-site
$100K - $137.30K/yr
The Royal Caribbean Group's AI & Analytics Team has an exciting career opportunity for a full time Senior Engineer, AIOps reporting to the Senior Manager, Data Intelligence Operations. The position ...
NETCOOL DEVELOPER WITH IBM CLOUD PAK FOR AIOPS
$51.25 - $70/hr
We are seeking an experienced Netcool Developer with expertise in IBM Cloud Pak for AIOps to support enterprise monitoring, event management, and AI-driven IT operations initiatives. The ideal ...
New
NETCOOL DEVELOPER WITH IBM CLOUD PAK FOR AIOPS
$51.25 - $70/hr
We are seeking an experienced Netcool Developer with expertise in IBM Cloud Pak for AIOps to support enterprise monitoring, event management, and AI-driven IT operations initiatives. The ideal ...
New
Senior AIOps Engineer with Security Clearance
$129.50K - $177.50K/yr
This role is responsible for the design, deployment, and management of AIOps solutions that enhance the reliability and security of Department of War (DoW) networks and systems. Acting as the ...
Senior AIOps Engineer with Security Clearance
$129.50K - $177.50K/yr
This role is responsible for the design, deployment, and management of AIOps solutions that enhance the reliability and security of Department of War (DoW) networks and systems. Acting as the ...
Event and Data Management Ensuring seamless bidirectional synchronization of event data between Netcool and the AIOps platform. This involves configuring data mapping and transformation rules (often ...
Quick apply
Event and Data Management Ensuring seamless bidirectional synchronization of event data between Netcool and the AIOps platform. This involves configuring data mapping and transformation rules (often ...
Event and Data Management Ensuring seamless bidirectional synchronization of event data between Netcool and the AIOps platform. This involves configuring data mapping and transformation rules (often ...
New
Event and Data Management Ensuring seamless bidirectional synchronization of event data between Netcool and the AIOps platform. This involves configuring data mapping and transformation rules (often ...
New
Dynatrace AIOps Consultant
Clearwater, FL · On-site
Role: Dynatrace AIOps Consultant Location: Clearwater, FL (Remote till COVID) JD Details: Key ... APM | Dynatrace Full Stack (RUM, Synthetic, DEM, Host) GrayLog | Log Management solution Postman ...
Dynatrace AIOps Consultant
Clearwater, FL · On-site
Role: Dynatrace AIOps Consultant Location: Clearwater, FL (Remote till COVID) JD Details: Key ... APM | Dynatrace Full Stack (RUM, Synthetic, DEM, Host) GrayLog | Log Management solution Postman ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Event & Data Management : Configure bidirectional event synchronization, data mapping, and ... Use AIOps topology features for unified, application‑centric infrastructure views.
Event & Data Management : Configure bidirectional event synchronization, data mapping, and ... Use AIOps topology features for unified, application‑centric infrastructure views.
SRE Architect Lead AIOps & Dynatrace
$54.75 - $72.75/hr
Lead SRE strategy, architecture, and reliability initiatives across large-scale distributed systems Design and implement AIOps-driven monitoring and incident management solutions Build proactive ...
SRE Architect Lead AIOps & Dynatrace
$54.75 - $72.75/hr
Lead SRE strategy, architecture, and reliability initiatives across large-scale distributed systems Design and implement AIOps-driven monitoring and incident management solutions Build proactive ...
The Global Operations team at BizTech manages production services across Airbnb's corporate ... You will own the AIOps vision, strategy, and roadmap, partnering with the in-house Observability ...
The Global Operations team at BizTech manages production services across Airbnb's corporate ... You will own the AIOps vision, strategy, and roadmap, partnering with the in-house Observability ...
Netcool Developer with AIOps Cloud Pak - Omnibus and Impact knowledge
Irving, TX · On-site
$54 - $74/hr
Event and Data Management Ensuring seamless bidirectional synchronization of event data between Netcool and the AIOps platform. This involves configuring data mapping and transformation rules (often ...
Netcool Developer with AIOps Cloud Pak - Omnibus and Impact knowledge
Irving, TX · On-site
$54 - $74/hr
Event and Data Management Ensuring seamless bidirectional synchronization of event data between Netcool and the AIOps platform. This involves configuring data mapping and transformation rules (often ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
MLOps Lead Engineer
Plano, TX · On-site
$95.80K - $126.20K/yr
Experience with AIOps tools and frameworks for model deployment, monitoring, retraining, and lifecycle management, and automating operational tasks (e.g. incident triage, auto-remediation) * Hands-on ...
MLOps Lead Engineer
Plano, TX · On-site
$95.80K - $126.20K/yr
Experience with AIOps tools and frameworks for model deployment, monitoring, retraining, and lifecycle management, and automating operational tasks (e.g. incident triage, auto-remediation) * Hands-on ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Virginia Beach, VA · On-site
... Enablement & AIOps to lead the enterprise strategy and evolution of monitoring and AIOps ... CMDB and enterprise data sources. • Influence and partner with engineering, platform, and ...
Principal Technical Product Analyst - Unified Observability, Enablement & AIOps
Virginia Beach, VA · On-site
... Enablement & AIOps to lead the enterprise strategy and evolution of monitoring and AIOps ... CMDB and enterprise data sources. • Influence and partner with engineering, platform, and ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Drive consistency and alignment with ServiceNow CMDB and enterprise data sources. * Influence and ... Extensive experience in IT operations, monitoring, observability, or AIOps-focused supporting ...
Aiops Manager information

Other
Posted 9 days ago
Job description
Position : Enterprise Observability & AIOps Architect (App + Infra)
Location: Dallas, Texas, USA (Hybrid / Remote flexibility – Dallas preferred)
Designation: Principal Architect
Experience: 15+ Years (open to highly experienced profiles up to 25 years)
Duration: 1 Year (with possible extension)
Role Overview
We are looking for an experienced Enterprise Observability & AIOps Architect to design, modernize, and lead enterprise-scale observability ecosystems spanning applications, infrastructure, cloud platforms, databases, and operational workflows.
The ideal candidate will combine strategic architectural leadership with strong hands-on expertise in modern observability and AIOps platforms, driving operational excellence and AI-driven transformation across large enterprise environments.
Key Responsibilities
Enterprise Observability Architecture
Lead enterprise-wide observability assessments across applications, infrastructure, cloud, and databases
Define current-state and target-state architectures
Drive monitoring rationalization and tool consolidation strategies
Establish standards for telemetry, tagging, service identity, alerting, and dashboards
Define scalable operating models aligned with SRE, ITSM, and platform engineering
Application Observability
Architect solutions for:
APM, distributed tracing, logs & metrics, RUM, synthetic monitoring
Define SLI/SLO-driven monitoring strategies
Improve service visibility, dependency mapping, and telemetry quality
Build observability for microservices, APIs, Kubernetes, Azure-native & legacy systems
Infrastructure & Platform Observability
Design observability across cloud, middleware, databases, and batch systems
Analyze alert duplication, routing inefficiencies, and monitoring overlaps
Define event correlation, severity models, enrichment, and ownership frameworks
AIOps & Intelligent Operations
Design and implement:
Event correlation & noise reduction
Intelligent alert prioritization
Anomaly detection & predictive insights
Root cause analysis & contextualization
Enable AI-driven workflows for:
Incident reduction
MTTR optimization
Automated remediation
ITSM & Operational Integration
Integrate observability tools with ServiceNow, CMDB, and incident workflows
Define monitoring-to-incident processes and governance frameworks
Establish KPI-driven operational maturity models
Governance & Blueprinting
Develop enterprise standards, onboarding blueprints, and playbooks
Define reusable observability patterns and reference architectures
Establish Day-1 observability models for new services
Required Experience
15+ years in observability, SRE, platform engineering, AIOps, or production operations
Proven experience in enterprise observability transformation and monitoring rationalization
Strong background in hybrid cloud and distributed systems
Experience working with executives, enterprise architects, and platform teams
Deep understanding of incident management and reliability engineering
Technical Expertise
Observability Tools (Must-Have)
Dynatrace
Azure Monitor
Azure Application Insights
Azure Log Analytics
LogicMonitor
ManageEngine
Preferred Tools
Splunk, ELK / OpenSearch
Prometheus / Grafana
Datadog, New Relic
BigPanda, PagerDuty
Core Skills
Event correlation & alert engineering
Distributed tracing & topology mapping
AIOps & intelligent operations
Cloud telemetry & monitoring
Kubernetes & microservices observability
ITSM (ServiceNow) integration
SRE principles & operational governance
Cloud & Platform
Azure, AWS
Kubernetes & container platforms
APIs & integrations
Middleware & distributed systems
Mandatory Skills
Enterprise Observability Architecture
OpenTelemetry framework design
APM & cloud monitoring expertise
ITSM integration & event correlation
AIOps & anomaly detection
Kubernetes & microservices monitoring
Alert optimization & noise reduction
SLI/SLO framework design
Integration architecture & governance