Ai Reliability Engineer Jobs (NOW HIRING)

Principal Site Reliability Engineer (SRE)

San Francisco, CA · Remote

$180K - $210K/yr

Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics ... Leverage AI and machine learning for predictive analytics, anomaly detection, and automated ...

Principal Site Reliability Engineer (SRE)

Austin, TX · Remote

$180K - $210K/yr

Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics ... Leverage AI and machine learning for predictive analytics, anomaly detection, and automated ...

Quick apply

Principal Site Reliability Engineer (SRE)

Austin, TX · Remote

$180K - $210K/yr

Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics ... Leverage AI and machine learning for predictive analytics, anomaly detection, and automated ...

Principal Site Reliability Engineer (SRE)

Dallas, TX · On-site

$180K - $210K/yr

Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics ... Leverage AI and machine learning for predictive analytics, anomaly detection, and automated ...

Quick apply

Principal Site Reliability Engineer (SRE)

Dallas, TX · On-site

$180K - $210K/yr

Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics ... Leverage AI and machine learning for predictive analytics, anomaly detection, and automated ...

Reliability Engineer

Fort Worth, TX · On-site

$98K - $123K/yr

Reliability Engineer This critical role within the Celestica Global Quality Organization is ... Exposure to Generative AI and Large Language Models (LLMs) for application in data analysis ...

Reliability Engineer

Fort Worth, TX · On-site

$98K - $123K/yr

Reliability Engineer This critical role within the Celestica Global Quality Organization is ... Exposure to Generative AI and Large Language Models (LLMs) for application in data analysis ...

Celestica

Reliability Engineer

Fort Worth, TX · On-site

$98K - $123K/yr

Reliability Engineer This critical role within the Celestica Global Quality Organization is ... Exposure to Generative AI and Large Language Models (LLMs) for application in data analysis ...

Celestica

Reliability Engineer

Fort Worth, TX · On-site

$98K - $123K/yr

Reliability Engineer This critical role within the Celestica Global Quality Organization is ... Exposure to Generative AI and Large Language Models (LLMs) for application in data analysis ...

Reliability Engineer

Fort Worth, TX · On-site

$98K - $123K/yr

Reliability Engineer This critical role within the Celestica Global Quality Organization is ... Exposure to Generative AI and Large Language Models (LLMs) for application in data analysis ...

Reliability Engineer

Fort Worth, TX · On-site

$98K - $123K/yr

Reliability Engineer This critical role within the Celestica Global Quality Organization is ... Exposure to Generative AI and Large Language Models (LLMs) for application in data analysis ...

Logix Guru

Site Reliability Engineer, AI & Agentic Systems

Plano, TX · On-site

$53.25 - $70.75/hr

A large nationwide company in the Finance/Mortgage industry is currently seeking a Site Reliability Engineer, AI & Agentic Systems 5+ years experience is a must! In this role, you will be involved in ...

Logix Guru

Site Reliability Engineer, AI & Agentic Systems

Plano, TX · On-site

$53.25 - $70.75/hr

Apar Technologies

SRE Engineer

Redmond, WA · On-site

$63.75 - $84.75/hr

Deploy and manage AI resources on Microsoft Azure, including AI Foundry and RAG solutions * Monitor and ensure service uptime, availability, reliability, and latency * Track and integrate SRE metrics ...

Quick apply

Apar Technologies

SRE Engineer

Redmond, WA · On-site

$63.75 - $84.75/hr

SmartIPlace

Site Reliability Engineer (SRE)

Parsippany, NJ · On-site

$57.25 - $76.25/hr

We are looking for a talented Site Reliability Engineer (SRE) with a strong background in Google ... Familiarity with Google BI and AI/ML tools a plus (Looker, BigQuery ML, Vertex AI, etc.) Experience ...

Quick apply

SmartIPlace

Site Reliability Engineer (SRE)

Parsippany, NJ · On-site

$57.25 - $76.25/hr

Florence Healthcare - US

Site Reliability Engineer (SRE)

Atlanta, GA · On-site

$54.75 - $72.75/hr

We are seeking a Site Reliability Engineer (SRE) to join one of our Scrum teams and help ensure the ... AI-driven tooling and automation are a cornerstone of how we build, operate, and scale our systems.

Florence Healthcare - US

Site Reliability Engineer (SRE)

Atlanta, GA · On-site

$54.75 - $72.75/hr

Morgan Stanley

Site Reliability Engineer (SRE) - AI Platform & Cloud

Alpharetta, GA · On-site

$54.25 - $72/hr

They are seeking a Director-level Site Reliability Engineer (SRE) to join their AI Platform team, responsible for maintaining and scaling the infrastructure that supports AI/ML systems in a high ...

Morgan Stanley

Site Reliability Engineer (SRE) - AI Platform & Cloud

Alpharetta, GA · On-site

$54.25 - $72/hr

Tror AI for everyone

SRE Engineer

Scottsdale, AZ · On-site

$57.50 - $76.25/hr

Job Role: SRE Engineer Job Location: Scottsdale AZ (100% onsite) Job Duration: Long Term Contract ... ai

Quick apply

Tror AI for everyone

SRE Engineer

Scottsdale, AZ · On-site

$57.50 - $76.25/hr

Job Role: SRE Engineer Job Location: Scottsdale AZ (100% onsite) Job Duration: Long Term Contract ... ai

Mistral AI

Site Reliability Engineer - NYC

New York, NY · On-site

$62.25 - $82.75/hr

We democratize AI through high-performance, optimized, open-source and cutting-edge models ... What you will do As a Site Reliability Engineer, you balance the day-to-day operations on ...

Mistral AI

Site Reliability Engineer - NYC

New York, NY · On-site

$62.25 - $82.75/hr

We democratize AI through high-performance, optimized, open-source and cutting-edge models ... What you will do As a Site Reliability Engineer, you balance the day-to-day operations on ...

Postman

Member of Technical Staff, AI Reliability & Monitoring Engineering Lead

San Francisco, CA · On-site

$256K - $276K/yr

The Opportunity Postman is seeking an experienced AI Systems Reliability Engineer to help define, build, and maintain the infrastructure and processes that ensure the reliability, scalability, and ...

Postman

Member of Technical Staff, AI Reliability & Monitoring Engineering Lead

San Francisco, CA · On-site

$256K - $276K/yr

SRE Architect, AI-Powered Reliability

Dallas, TX · On-site

$56.50 - $75/hr

Define and lead WEX's AI-Powered Reliability Engineering strategy, driving adoption of SRE agents across the software lifecycle-from design and development through deployment and operations, to ...

SRE Architect, AI-Powered Reliability

Dallas, TX · On-site

$56.50 - $75/hr

SRE Architect, AI-Powered Reliability

Seattle, WA · On-site

$64.75 - $86.25/hr

SRE Architect, AI-Powered Reliability

Seattle, WA · On-site

$64.75 - $86.25/hr

Shield AI

Hardware Reliability Engineer II (R4675)

Dallas, TX · On-site

$81K - $137K/yr

Follow Shield AI on LinkedIn, X, Instagram, and YouTube. As a Hardware Reliability Engineer ... Engineer II) at Shield AI, you will support efforts to ensure the robustness and long-term ...

Quick apply

Shield AI

Hardware Reliability Engineer II (R4675)

Dallas, TX · On-site

$81K - $137K/yr

Follow Shield AI on LinkedIn, X, Instagram, and YouTube. As a Hardware Reliability Engineer ... Engineer II) at Shield AI, you will support efforts to ensure the robustness and long-term ...

SRE Architect, AI-Powered Reliability

Chicago, IL

$58.75 - $78/hr

SRE Architect, AI-Powered Reliability

Chicago, IL

$58.75 - $78/hr

SRE Architect, AI-Powered Reliability

Dallas, TX · On-site

$56.50 - $75/hr

Ai Reliability Engineer Jobs

SRE Architect, AI-Powered Reliability

Dallas, TX · On-site

$56.50 - $75/hr

The Hartford

Principal AI Engineer - Agent Ops / SRE

Columbus, OH · On-site +1

$55 - $73.25/hr

The Hartford's applied AI COE Team is seeking a Principal AI Engineer - Agent Ops/SRE . The AI-COE serves as a centralized function to accelerate AI maturity, eliminate silos, and streamline AI ...

The Hartford

Principal AI Engineer - Agent Ops / SRE

Columbus, OH · On-site +1

$55 - $73.25/hr

Showing results 1-20

Ai Reliability Engineer information

See salary details

$61K

$118K

$141K

How much do ai reliability engineer jobs pay per year?

As of Jul 9, 2026, the average yearly pay for ai reliability engineer in the United States is $117,973.00, according to ZipRecruiter salary data. Most workers in this role earn between $102,500.00 and $129,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as an AI Reliability Engineer, and why are they important?

To thrive as an AI Reliability Engineer, you need a solid background in computer science or engineering, expertise in AI/ML concepts, and experience with software testing and reliability methodologies. Familiarity with tools like TensorFlow, PyTorch, CI/CD pipelines, and reliability testing frameworks, along with certifications in cloud platforms (e.g., AWS Certified Machine Learning), is highly valuable. Analytical thinking, problem-solving abilities, and strong collaboration skills set top performers apart in this role. These skills ensure robust, dependable AI systems that meet performance standards and maintain trust in critical applications.

What is the difference between Ai Reliability Engineer vs Data Scientist?

Aspect	Ai Reliability Engineer	Data Scientist
Required Credentials	Bachelor's or master's in CS, engineering, or related; certifications in AI/ML	Bachelor's or master's in CS, statistics, or related; certifications in data analysis or ML
Work Environment	Tech companies, AI-focused teams, engineering departments	Research labs, tech firms, analytics teams
Employer & Industry Usage	AI product development, machine learning systems, reliability testing	Data analysis, predictive modeling, business insights

While both roles involve AI and ML, Ai Reliability Engineers focus on ensuring AI system robustness and uptime, whereas Data Scientists analyze data to generate insights and models. The roles often collaborate but serve different primary functions within AI projects.

What are AI Reliability Engineers?

AI Reliability Engineers are professionals responsible for ensuring that artificial intelligence systems function reliably, safely, and effectively over time. They work on monitoring AI models in production, identifying and mitigating potential failures, and improving the robustness of AI systems. Their tasks often include testing, validation, performance monitoring, and implementing best practices for maintaining AI infrastructure. By focusing on reliability, they help organizations deploy AI solutions that are dependable and trustworthy in real-world environments.

What are some common challenges Ai Reliability Engineers face when ensuring model robustness in production environments?

Ai Reliability Engineers often encounter challenges such as monitoring AI model performance for drift or unexpected behavior, managing data quality issues, and implementing automated alerting systems for anomalies. In production, it's crucial to ensure that AI models operate consistently and remain reliable under varying conditions and data inputs. Collaborating closely with data scientists, software engineers, and DevOps teams is essential to address these challenges and to continuously improve model reliability and uptime.

More about Ai Reliability Engineer jobs

The 10 Top Types Of Ai Reliability Engineer Jobs

What cities are hiring for Ai Reliability Engineer jobs? Cities with the most Ai Reliability Engineer job openings:

What states have the most Ai Reliability Engineer jobs? States with the most job openings for Ai Reliability Engineer jobs include:

What job categories do people searching Ai Reliability Engineer jobs look for? The top searched job categories for Ai Reliability Engineer jobs are:

Ai Reliability Engineer jobs near you

Infographic showing various Ai Reliability Engineer job openings in the United States as of July 2026, with employment types broken down into 100% Contract. Highlights an 100% In-person job distribution, with an average salary of $117,973 per year, or $56.7 per hour.

Principal Site Reliability Engineer (SRE)