1

Ai Reliability Engineer Jobs in Raleigh, NC (NOW HIRING)

Software Engineer

Raleigh, NC · On-site +1

$135K - $154K/yr

Develop and deploy AI/ML applications and MLOps workflows using Red Hat OpenShift AI, including RAG and Large Language Models (LLMs). * Perform SRE functions and technical support within an ...

AI Engineer

Cary, NC · On-site

$110K - $150K/yr

The AI Engineer is responsible for the development of AI solutions, typically leveraging pretrained ... Instrument AI solutions for telemetry, reliability, performance monitoring, and cost control.

The AI Engineer is responsible for the development of AI solutions, typically leveraging pretrained ... Instrument AI solutions for telemetry, reliability, performance monitoring, and cost control.

AI Engineer

Cary, NC · On-site

$100K - $120K/yr

... and reliability • Collaborate with product managers, data scientists, and engineers to translate requirements into AI solutions • Implement MLOps practices including versioning, CI/CD, and ...

In this role, you will design and develop the observability platforms, reliability tooling, and AI-powered automation that empower WPC engineering teams to ship with confidence, resolve issues in ...

Senior Database Engineer

Raleigh, NC · Remote

$130K - $155K/yr

You'll work closely with SRE, Platform, and Engineering teams to ensure performance, reliability ... Familiarity with AI-assisted tools (e.g., Claude, Windsurf, GitHub Copilot) Qualifications Required ...

next page

Showing results 1-20

Ai Reliability Engineer information

See Raleigh, NC salary details

$59.3K

$114.7K

$137.1K

How much do ai reliability engineer jobs pay per year?

As of Jun 9, 2026, the average yearly pay for ai reliability engineer in Raleigh, NC is $114,679.00, according to ZipRecruiter salary data. Most workers in this role earn between $99,600.00 and $125,400.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as an AI Reliability Engineer, and why are they important?

To thrive as an AI Reliability Engineer, you need a solid background in computer science or engineering, expertise in AI/ML concepts, and experience with software testing and reliability methodologies. Familiarity with tools like TensorFlow, PyTorch, CI/CD pipelines, and reliability testing frameworks, along with certifications in cloud platforms (e.g., AWS Certified Machine Learning), is highly valuable. Analytical thinking, problem-solving abilities, and strong collaboration skills set top performers apart in this role. These skills ensure robust, dependable AI systems that meet performance standards and maintain trust in critical applications.

What is the difference between Ai Reliability Engineer vs Data Scientist?

AspectAi Reliability EngineerData Scientist
Required CredentialsBachelor's or master's in CS, engineering, or related; certifications in AI/MLBachelor's or master's in CS, statistics, or related; certifications in data analysis or ML
Work EnvironmentTech companies, AI-focused teams, engineering departmentsResearch labs, tech firms, analytics teams
Employer & Industry UsageAI product development, machine learning systems, reliability testingData analysis, predictive modeling, business insights

While both roles involve AI and ML, Ai Reliability Engineers focus on ensuring AI system robustness and uptime, whereas Data Scientists analyze data to generate insights and models. The roles often collaborate but serve different primary functions within AI projects.

What are AI Reliability Engineers?

AI Reliability Engineers are professionals responsible for ensuring that artificial intelligence systems function reliably, safely, and effectively over time. They work on monitoring AI models in production, identifying and mitigating potential failures, and improving the robustness of AI systems. Their tasks often include testing, validation, performance monitoring, and implementing best practices for maintaining AI infrastructure. By focusing on reliability, they help organizations deploy AI solutions that are dependable and trustworthy in real-world environments.

What is a $900,000 AI job?

A $900,000 AI job typically refers to highly senior roles such as AI executives, chief AI officers, or lead AI engineers at top technology companies, often involving advanced expertise in machine learning, deep learning, and AI strategy. These positions usually require extensive experience, specialized skills, and may include performance-based bonuses or stock options that contribute to the high total compensation.

What are some common challenges Ai Reliability Engineers face when ensuring model robustness in production environments?

Ai Reliability Engineers often encounter challenges such as monitoring AI model performance for drift or unexpected behavior, managing data quality issues, and implementing automated alerting systems for anomalies. In production, it's crucial to ensure that AI models operate consistently and remain reliable under varying conditions and data inputs. Collaborating closely with data scientists, software engineers, and DevOps teams is essential to address these challenges and to continuously improve model reliability and uptime.
What are popular job titles related to Ai Reliability Engineer jobs in Raleigh, NC? For Ai Reliability Engineer jobs in Raleigh, NC, the most frequently searched job titles are:
What job categories do people searching Ai Reliability Engineer jobs in Raleigh, NC look for? The top searched job categories for Ai Reliability Engineer jobs in Raleigh, NC are:
What cities near Raleigh, NC are hiring for Ai Reliability Engineer jobs? Cities near Raleigh, NC with the most Ai Reliability Engineer job openings:
Sr. Network Operations VoIP Engineer (Platform & SRE)

Sr. Network Operations VoIP Engineer (Platform & SRE)

Bandwidth

Raleigh, NC

Other

Medical, Dental, Vision, PTO

Posted 17 days ago


Job description

Who We Are:

Bandwidth, a prior "Best of EC" award winner, is a global software company that helps enterprises deliver exceptional experiences through voice, messaging, and emergency services. Reaching 65+ countries and over 90 percent of the global economy, we're the only provider offering an owned communications cloud that delivers advanced automation, AI integrations, global reach, and premium human support. Bandwidth is trusted for mission-critical communications by the Global 2000, hyperscalers, and SaaS builders!

At Bandwidth, your music matters when you are part of the BAND.  We celebrate differences and encourage BANDmates to be their authentic selves.  #jointheband

What We Are Looking For:

We are seeking a Senior VoIP Engineer with a modern engineering mindset to join Bandwidth's Network Operations team. While you possess deep expertise in SIP and carrier-grade environments, you view infrastructure through the lens of Software Reliability Engineering (SRE). This role is critical for ensuring reliable, secure, and automated SIP connectivity across multiple carrier platforms and enterprise environments.

What You'll Do:

  • Platform Automation: Design and implement automation to reduce operational burden, replacing manual workflows with scripting and Infrastructure as Code (IaC).
  • Voice Orchestration: Use tools like Ansible/AWX and Terraform to automate the provisioning and scaling of voice services and carrier interconnects.
  • Open Source & Interoperability: Conduct detailed interoperability testing between carrier platforms and open-source SIP implementations (e.g., Kamailio, OpenSIPS), resolving signaling mismatches and codec issues.
  • Carrier Interconnect Strategy: Design and maintain SIP trunking solutions with multiple carriers to ensure reliability, compliance, and optimal routing.
  • Advanced Observability: Act as the highest-level escalation point for complex SIP/RTP problems. Use advanced tools like Wireshark, SIPp, and HEPIC/Homer to analyze call flows and media streams.
  • System Integration: Collaborate with network and security teams to integrate SIP services through proxy servers and modern routing policies.
  • Documentation & Mentorship: Maintain "Documentation as Code," including SIP call flow diagrams and interconnect standards, while providing guidance to junior staff on modern troubleshooting methodologies.

What You Need:

  • Experience: 5+ years of hands-on VoIP engineering experience, with a heavy emphasis on carrier-grade SIP environments.
  • SIP Mastery: Expert-level knowledge of SIP signaling (INVITE, BYE, etc.), SDP negotiation, and RTP/RTCP.
  • Automation Toolkit: 3+ years of proficiency in Python, Bash, or Ansible for automating provisioning and troubleshooting workflows.
  • Infrastructure as Code: Experience using IaC tools to manage infrastructure at scale.
  • SIP Proxies: Hands-on experience with open-source SIP proxies (e.g., Kamailio, OpenSIPS) alongside traditional SBCs (e.g., Ribbon, Oracle/Acme Packet).
  • Linux/OS: Strong Linux administration skills for VoIP application performance and system troubleshooting.
  • Diagnostics: Proficiency with diagnostic tools such as Wireshark, sngrep, and Homer/HEP.

Bonus Points:

  • Carrier Expertise: Direct experience with carrier-to-carrier SIP interconnects and SS7-to-SIP interworking.
  • Routing & Databases: Experience with ENUM, LNP, and complex routing logic.
  • Voice Security: Familiarity with TLS, SRTP, and regulatory compliance including STIR/SHAKEN and E911.
  • Cloud Infrastructure: Experience deploying voice services in containerized or cloud-native environments.
  • Networking: Solid knowledge of IP, DNS, and routing protocols.

The Whole Person Promise:

At Bandwidth, we're pretty proud of our corporate culture, which is rooted in our "Whole Person Promise." We promise all employees that they can have meaningful work AND a full life, and we provide a work environment geared toward enriching your body, mind, and spirit. How do we do that? Well...

  • 100% company-paid Medical, Vision, & Dental coverage for you and your family with low deductibles and low out-of-pocket expenses.
  • All new hires receive four weeks of PTO.
  • PTO Embargo. When you take time off (of any kind!) you're embargoed from working. Bandmates and managers are not allowed to interrupt your PTO - not even with email.
  • Additional PTO can be earned throughout the year through volunteer hours and Bandwidth challenges.
  • "Mahalo moments" program grants additional time off for life's most important moments like graduations, buying a first home, getting married, wedding anniversaries (every five years), and the birth of a grandchild.
  • 90-Minute Workout Lunches and unlimited meetings with our very own nutritionist.

Are you excited about the position and its responsibilities, but not sure if you're 100% qualified? Do you feel you can work to help us crush the mission? If you answered 'yes' to both of these questions, we encourage you to apply! You won't want to miss the opportunity to be a part of the BAND.

Applicant Privacy Notice