AI Reliability Engineer Jobs in Minnesota (NOW HIRING)

Senior Site Reliability Engineer

Eagan, MN · On-site

$58 - $77.25/hr

... Reliability Engineer working on our Publishing Systems on the AWS Cloud platform. About the Role ... an AI-enabled future. * Industry Competitive Benefits: We offer comprehensive benefit plans to ...

Senior Site Reliability Engineer

Eagan, MN · Hybrid

$58 - $77.25/hr

Keep up to date with emerging cloud technology trends, especially around DevOps, Service Reliability ... an AI-enabled future. * Industry Competitive Benefits: We offer comprehensive benefit plans to ...

Lead AI Forward Engineer

Eagan, MN · Remote

$104K - $137K/yr

Establish and track SLOs/SLIs for critical AI services to meet enterprise reliability and ... Mentor engineers and share patterns, practices, and lessons learned to raise overall AI solution ...

AI Reliability Engineer information

What are the key skills and qualifications needed to thrive as an AI Reliability Engineer, and why are they important?

To thrive as an AI Reliability Engineer, you need a solid background in computer science or engineering, expertise in AI/ML concepts, and experience with software testing and reliability methodologies. Familiarity with tools like TensorFlow, PyTorch, CI/CD pipelines, and reliability testing frameworks, along with certifications in cloud platforms (e.g., AWS Certified Machine Learning), is highly valuable. Analytical thinking, problem-solving abilities, and strong collaboration skills set top performers apart in this role. These skills ensure robust, dependable AI systems that meet performance standards and maintain trust in critical applications.

What is the difference between an AI Reliability Engineer and a Data Scientist?

Aspect | AI Reliability Engineer | Data Scientist
Required Credentials | Bachelor's or master's in CS, engineering, or related; certifications in AI/ML | Bachelor's or master's in CS, statistics, or related; certifications in data analysis or ML
Work Environment | Tech companies, AI-focused teams, engineering departments | Research labs, tech firms, analytics teams
Employer & Industry Usage | AI product development, machine learning systems, reliability testing | Data analysis, predictive modeling, business insights

While both roles involve AI and ML, AI Reliability Engineers focus on keeping AI systems robust and available, whereas Data Scientists analyze data to generate insights and build models. The roles often collaborate but serve different primary functions within AI projects.

What are AI Reliability Engineers?

AI Reliability Engineers are professionals responsible for ensuring that artificial intelligence systems function reliably, safely, and effectively over time. They work on monitoring AI models in production, identifying and mitigating potential failures, and improving the robustness of AI systems. Their tasks often include testing, validation, performance monitoring, and implementing best practices for maintaining AI infrastructure. By focusing on reliability, they help organizations deploy AI solutions that are dependable and trustworthy in real-world environments.

What are some common challenges AI Reliability Engineers face when ensuring model robustness in production environments?

AI Reliability Engineers often encounter challenges such as monitoring AI model performance for drift or unexpected behavior, managing data quality issues, and implementing automated alerting systems for anomalies. In production, it's crucial to ensure that AI models operate consistently and remain reliable under varying conditions and data inputs. Collaborating closely with data scientists, software engineers, and DevOps teams is essential to address these challenges and to continuously improve model reliability and uptime.
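
As a rough illustration of what monitoring for drift can look like, the sketch below computes a Population Stability Index (PSI) between a baseline sample and a recent production sample of one numeric feature. PSI is only one of several possible drift measures. The function name, the bin count, and the 0.2 alert threshold are assumptions made for this example (0.2 is a commonly quoted rule of thumb, not a standard), and none of it comes from a specific employer's tooling.

```python
import math
from typing import Sequence


def population_stability_index(
    baseline: Sequence[float],
    current: Sequence[float],
    bins: int = 10,
) -> float:
    """Compare two samples of one numeric feature; larger values mean more drift."""
    lo, hi = min(baseline), max(baseline)
    # Fall back to a width of 1.0 when every baseline value is identical.
    width = (hi - lo) / bins or 1.0

    def bucket_shares(values: Sequence[float]) -> list[float]:
        counts = [0] * bins
        for v in values:
            # Clamp out-of-range values into the first or last bucket.
            idx = min(max(int((v - lo) / width), 0), bins - 1)
            counts[idx] += 1
        # A small floor keeps the logarithm defined for empty buckets.
        return [max(c / len(values), 1e-6) for c in counts]

    expected = bucket_shares(baseline)
    actual = bucket_shares(current)
    return sum((a - e) * math.log(a / e) for a, e in zip(actual, expected))


# Hypothetical alert threshold; tune it per feature in practice.
DRIFT_THRESHOLD = 0.2

baseline_sample = [i / 100 for i in range(1000)]
shifted_sample = [v + 5 for v in baseline_sample]

if population_stability_index(baseline_sample, shifted_sample) > DRIFT_THRESHOLD:
    print("drift alert: feature distribution has shifted")
```

In practice a check like this would run on a schedule for each monitored feature and feed an alerting system, so that a shift in the input data is noticed before it shows up as degraded model output.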
Principal Site Reliability Engineer

UnitedHealth Group

Minnetonka, MN • Hybrid

$58 - $77.25/hr

Full-time

Retirement

Posted 12 days ago


UnitedHealth Group rating: 7.6 out of 10

Based on 141 frontline employees who took The Breakroom Quiz

187th of 872 rated healthcare providers


Job description

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.

If you are located in MN, you will follow a hybrid schedule with four in-office days per week. 

Primary Responsibilities:

Leadership and Strategy

  • Develop and execute a comprehensive strategy for SRE, SecOps, and TechOps aligned with organizational goals, with a focus on improving stability, security, and supportability of all digital properties
  • Build, lead, and mentor a high-performing team of SRE, SecOps, and TechOps professionals, fostering a culture of collaboration, innovation, and continuous improvement
  • Collaborate with cross-functional leaders and engineering teams to integrate best practices into all aspects of consumer products and platforms, 'baking in' resilience from design to deployment
  • Guide teams on priorities, mentor individual contributors, and report to CIOs on critical paths, mitigation plans, and strategic initiatives

Site Reliability Engineering (SRE)

  • Oversee teams to map and proactively secure and harden end-to-end customer journeys across all business units
  • Analyze and model dependencies (applications, APIs, infrastructure) and run threat models for various risks, including natural disasters, cyberattacks, and software failures
  • Develop AIOps and MLOps strategy, and oversee implementation and rollout across multiple apps
  • Build reusable agentic AI solutions for SRE and Ops functions
  • Develop and enforce reliability standards, including Service Level Agreements (SLAs), Service Level Indicators (SLIs), and Service Level Objectives (SLOs), utilizing these key metrics to continuously improve system reliability and performance and prioritize work
  • Implement automation to reduce manual tasks, enhance operational efficiency, and design resilience features such as automated failovers, geo-redundancy, circuit breakers, and automated rollbacks
  • Ensure minimal downtime and optimal performance of systems through proactive risk and threat modeling and mitigation
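
For readers unfamiliar with how SLIs and SLOs are used to prioritize work, the sketch below shows the usual error-budget arithmetic: the SLI is the measured success ratio, the SLO is the target for it, and the gap between the two is the budget a team can spend on risk. The function names and the sample figures are assumptions chosen for illustration and do not describe this employer's systems.

```python
def availability_sli(total_requests: int, failed_requests: int) -> float:
    """Measured success ratio for a window (the SLI)."""
    if total_requests == 0:
        return 1.0  # no traffic, so nothing failed
    return 1.0 - failed_requests / total_requests


def error_budget_remaining(
    slo_target: float, total_requests: int, failed_requests: int
) -> float:
    """Fraction of the error budget still unspent; negative once the SLO is breached."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests == 0:
        return 1.0
    # Failures the SLO tolerates over this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


# Hypothetical window: a 99.9% SLO, one million requests, 250 failures.
sli = availability_sli(1_000_000, 250)
remaining = error_budget_remaining(0.999, 1_000_000, 250)
print(f"SLI={sli:.5f}, error budget remaining={remaining:.0%}")
```

With these sample numbers about three quarters of the budget is left. A team that has spent most of its budget would typically put reliability work ahead of new feature rollouts.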

Security Operations (SecOps)

  • Oversee proactive threat detection and response processes to identify and mitigate security threats in real-time
  • Foster collaboration between security and IT operations teams to enhance incident response capabilities and proactive prevention

Technology Operations (TechOps)

  • Ensure the stability, scalability, and resilience of the technology infrastructure, including hardware, software, and network operations
  • Develop and maintain robust processes for infrastructure provisioning, configuration, and maintenance
  • Promote a culture of continuous improvement and innovation, optimizing processes and accelerating technology delivery

You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.

Required Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience
  • 15 years of experience in software engineering, site reliability engineering, and/or security or technology operations, with 7 years serving in a leadership role in one of these areas
  • Demonstrated experience driving AI-led innovation, including building agentic AI solutions for Ops or implementing AI/Ops solutions to improve system reliability, security posture, or cloud operations
  • Proven solid knowledge of infrastructure management, system monitoring, incident response, software engineering principles, and security frameworks at scale
  • Proven track record of developing and implementing successful strategies in a global digital consumer environment
  • Demonstrated history of remediating technology infrastructure issues at the root cause
  • Extensive experience with operational and security tools and technologies (e.g., monitoring systems, automation tools, SIEM, IDS/IPS)
  • Proven exceptional leadership, communication, and collaboration skills, with the ability to translate complex technical issues into clear, concise, and impactful communications for C-suite and executive leadership
  • Proven ability to thrive in a fast-paced, dynamic environment and effectively manage multiple high-priority situations simultaneously

Preferred Qualification:

  • Solid vendor relationship management experience

Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $134,600 to $230,800 annually based on full-time employment. We comply with all minimum wage laws as applicable.

At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone, of every race, gender, sexuality, age, location and income, deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health that are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes, an enterprise priority reflected in our mission.

UnitedHealth Group is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.

UnitedHealth Group is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.

