1

Ai Reliability Engineer Jobs in Oregon (NOW HIRING)

OR ยท On-site

$57 - $75.75/hr

As a Senior Site Reliability Engineer, you'll help us balance development velocity with the ... Set the conditions for AI agents to do reliable work in our environment, including repository ...

Site Reliability Engineer TELCOR Inc, a leading innovator in laboratory software, is looking for a Site Reliability Engineer to join our TELCOR AI Systems team! Do you have strong experience in cloud ...

Senior Site Reliability Engineer, Government

OR ยท Remote

$57 - $75.75/hr

As a Senior Site Reliability Engineer, you will join our Government SRE team and own both the ... AI is redefining how the world operates and rewriting the rules of security in real time, and ...

OR

$548K - $899K/yr

Demonstrated experience improving SRE AI fluency. Generally, our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation ...

OR ยท On-site

$548K - $899K/yr

Demonstrated experience improving SRE AI fluency. Generally, our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation ...

OR ยท Hybrid

$136K - $152K/yr

By integrating AI tools into our daily workflows, collaboration is enhanced, outcomes are improved ... The Database Reliability Engineer (DBRE) is responsible for managing, building, maintaining ...

OR

$158K - $200K/yr

Your Impact As a Customer Reliability Engineer (CRE), you are the tip of the spear in interacting ... the AI era - and beyond. We've been innovating fearlessly for 40 years to create solutions that ...

$57 - $75.75/hr

... ready AI to federal agencies at commercial speed. Leveraging our mission-ready technology and ... s Engineer with a strong focus on observability and reliability to join our health project team.

Ensure AI reliability, security, and scalability across deployed systems, including logging ... Engage with product, engineering, and data teams to align AI work with broader business priorities ...

Senior Infrastructure Engineer/SRE

OR ยท On-site +1

$108K - $147K/yr

Building machine learning infrastructure that enables AI teams to train, test, and deploy on large-scale datasets. What we are looking for: * 5+ years experience in DevOps, Site Reliability ...

OR ยท On-site

Integrate AI review signals with our SRE observability stack (logs, metrics, traces) to correlate code changes with incidents and anomalies, closing the loop from PR to production behavior. * Encode ...

OR

$179K - $231K/yr

As VP of Engineering, AI Innovations, you will lead the our team of talented software developers ... Deep knowledge of SRE principles: SLIs/SLOs, incident management, error budgets. * Experience ...

... AI agents, and serves the Financial Intelligence Graph to Fortune 500 customers who expect ... Strong SRE instincts: you think in terms of SLOs, capacity planning, incident response, and ...

OR ยท Hybrid

... AI tools on top of LLMs, training models on our own proprietary data to automate diagnostics and streamline workflows. Please note that this is a highly customer-facing role. If you have an SRE or ...

OR ยท Hybrid

$104K - $143K/yr

... AI tools on top of LLMs, training models on our own proprietary data to automate diagnostics and streamline workflows. Please note that this is a highly customer-facing role. If you have an SRE or ...

OR

$142K/yr

Our team partners closely with platform, infrastructure, SRE, product engineering, risk, and ... Partner with engineering teams to improve the security of AI-assisted developer workflows and GenAI ...

$111K - $159K/yr

Cloud, Linux, Windows, ITIL, or SRE certifications. Working Model Hybrid operations role with ... As we evolve into a more technology-, data-, and AI-enabled organization, we remain grounded in the ...

$124K - $177K/yr

Drive adoption of Site Reliability Engineering (SRE) practices to reduce manual toil with templates ... Account for any GCP implementation with MLOps and Agentic AI tools in Vertex AI * Enhance ...

next page

Showing results 1-20

Ai Reliability Engineer information

What are the key skills and qualifications needed to thrive as an AI Reliability Engineer, and why are they important?

To thrive as an AI Reliability Engineer, you need a solid background in computer science or engineering, expertise in AI/ML concepts, and experience with software testing and reliability methodologies. Familiarity with tools like TensorFlow, PyTorch, CI/CD pipelines, and reliability testing frameworks, along with certifications in cloud platforms (e.g., AWS Certified Machine Learning), is highly valuable. Analytical thinking, problem-solving abilities, and strong collaboration skills set top performers apart in this role. These skills ensure robust, dependable AI systems that meet performance standards and maintain trust in critical applications.

What is the difference between Ai Reliability Engineer vs Data Scientist?

AspectAi Reliability EngineerData Scientist
Required CredentialsBachelor's or master's in CS, engineering, or related; certifications in AI/MLBachelor's or master's in CS, statistics, or related; certifications in data analysis or ML
Work EnvironmentTech companies, AI-focused teams, engineering departmentsResearch labs, tech firms, analytics teams
Employer & Industry UsageAI product development, machine learning systems, reliability testingData analysis, predictive modeling, business insights

While both roles involve AI and ML, Ai Reliability Engineers focus on ensuring AI system robustness and uptime, whereas Data Scientists analyze data to generate insights and models. The roles often collaborate but serve different primary functions within AI projects.

What are AI Reliability Engineers?

AI Reliability Engineers are professionals responsible for ensuring that artificial intelligence systems function reliably, safely, and effectively over time. They work on monitoring AI models in production, identifying and mitigating potential failures, and improving the robustness of AI systems. Their tasks often include testing, validation, performance monitoring, and implementing best practices for maintaining AI infrastructure. By focusing on reliability, they help organizations deploy AI solutions that are dependable and trustworthy in real-world environments.

What are some common challenges Ai Reliability Engineers face when ensuring model robustness in production environments?

Ai Reliability Engineers often encounter challenges such as monitoring AI model performance for drift or unexpected behavior, managing data quality issues, and implementing automated alerting systems for anomalies. In production, it's crucial to ensure that AI models operate consistently and remain reliable under varying conditions and data inputs. Collaborating closely with data scientists, software engineers, and DevOps teams is essential to address these challenges and to continuously improve model reliability and uptime.
What job categories do people searching Ai Reliability Engineer jobs in Oregon look for? The top searched job categories for Ai Reliability Engineer jobs in Oregon are:
What cities in Oregon are hiring for Ai Reliability Engineer jobs? Cities in Oregon with the most Ai Reliability Engineer job openings:
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Jamf

OR โ€ข On-site

$57 - $75.75/hr

Other

Posted 26 days ago


Job description

What you'll do at Jamf:

As a Senior Site Reliability Engineer, you'll help us balance development velocity with the reliability our customers depend on. You'll partner with engineering teams to shape how their services are measured, lead the work to improve them, and use what you learn from production to build the automation and agentic tooling that improves reliability globally. You'll work fluently with agentic development tools as part of your everyday practice, using them to move faster, to investigate harder problems and to multiply your impact.ย This is a senior individual contributor role at the intersection of Engineering, Product, Customer Success and Technical Support, where you'll play a meaningful part in shaping how we practice SRE at Jamf.

You may be required to work periodically at a Jamf office or collaborative work location with other Jamf employees in your area for certain events or moments that matter. We are only able to accept applications for those based in one of these locations.

What you can expect to do in this role:

  • Partner with engineering teams to define service-level objectives, error budgets, and supporting indicators for their services, and help them use those measures to inform prioritization and reliability investment.ย 
  • Investigate complex production issues end-to-end across application, data, infrastructure, and network layers, using AI to correlate logs, metrics, and code and to pressure-test hypotheses before acting.
  • Produce clear technical documentation, runbooks, architecture notes, postmortems and proofs of concept for both technical and non-technical audiences, in a form that engineers and AI tools can re-use.
  • Identify systemic sources of toil and lead the work to eliminate them through automation, AI agents, tooling, and process change.ย 
  • Set the conditions for AI agents to do reliable work in our environment, including repository context, well-specified tasks, integrations such as MCP servers that give AI safe access to the systems it needs, and the tests and guardrails needed for AI-authored change to be trusted.
  • Participate in team ceremonies to identify and refine work, communicate findings, and drive opportunities to collaborate.
  • Drive cross-team and cross-department collaboration on reliability initiatives, including reviewing designs, influencing roadmaps, and mentoring engineers on SRE practices, including effective AI use in their reliability work.
  • Advise senior leadership and stakeholders during critical customer escalations, translating between technical reality and business impact.
  • Contribute to scaling the SRE practice itself: improving our standards, our tooling, and how we partner with product engineering teams.
  • #LIRemote

What we are looking for:

  • Minimum of 5 years experience in software engineering, SRE or production operations roles. (Required)
  • Strong production troubleshooting skills across the stack. Ability to diagnose issues from first principles using the tools available (profilers, heap and thread dumps, query plans, traces, logs, metrics).ย (Required)
  • Experience working within a form of the Agile development framework process. (Required)
  • Hands-on experience operating production services on AWS (e.g. EC2, S3, EKS, RDS/Aurora, CloudFront). (Required)
  • Experience utilizing observability tools (i.e. Grafana, Prometheus, LogicMonitor). (Required)
  • Experience creating clear and concise technical documentation that is targeted at both technical and non-technical audiences. (Required)
  • Experience writing infrastructure as a code. (Required)
  • Experience writing automation in a general-purpose language (e.g. Python, Go, Java, or similar) to a production standard. (Required)
  • Strong judgement about how to apply AI effectively across the full range of SRE work, including high-stakes areas such as production access and sensitive data, knowing how to scope and verify work to make it safe. (Required)
  • Hands-on experience using agentic development tools (e.g. Claude Code, Cursor, Copilot) to deliver engineering and operational work, scoping and delegating bounded tasks, verifying the output, and shipping with confidence. (Required)
  • Experience improving how a team works with AI, for example authoring reusable skills, repository context files, or prompt patterns that others adopt. (Required)
  • Experience optimizing SQL queries and database engine tuning. (Preferred)
  • Experience with CI/CD Tooling (e.g. Github Actions, Jenkins). (Preferred)
  • Exposure to chaos engineering, fault injection and disaster recovery exercises. (Preferred)
  • Familiar with FinOps practices. (Preferred)
  • 2 year / Associates (Required)
  • 4 year / Bachelor's Degree (Preferred)
  • A combination of relevant experience and education may be considered

SECURITY AND PRIVACY REQUIREMENTS

  • Participation in ongoing security training is mandatory
  • Established security protocols will be adhered to, sensitive data will be handled responsibly, and data protection practices are followed, including understanding relevant privacy regulations and reporting breaches
  • Acknowledging the Jamf Code of Conduct, where applicable security and privacy policies can be found, is a requirement of all roles at Jamf

How we help you reach your best potential:

  • Named a 2025 Best Companies to Work For by U.S. News
  • Named a 2024 Best Technology Company to Work For by U.S. News
  • Named one of Forbes Most Trusted Companies in 2024
  • Named a 2024 Best Companies to Work For by U.S. News
  • Our developers work in agile delivery teams to produce new features, improve software components, and are the subject matter experts for our Jamf product offerings.
  • You will have the opportunity to make a real and meaningful impact for more than 75,000 global customers with the best Apple device management solution in the world.
  • We constantly push the boundaries of technology, our developers support new innovations and OS releases the moment they are made available by Apple.
  • Several Jamf engineers are named in patents and with team names like CatDog, ThunderSnow and Dalek you can expect to have some fun while building cutting-edge software.
  • You will have the opportunity to work with a small and empowered team where the culture is based on trust, ownership, and respect.
  • We offer a clear career path that enables you to grow under supportive leadership and management
  • Visit our Jamf Engineering blog to learn more about the innovative projects our team is working on and what we learn from each challenge we solve. A blog written by engineers, for engineers atย medium.com/jamf-engineering
  • 22 of 25 world's most valuable brands rely on Jamf to do their best work (as ranked by Forbes).
  • Over 100,000 Jamf Nation users, the largest online IT community in the world.

Pay Transparency
At Jamf, base pay is one part of our total compensation package and is set within a defined range. These ranges can vary based on hiring location. Where an individual's pay falls within that range depends on several factors, including role scope, location, budget, skills, experience, and qualifications. This approach helps ensure fair, competitive pay and provides room to grow as you develop in your role.