Remote Reliability Engineer Jobs in Phoenix, AZ (NOW HIRING)

Site Reliability Engineer III

Chandler, AZ · On-site +1

$58.25 - $77.25/hr

Job#: 3033636 Site Reliability Engineer III Location: Chandler, Arizona (Hybrid) Employment Type: Contract Duration: 12 Months Role Overview This position is responsible for ensuring the internal ...

Software Engineer

Phoenix, AZ · Remote

$110K - $135K/yr

Develop and run unit and performance tests to ensure scalability and reliability. * Review and ... Other Employee Perks Job Type: * Full-Time * Remote Compensation: * $110k - $135k DOE Required ...

This full-time, fully remote position offers the opportunity to collaborate with a talented team of ... Ensure performance, reliability, and scalability across all systems and applications. Continuously ...

Remote Reliability Engineer information

Salary chart for Phoenix, AZ (marked values): $60.6K, $117.1K, $140K

How much do remote reliability engineer jobs pay per year?

As of May 28, 2026, the average yearly pay for a remote reliability engineer in Phoenix, AZ is $117,136, according to ZipRecruiter salary data. Most workers in this role earn between $101,800 and $128,100 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Remote Reliability Engineer, and why are they important?

To thrive as a Remote Reliability Engineer, you need a strong background in systems engineering, software development, and infrastructure management, often supported by a degree in computer science or a related field. Proficiency with cloud platforms (such as AWS, Azure, or GCP), monitoring tools (like Prometheus, Grafana), and relevant certifications (e.g., AWS Certified DevOps Engineer) is highly valuable. Excellent problem-solving, communication, and collaboration skills are crucial for working effectively across distributed teams and responding to incidents. These abilities ensure system reliability, quick incident resolution, and seamless remote teamwork, which are vital for maintaining high service uptime and user satisfaction.

How do Remote Reliability Engineers typically collaborate with on-site teams to address urgent technical issues?

Remote Reliability Engineers often use a combination of video conferencing, instant messaging, and collaborative monitoring tools to stay closely connected with on-site teams. When urgent technical issues arise, they participate in real-time troubleshooting sessions, analyze system logs remotely, and may guide on-site staff through step-by-step resolution procedures. Strong communication channels and regular check-ins are essential for swift, effective collaboration, even across different time zones. This structure allows Remote Reliability Engineers to contribute significantly to system uptime while working from a distance.

What is a Remote Reliability Engineer?

A Remote Reliability Engineer is a professional who works from a remote location to ensure that systems, applications, or infrastructure are reliable, available, and performing well. Their responsibilities typically include monitoring system health, diagnosing issues, implementing preventative measures, and collaborating with teams to improve system reliability. They often use tools for automation, incident response, and performance monitoring, all while working offsite. This role is critical in minimizing downtime and ensuring a smooth user experience, especially for companies with complex technical environments. Remote Reliability Engineers must have strong problem-solving skills and be proficient in cloud technologies, automation, and incident management.

What is the difference between a Remote Reliability Engineer and a Remote Site Reliability Engineer?

Aspect | Remote Reliability Engineer | Remote Site Reliability Engineer
Credentials | Typically requires certifications like AWS Certified Solutions Architect, Linux Foundation certifications | Similar credentials, often with additional focus on site-specific tools and monitoring
Work Environment | Primarily remote, focusing on cloud infrastructure and system reliability | Remote with some on-site responsibilities, focusing on infrastructure and operational stability
Industry Usage | Used across tech, cloud providers, SaaS companies | Common in data centers, cloud providers, and large enterprise IT
Search & Comparison Intent | Often compared due to overlapping roles in system reliability and cloud infrastructure | Compared for on-site vs remote operational responsibilities

The main difference is that Remote Reliability Engineers focus on cloud and system reliability remotely, while Remote Site Reliability Engineers may have some on-site duties related to infrastructure. Both roles require similar skills and certifications but differ in their work environment and specific responsibilities.

Infographic showing various Remote Reliability Engineer job openings in Phoenix, AZ as of May 2026, with employment types broken down into 86% Full Time, 10% Part Time, 3% Contract, and 1% Nights. Highlights an 84% Physical, 5% Hybrid, and 11% Remote job distribution, with an average salary of $117,136 per year, or $56.3 per hour.

Site Reliability Engineering Manager - Remote

Arcoro

Phoenix, AZ • On-site, Remote

$56.50 - $75/hr

Other

Retirement, PTO

Posted 8 days ago


Job description

Why Arcoro?

Want to work with a solid company that's transforming HR for the construction industry? Our team of dedicated professionals helps construction, contracting and field services companies hire, manage and grow their workforce with a market-leading SaaS solution. As a member of the A-Team, you'll enjoy a top-notch employee experience where you can embrace your problem-solving skills and innovation, work with a team of great colleagues and see the impact of your contribution each day. Our culture is collaborative, and we believe strongly in training, growth and internal advancement. We offer competitive compensation including comprehensive benefits and a generous time-off policy. We offer both on-site and remote opportunities.

At Arcoro, you will help create software products that are cutting edge, easy to use, and that make an appreciated and notable difference in our customers' daily lives.

About the Job:

The Site Reliability Engineering Manager is responsible for leading the SRE team to ensure the availability, performance, scalability, and operational excellence of Arcoro's production systems. This role combines people leadership with deep technical oversight, ensuring services meet defined reliability targets and that the team is effective, engaged, and aligned with product and business goals.

The SRE Manager partners closely with Engineering and Product to drive reliability engineering practices, incident response, observability, and continuous improvement across the production environment.

This is a hands-on role. In addition to leading and developing the team, the SRE Manager is expected to contribute as an individual contributor by writing code and automation, building tooling, participating in on-call, and working directly in production systems alongside the team.

What You'll Do

  • Lead and manage a team of Site Reliability Engineers responsible for the reliability, performance, and operational health of production systems
  • Serve as a hands-on technical contributor by writing code and automation, building reliability tooling, participating in on-call, and working directly in production systems alongside the team
  • Support career growth and development of team members through coaching, mentoring, and performance management
  • Define, measure, and drive Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets in partnership with engineering and product teams
  • Own incident response, including on-call rotations, escalation processes, severity management, and blameless postmortems
  • Drive continuous improvement in monitoring, observability, alerting, and on-call practices to reduce toil and mean-time-to-recovery
  • Lead the adoption of AI and automation across SRE practices, including AI-assisted incident response, intelligent alerting, automated remediation, and the use of AI tooling to reduce toil and accelerate operational workflows
  • Partner with Engineering to refine our products to better support agentic AI development, including improving APIs, telemetry, environments, and platform capabilities that enable AI agents to safely build on and operate against our systems
  • Drive cloud cost optimization and FinOps practices in partnership with Engineering, including vendor management, cost allocation, rightsizing, and engineering best practices that reduce cloud spend
  • Partner with Engineering on operational readiness reviews, production change management, and release safety
  • Champion reliability best practices and ensure they are embedded across the engineering organization
  • Track and report on key reliability metrics, incident trends, and team health to leadership
  • Stay current with emerging SRE practices, tooling, and industry standards
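
For readers unfamiliar with the SLO and error-budget terms in the list above, here is a minimal sketch of the arithmetic in Python. It is illustrative only: the function names, the 99.9% target, and the 30-day window are assumptions chosen for the example and do not come from the posting or from Arcoro's systems.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Minutes of allowed unavailability in the window for a given SLO target.

    slo_target is a fraction, e.g. 0.999 for "99.9% available" (hypothetical value).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be between 0 and 1 (exclusive)")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent (negative if overspent).

    The SLI here is the simple ratio good_events / total_events.
    """
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    allowed_bad = (1.0 - slo_target) * total_events
    actual_bad = total_events - good_events
    return 1.0 - actual_bad / allowed_bad
```

Under these assumptions, a 99.9% target over 30 days allows about 43.2 minutes of unavailability, and a service with 500 failed requests out of 1,000,000 has spent about half of its budget.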

What We're Looking For:

  • Proven experience leading SRE, operations, or reliability-focused engineering teams in a production software environment
  • Willingness and ability to operate as a hands-on individual contributor in addition to managing the team, including writing code, building automation, and participating in on-call
  • Strong understanding of SRE principles, including SLOs/SLIs, error budgets, and blameless postmortems
  • Hands-on background in incident response, on-call management, and production troubleshooting
  • Experience with modern observability practices, including metrics, logging, tracing, and alerting
  • Demonstrated experience applying AI and automation to reliability work, including using AI-assisted tooling, building automated remediation, and leading the adoption of AI-driven practices on a team
  • Solid grasp of distributed systems, cloud infrastructure, and the operational characteristics of web-scale applications
  • Strong leadership, coaching, and team development skills
  • Excellent communication skills, including the ability to lead through high-pressure incidents and communicate clearly with technical and non-technical stakeholders
  • Strong analytical and problem-solving abilities
  • Ability to work across teams and influence at multiple levels of the organization

Preferred Qualifications

  • Bachelor's degree in Computer Science, a related field, or equivalent professional experience
  • 10+ years of experience in software engineering, systems engineering, DevOps, or site reliability engineering
  • 3+ years of experience in a technical leadership, team lead, Lead, or Principal role
  • Previous experience as an SRE Manager, Lead SRE, Principal DevOps/SRE, Operations Manager, or similar leadership role
  • Strong experience with Microsoft Azure; additional experience with AWS or Google Cloud Platform a plus
  • Experience with Microsoft technologies (.NET, C#, SQL Server) in a production environment
  • Experience with container orchestration (Kubernetes, AKS, or EKS) and tools such as Helm or Argo
  • Experience with observability platforms (e.g., Datadog, ELK, Grafana, OpenTelemetry, Azure Monitor)
  • Experience with infrastructure-as-code (e.g., Bicep, Terraform, CloudFormation) and modern CI/CD pipelines (e.g., Azure DevOps, GitHub Actions)
  • Experience with cloud cost optimization and FinOps practices
  • Familiarity with incident management and ITSM tooling (e.g., PagerDuty, Opsgenie, ServiceNow)
  • Hands-on experience with AI-assisted engineering tools (e.g., coding copilots, LLM-powered runbooks or agents) and automation platforms used in production operations
  • Microsoft Azure certifications (e.g., AZ-305 Solutions Architect Expert, AZ-400 DevOps Engineer Expert) a plus

Salary Range:

$200,000-$220,000 DOE

What We Offer

  • Competitive salary and benefits package.
  • 401(k) with Company match
  • Flexible PTO and Company-paid holidays
  • Remote Work
  • Opportunities for professional growth and development.
  • A collaborative and innovative work environment.

About the Company

A rapidly growing SaaS company, Arcoro offers proven modular HR solutions for the construction and contracting industries. Our product suite and software platform provide end-to-end HR functionality to help drive business outcomes, enabling companies to better manage the entire employee lifecycle through improved candidate quality and flow, shortened time to hire, centralized learning and improved employee productivity. Our HR solutions integrate with top construction ERP systems further positioning Arcoro as a leader in proven modular HR solutions. With Arcoro's flexible solutions, customers select the modules that meet their needs for talent acquisition, talent management, core HR, benefits administration, time and attendance tracking and more. Arcoro has over 7000 customers across North America.

Arcoro is a Fair and Equal Opportunity Employer

Arcoro is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.