2

Remote Hardware Reliability Engineer Jobs (NOW HIRING)

Principal Site Reliability Engineer

OR · On-site +1

$57 - $75.75/hr

Upstart's Site Reliability Engineering (SRE) team owns the reliability, resiliency, and ... Remote-US, Remote-Canada Time zone requirements The team operates on the East/West coast time zones.

Site Reliability Engineer

San Diego, CA · Remote

$60.50 - $80.50/hr

The Site Reliability Engineer will focus on the execution and maintenance of reliability ... This is a remote, contract opportunity for a project Arctiq is delivering for a client. Candidates ...

Site Reliability Engineer

San Diego, CA · Remote

$58.25 - $77.50/hr

The Site Reliability Engineer will focus on the execution and maintenance of reliability ... This is a remote, contract opportunity for a project Arctiq is delivering for a client. Candidates ...

... hardware across land, sea, air, and space. Role Overview: This isn't a "keep the lights on" SRE ... Flexibility: Flexible working arrangements including hybrid remote/in-office schedules.

This is a remote position, but if you're near one of our local offices, you're welcome to come ... Our Site Reliability Engineer should help keep our systems steady, secure, and running like a well ...

Site Reliability Engineer - SRE

Atlanta, GA · On-site +1

$54.25 - $72/hr

Role: Site Reliability Engineer * Location: Atlanta, GA OR Dallas OR Austin, TX * Duration ... Remote Possible, however candidates will move to work onsite/Hybrid eventually. Please make sure ...

They are seeking a Customer Reliability Engineer to interact with customers, applying Site ... Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and ...

They are seeking a Customer Reliability Engineer to interact with customers, applying Site ... Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and ...

They are seeking a Customer Reliability Engineer to interact with customers, applying Site ... Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and ...

They are seeking a Customer Reliability Engineer to interact with customers, applying Site ... Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and ...

Fleet Reliability Engineer

Torrance, CA · On-site +1

$107K - $134.70K/yr

You will be the extreme owner of uptime and performance for our deployed hardware worldwide-leading ... Some days you'll be in a remote country diagnosing an antenna issue in the field; other days you'll ...

They are seeking a Customer Reliability Engineer to interact with customers, applying Site ... Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and ...

They are seeking a Customer Reliability Engineer to interact with customers, applying Site ... Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and ...

They are seeking a Customer Reliability Engineer to interact with customers, applying Site ... Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and ...

Site Reliability Engineer

$58.25 - $77.50/hr

Remote Duration: 6 Months (Contract) Details of the role: Need a Senior Site Reliability Engineer to work on Disaster Recovery (DR) initiative for different applications in the Cloud and onsite ...

next page

Showing results 1-20

People also search for

Remote Hardware Reliability Engineer information

See salary details

$61K

$118K

$141K

How much do remote hardware reliability engineer jobs pay per year?

As of May 31, 2026, the average yearly pay for remote hardware reliability engineer in the United States is $117,973.00, according to ZipRecruiter salary data. Most workers in this role earn between $102,500.00 and $129,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Remote Hardware Reliability Engineer, and why are they important?

To thrive as a Remote Hardware Reliability Engineer, you typically need a degree in electrical or mechanical engineering, strong analytical skills, and experience with hardware testing and failure analysis. Familiarity with reliability testing tools like HALT/HASS, statistical analysis software, and reliability modeling systems is important, along with certifications such as CRE (Certified Reliability Engineer). Outstanding problem-solving abilities, attention to detail, and effective remote communication are crucial soft skills for collaborating with global teams and addressing issues proactively. These competencies ensure the development and maintenance of robust, reliable hardware products, minimizing failures and optimizing performance in diverse environments.

How does a Remote Hardware Reliability Engineer collaborate effectively with cross-functional teams while working offsite?

As a Remote Hardware Reliability Engineer, effective collaboration with cross-functional teams—such as design, manufacturing, and quality assurance—is typically achieved through regular virtual meetings, collaborative platforms, and detailed documentation. You’ll often participate in remote design reviews, data analysis sessions, and troubleshooting calls to address reliability concerns. Clear communication and proactive project updates are essential to ensure everyone is aligned, especially when working across different time zones or locations. Building strong relationships with team members via consistent online interaction helps maintain project momentum and ensures reliability standards are met.

What is a Remote Hardware Reliability Engineer?

A Remote Hardware Reliability Engineer is a professional who assesses, tests, and improves the reliability and durability of hardware components and systems, all while working remotely. They analyze failure data, design tests, and recommend modifications to ensure products meet quality and performance standards. This role often involves collaborating with engineering teams using digital communication tools, running simulations, and reviewing data to prevent future hardware failures. By working remotely, these engineers can support projects from anywhere, offering flexibility while maintaining high standards for hardware reliability.

What is the difference between Remote Hardware Reliability Engineer vs Remote Hardware Test Engineer?

AspectRemote Hardware Reliability EngineerRemote Hardware Test Engineer
CredentialsEngineering degree, certifications in reliability or hardware engineeringEngineering degree, certifications in testing or quality assurance
Work EnvironmentDesigning reliability strategies, analyzing failure data, improving hardware durabilityDeveloping and executing testing procedures, validating hardware performance
Industry UsageManufacturers, tech companies, aerospace, automotiveManufacturers, consumer electronics, hardware development firms

While both roles focus on hardware, the Remote Hardware Reliability Engineer emphasizes ensuring long-term durability and reliability through analysis and design improvements. In contrast, the Remote Hardware Test Engineer concentrates on testing hardware components to verify performance and quality before deployment.

More about Remote Hardware Reliability Engineer jobs
What cities are hiring for Remote Hardware Reliability Engineer jobs? Cities with the most Remote Hardware Reliability Engineer job openings:
What are the most commonly searched types of Hardware Reliability Engineer jobs? The most popular types of Hardware Reliability Engineer jobs are:
What states have the most Remote Hardware Reliability Engineer jobs? States with the most job openings for Remote Hardware Reliability Engineer jobs include:
What job categories do people searching Remote Hardware Reliability Engineer jobs look for? The top searched job categories for Remote Hardware Reliability Engineer jobs are:
Infographic showing various Remote Hardware Reliability Engineer job openings in the United States as of May 2026, with employment types broken down into 91% Full Time, and 9% Contract. Highlights an 100% Remote job distribution, with an average salary of $117,973 per year, or $56.7 per hour.

Principal Site Reliability Engineer (SRE)

INFINITE CHOICE LLC

San Francisco, CA • Remote

$180K - $210K/yr

Full-time

Posted 6 days ago


Job description

About the Role

We're seeking an exceptional Principal Site Reliability Engineer to architect, design, and build our SRE foundation from the ground up at InfiniteChoice. This is a rare greenfield opportunity to establish SRE practices, develop custom tooling, and create the reliability culture that will support our platform serving millions of users and billions in transaction volume.

As our Principal SRE, you'll combine deep technical expertise with strategic vision to build world-class monitoring, observability, and automation systems. You'll have the autonomy to define our SRE processes, select technologies, and create the framework that ensures our systems are reliable, scalable, and performant.

Location: Remote - US based

What You Will DoSRE Foundation & Process Development
  • Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics

  • Establish incident response procedures, on-call rotations, and post-mortem processes

  • Create reliability engineering standards and best practices across all engineering teams

  • Develop disaster recovery and business continuity strategies

  • Design and implement capacity planning and performance optimization frameworks

Architecture & Tool Development
  • Drive architecture decisions for comprehensive application and infrastructure monitoring solutions

  • Design and develop custom SRE tools for automated monitoring, alerting, and remediation

  • Build observability platforms that provide deep insights into system performance and user experience

  • Create automation frameworks for deployment, scaling, and incident response

  • Architect logging, metrics, and tracing systems for distributed microservices environments

Google Cloud Infrastructure Excellence
  • Leverage Google Cloud Platform services to build resilient, scalable infrastructure

  • Implement cloud-native monitoring using Stackdriver, Cloud Monitoring, and Cloud Logging

  • Design auto-scaling and self-healing systems using GKE, Cloud Functions, and managed services

  • Optimize cloud costs while maintaining high availability and performance standards

  • Establish security and compliance frameworks within GCP environments

Innovation & Continuous Improvement
  • Research and implement cutting-edge SRE tools and methodologies

  • Leverage AI and machine learning for predictive analytics, anomaly detection, and automated remediation

  • Create dashboards and reporting systems that provide actionable insights to engineering and business teams

  • Establish feedback loops for continuous improvement of reliability and performance

  • Stay current with industry best practices and emerging technologies in the SRE space

What You Must HaveSRE & Infrastructure Expertise
  • 12+ years of experience in Site Reliability Engineering or Infrastructure Engineering

  • 5+ years in lead SRE roles building and scaling SRE teams and processes

  • Proven track record designing and implementing monitoring and observability solutions at scale

  • Deep understanding of distributed systems, microservices architectures, and cloud-native patterns

  • Experience with infrastructure as code, configuration management, and deployment automation

Google Cloud Platform Proficiency
  • Hands-on experience with Google Cloud Platform is required

  • Expertise with GCP monitoring and observability stack (Cloud Monitoring, Cloud Logging, Cloud Trace)

  • Experience with GKE, Compute Engine, Cloud Functions, and other core GCP services

  • Knowledge of GCP networking, security, and compliance capabilities

  • Understanding of GCP cost optimization and resource management

Technical Skills
  • Strong programming skills in Python, Go, Java, or similar languages

  • Experience with monitoring tools (Prometheus, Grafana, Datadog, New Relic, or similar)

  • Proficiency with containerization (Docker, Kubernetes) and orchestration platforms

  • Knowledge of CI/CD pipelines, automated testing, and deployment strategies

  • Understanding of database performance tuning and optimization (SQL and NoSQL)

AI & Automation
  • Familiarity with AI-driven development tools and methodologies is a huge plus

  • Experience with machine learning for operations (AIOps), anomaly detection, or predictive analytics

  • Knowledge of automated incident response and self-healing systems

  • Understanding of AI/ML tools for log analysis, pattern recognition, and intelligent alerting

Problem-Solving & Mindset
  • Strong analytical and troubleshooting skills for complex distributed systems

  • Experience with high-pressure incident response and crisis management

  • Detail-oriented with commitment to operational excellence and continuous improvement

  • Comfortable with ambiguity and building processes in a fast-growing environment

  • Passion for reliability, automation, and engineering best practices

  • Demonstrated experience building SRE programs and processes from the ground up is a HUGE plus

Education
  • Bachelor's degree in Computer Science, Engineering, or equivalent professional experience

  • Industry certifications (Google Cloud Professional, SRE or related certifications preferred)

What We Offer
  • Ground-floor opportunity to build SRE practices and culture from scratch

  • Full autonomy to define processes, select technologies, and establish best practices

  • Direct impact on platform reliability serving millions of users

  • Opportunity to create lasting engineering culture and operational excellence

  • Remote-first culture with in-person meeting in Dallas, TX on need basis

  • Collaborative environment with smart, passionate engineers and cross-functional teams

  • Access to cutting-edge technologies and AI-driven development tools

  • Competitive compensation, equity participation, and comprehensive benefits

Ready to Build World-Class Reliability?

Join us in creating the SRE foundation that will power InfiniteChoice's next phase of growth. If you're passionate about reliability engineering, love building systems from scratch, and want to establish the operational excellence that scales with our business, we'd love to hear from you.

About InfiniteChoice

InfiniteChoice was founded to help people find the experiences they want simply and effortlessly. We leverage a new type of business model and platform that uniquely applies automation and technology to solve the challenges of scale and complexity in experience discovery.


Existing business and marketing technologies can no longer handle the demands of connecting millions of consumers with vast inventories of experiences across a fragmented, global marketplace of people, partners, and providers.


Our mission is to disrupt this status quo by creating seamless connections between consumers and experiences. We're just at the beginning of this journey, but our approach is working: we've helped over 275 million visitors connect to millions of experiences, generating over $2 billion in revenue for our brands and partners.