1

Data Center Reliability Engineer Jobs (NOW HIRING)

Data Center Reliability Engineer

Abilene, TX · On-site

$55.25 - $73.25/hr

As a Reliability Engineer - Data Center Facilities, NA , you will support the operational health, maintainability, and reliability of mission-critical facility systems across OCI's North America data ...

$112K - $141K/yr

Who You Are As a Data Center Reliability Engineer on the Data Science team, you are the "bridge" between raw infrastructure telemetry and actionable operational intelligence. You don't just see ...

Data Center Reliability Engineer

Seattle, WA · On-site

$116K - $146K/yr

Who You Are As a Data Center Reliability Engineer on the Data Science team, you are the "bridge" between raw infrastructure telemetry and actionable operational intelligence. You don't just see ...

Data Center Reliability Engineer

Abilene, TX

$55.25 - $73.25/hr

As a Reliability Engineer - Data Center Facilities, NA , you will support the operational health, maintainability, and reliability of mission-critical facility systems across OCI's North America data ...

Data Center Reliability Engineer

Abilene, TX

$55.25 - $73.25/hr

As a Reliability Engineer - Data Center Facilities, NA , you will support the operational health, maintainability, and reliability of mission-critical facility systems across OCI's North America data ...

Site Reliability Engineer

Richmond, VA · On-site

$56.50 - $75/hr

Work with log data from multiple platforms including Datadog, ElasticSearch, MySQL, and Splunk. * Collaborate with Technology Operations Center (TOC) and SRE teams to support operational visibility ...

Monitor physical environments and systems to ensure data center reliability * Respond to, resolve, and document incidents affecting data center infrastructure * Maintain accurate hardware inventory ...

Site Reliability Engineer (Oracle DB)

Plano, TX · On-site

$54.50 - $72.50/hr

Site Reliability Engineer - Data Center (Level 3) - Oracle DB Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join our Data Center Engineering team at Level 3. This role ...

Site Reliability Engineer (Oracle EBS)

Plano, TX · On-site

$54.50 - $72.50/hr

Site Reliability Engineer - Data Center (Level 3) - Oracle EBS Location: Remote - WFH Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join our Data Center Engineering ...

Site Reliability Engineer (SQL Server DBA)

Plano, TX · On-site

$54.50 - $72.50/hr

Site Reliability Engineer - Data Center (Level 3) - SQL Server DBA Location: Remote - WFH Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join our Data Center Engineering ...

Site Reliability Engineer (PostgreSQL)

Plano, TX · On-site

$54.50 - $72.50/hr

Site Reliability Engineer - Data Center (Level 3) - PostgreSQL Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join our Data Center Engineering team at Level 3. This role ...

Linux Site Reliability Engineer

Nashville, TN · On-site

$55 - $73.25/hr

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

Linux Site Reliability Engineer

Irvine, CA · On-site

$61.25 - $81.25/hr

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

Linux Site Reliability Engineer

Livonia, MI · On-site

$53.25 - $71/hr

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

Linux Site Reliability Engineer

Livonia, MI

$50.50 - $67/hr

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

Linux Site Reliability Engineer

Irvine, CA

$61.25 - $81.25/hr

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

Reliability Engineer We are seeking a skilled and detail-oriented Reliability Engineer to join our ... data center environments. You maybe a good fit if you have * Bachelor's or Master's degree in ...

Reliability Engineer We are seeking a skilled and detail-oriented Reliability Engineer to join our ... data center environments. You maybe a good fit if you have * Bachelor's or Master's degree in ...

next page

Showing results 1-20

Data Center Reliability Engineer information

See salary details

$61K

$118K

$141K

How much do data center reliability engineer jobs pay per year?

As of Jun 7, 2026, the average yearly pay for data center reliability engineer in the United States is $117,973.00, according to ZipRecruiter salary data. Most workers in this role earn between $102,500.00 and $129,000.00 per year, depending on experience, location, and employer.

What are some typical challenges faced by Data Center Reliability Engineers, and how can I prepare for them?

Data Center Reliability Engineers often encounter challenges such as minimizing downtime, ensuring redundancy, and proactively identifying potential points of failure within complex infrastructures. You may be required to respond quickly to incidents, balance multiple priorities, and collaborate closely with both IT and facilities teams. Preparing for these challenges involves developing strong troubleshooting skills, staying up-to-date with best practices in reliability engineering, and gaining experience with monitoring and automation tools commonly used in data centers.

What are the key skills and qualifications needed to thrive as a Data Center Reliability Engineer, and why are they important?

To thrive as a Data Center Reliability Engineer, you need a strong background in electrical and mechanical systems, critical facility operations, and often a degree in engineering or a related field. Familiarity with Building Management Systems (BMS), Computerized Maintenance Management Systems (CMMS), and certifications like Uptime Institute’s Accredited Tier Specialist or ASHRAE are common requirements. Exceptional problem-solving abilities, attention to detail, and effective communication are vital soft skills in this role. These competencies are crucial for maintaining optimal uptime, preventing failures, and ensuring the continuous operation of critical infrastructure.

What does a Data Center Reliability Engineer do?

A Data Center Reliability Engineer is responsible for ensuring the continuous and efficient operation of data center infrastructure. They monitor systems for potential issues, perform maintenance, and implement strategies to prevent downtime. Their work often involves collaborating with IT and facilities teams to optimize performance, improve reliability, and respond to emergencies. By proactively identifying risks, these engineers help maintain critical services and minimize disruptions to business operations.
Data Center Reliability Engineer

Data Center Reliability Engineer

Oracle

Abilene, TX • On-site

$55.25 - $73.25/hr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 21 days ago


Oracle rating

8.7

Company rating: 8.7 out of 10

Based on 133 frontline employees who took The Breakroom Quiz

38th of 186 rated software companies


Job description

Job Description
As a Reliability Engineer - Data Center Facilities, NA, you will support the operational health, maintainability, and reliability of mission-critical facility systems across OCI's North America data center portfolio. This role contributes to commissioning readiness, maintenance program design, failure analysis, and technical support to Site Operations across electrical, mechanical, and associated controls systems.
You will work cross-functionally with Site Operations, Design Engineering, Construction, Building Automation, Commissioning, and Reliability peers to help ensure critical infrastructure is supportable, reliable, and ready for sustained operations. This is an individual contributor role focused on analysis, technical execution, and operational partnership.
Responsibilities
Key Responsibilities
  • Support reliability activities for critical electrical, mechanical, and controls-related infrastructure across assigned sites or programs.

  • Review commissioning and startup plans to ensure systems meet design intent and are operationally supportable at turnover.

  • Assist in developing maintenance programs that improve operability, reduce downtime, and balance lifecycle cost.

  • Analyze equipment performance, maintenance data, and operational trends to identify risks and improvement opportunities.

  • Support root cause analysis and corrective action development for reliability-related issues and recurring failures.

  • Partner with Site Operations to provide technical guidance during equipment failures, abnormal conditions, or troubleshooting efforts.

  • Review construction submittals, O&M documentation, and turnover materials to evaluate maintainability and operational readiness.

  • Support risk assessments, spare parts analysis, lifecycle planning, and end-of-useful-life considerations for critical assets.

  • Contribute feedback to Design Engineering teams on reliability, maintainability, and operating experience from live sites.

  • Help improve site response procedures, documentation quality, and repeatable reliability practices across the portfolio.

Ideal Candidate Profile
  • 3-5 years of experience in critical facilities, data center operations, industrial maintenance, commissioning, or reliability-related environments.

  • Working knowledge of mission-critical facility systems across electrical, mechanical, and/or controls domains.

  • Experience supporting maintenance planning, system testing, troubleshooting, or failure analysis in operational environments.

  • Bachelor's degree in Engineering or related field preferred; equivalent field experience also valued.

Skills and Competencies
  • Strong analytical and problem-solving capability.

  • Ability to work across multiple teams in a fast-paced environment.

  • Strong written and verbal communication skills.

  • Attention to detail and process discipline.

  • Ability to balance technical rigor with practical operational needs.

Preferred Skills / Certifications
  • Familiarity with CMMS, asset management systems, commissioning processes, or maintenance planning tools.

  • Exposure to RAM analysis, spare parts analysis, or lifecycle cost analysis.

  • Working knowledge of one-lines, P&IDs, sequences of operation, or controls architecture documentation.

  • Data center, utility, healthcare, semiconductor, telecom, or other uptime-critical experience is a plus.

Physical Demands / Work Environment
This role supports mission-critical data center environments where reliability, responsiveness, and execution discipline are essential. Travel may be required to support site reviews, turnover activities, incident follow-up, and cross-functional coordination. You must be able to walk sites, climb stairs, and work safely in active operational environments, with or without reasonable accommodation. Source roles also note occasional lifting up to 25 pounds.
Why Oracle Cloud Infrastructure?
Global impact at scale: Contribute directly to how mission-critical OCI data centers operate across regions and continents, influencing infrastructure reliability, security, sustainability, and long-term capacity growth.
Technically rigorous environment: Work alongside experienced engineers, automation specialists, and compliance teams in a rapidly scaling hyperscale cloud infrastructure, where disciplined execution and technical depth matter.
Culture built on operational excellence: Join an organization that values safety, process rigor, clear accountability, and continuous improvement as foundational to protecting uptime and customer trust.
Long-term career development: Benefit from internal mobility, role-based technical training, and development opportunities designed for professionals building long-term careers in cloud infrastructure and facilities operations.
Qualifications
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $97,500 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That's why we're committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

What Oracle employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Oracle logo

About Oracle

Sourced by ZipRecruiter

An Oracle career can span industries, roles, Countries and cultures, giving you the opportunity to flourish in new roles and innovate, while blending work life in. Oracle has thrived through 40+ years of change by innovating and operating with integrity while delivering for the top companies in almost every industry. In order to nurture the talent that makes this happen, we are committed to an inclusive culture that celebrates and values diverse insights and perspectives, a workforce that inspires thought leadership and innovation. Oracle offers a highly competitive suite of Employee Benefits designed on the principles of parity, consistency, and affordability. The overall package includes certain core elements such as Medical, Life Insurance, access to Retirement Planning, and much more. We also encourage our employees to engage in the culture of giving back to the communities where we live and do business. At Oracle, we believe that innovation starts with diversity and inclusion and to create the future we need talent from various backgrounds, perspectives, and abilities. We ensure that individuals with disabilities are provided reasonable accommodation to successfully participate in the job application, interview process, and in potential roles. to perform crucial job functions. That's why we're committed to creating a workforce where all individuals can do their best work. It's when everyone's voice is heard and valued that we're inspired to go beyond what's been done before.

Industry

It services

Company size

10,000+ Employees

Headquarters location

Redwood City, CA, US

Year founded

1977

Social media