1

Sre Jobs (NOW HIRING)

SRE

Charlotte, NC ยท On-site

$55.75 - $74/hr

Role: SRE Location: Charlotte, NC Skills: Grafana, Python, Splunk, Linux, Scripting. Microsoft 360 or Power BI Job Summary The Senior Support Lead in Site Reliability engineering (SRE) will be ...

Site Reliability Engineer (SRE)

Austin, TX ยท On-site

$56.50 - $75/hr

Site Reliability Engineer (SRE) Location: Austin, TX Job Type: Full Time Job Summary - Seasoned Site Reliability Engineer (SRE) with 7+ years of experience in supporting complex, large-scale ...

Site Reliability Engineer

Dallas, TX ยท Remote

$35 - $40/hr

We are seeking a highly skilled Site Reliability Engineer (SRE ) with strong observability expertise, proven communication skills, and the ability to drive reliability maturity across multi-team ...

$53 - $70.50/hr

- SRE Manager / SRE Architect (Hands-on) Location: New York City, NY / Fort Mill, SC (Hybrid) Employment Type: Full-Time / Contract Industry: Financial Services Position Overview We are seeking a ...

SRE Manager / SRE Architect

New York, NY ยท On-site

$62.25 - $82.75/hr

- SRE Manager / SRE Architect (Hands-on) Location: New York City, NY / Fort Mill, SC (Hybrid) Employment Type: Full-Time / Contract Industry: Financial Services Position Overview We are seeking a ...

Site Reliability Engineer (SRE)

Austin, TX ยท On-site

$56.50 - $75/hr

Site Reliability Engineer (SRE) Location: Austin, TX Job Type: Full Time Technical Skills: * 6+ years of professional engineering experience developing, managing, or supporting distributed systems ...

Site Reliability Engineer

Plano, TX ยท On-site

$54.50 - $72.50/hr

Site Reliability Engineer Hybrid 3 times a week in Iselin, NJ OR Hybrid 3 times a week in PLANO, TX Interview Process: Virtual- 30 min round Onsite- 1-2 hours Needs: Openshift Kubernetes Development ...

SITE RELIABILITY ENGINEER

Camden, NJ ยท On-site

$130K - $150K/yr

Site Reliability Engineer (SRE) Engineer Reliability into the Systems That Move the Nation's Food Supply Who We Are US Cold owns and operates one of the most complex temperature-controlled logistics ...

Site Reliability Engineer (SRE)

Decatur, TX ยท On-site

$129K - $160K/yr

Join Our Team as a Site Reliability Engineer (SRE)! About Us At Energy Worldnet, Inc. (EWN), we deliver innovative technology solutions that empower our clients and support the future of the energy ...

Site Reliability Engineer (SRE)

Omaha, NE ยท On-site

$54.50 - $72.50/hr

Site Reliability Engineer (SRE) Location: Omaha, NE / Dallas, TX Job Type: Full Time Job Summary :- Seasoned Site Reliability Engineer (SRE) with 5+ years of experience in supporting complex, large ...

next page

Showing results 1-20

Sre information

See salary details

$10

$63

$91

How much do sre jobs pay per hour?

As of Jun 27, 2026, the average hourly pay for sre in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What engineers make $500,000?

Senior software engineers, especially those with expertise in high-demand areas like cloud computing, machine learning, or cybersecurity, can earn $500,000 or more annually, often through a combination of base salary, bonuses, and stock options. Achieving this level typically requires extensive experience, advanced skills, and working at large tech companies or startups with significant funding.

What is the job of SRE?

A Site Reliability Engineer (SRE) is responsible for maintaining and improving the reliability, availability, and performance of software systems. They use automation, monitoring tools, and coding skills to prevent outages and ensure system stability, often working closely with development and operations teams. SREs typically have expertise in systems engineering, scripting, and cloud platforms.

Is SRE still a thing?

Site Reliability Engineering (SRE) is an active and evolving role focused on maintaining system reliability, scalability, and automation in technology companies. SREs use tools like monitoring, incident response, and scripting to ensure high availability and performance of services. The role remains in demand across industries that rely on large-scale, complex infrastructure.

What is the difference between Sre vs DevOps Engineer?

AspectSreDevOps Engineer
CertificationsOften includes cloud certifications (AWS, GCP), Linux, and scripting skillsSimilar certifications, with focus on automation and cloud platforms
Work EnvironmentFocuses on reliability, monitoring, and incident response in production systemsEmphasizes automation, CI/CD pipelines, and infrastructure management
Industry UsagePrimarily in tech companies with large-scale systems, especially cloud-basedWidely used across tech, startups, and enterprises implementing DevOps practices

Both Sre and DevOps Engineer roles require overlapping skills in automation, cloud platforms, and scripting. Sre emphasizes system reliability and incident management, while DevOps focuses on continuous integration, deployment, and infrastructure automation. The roles often collaborate but have distinct primary focuses within the software development lifecycle.

What are SREs?

Site Reliability Engineers (SREs) are IT professionals who use a combination of software engineering and systems administration skills to build and maintain reliable, scalable, and efficient IT infrastructure. SREs are responsible for ensuring that services are available, performant, and resilient by automating operations, monitoring systems, and responding to incidents. Their work focuses on reducing manual processes, improving service reliability, and enabling faster development cycles. SREs often collaborate with development and operations teams to implement best practices and ensure system health.

What engineers make $300,000 a year?

Senior software engineers, site reliability engineers (SREs), and specialized technical leads often earn $300,000 or more annually, especially with extensive experience, advanced skills in cloud computing, distributed systems, and certifications like AWS or Google Cloud. Compensation varies by industry, location, and company size, with roles in tech giants and finance firms typically offering higher salaries.

How does an SRE typically collaborate with development and operations teams to maintain system reliability?

Site Reliability Engineers (SREs) work closely with both development and operations teams to ensure systems are reliable, scalable, and efficient. They often participate in code reviews, incident response, and post-mortem analyses, bridging gaps between software development and IT operations. SREs also help define service-level objectives (SLOs) and implement automation to reduce manual work, fostering a culture of shared responsibility for uptime and performance. Effective communication and cross-team collaboration are central to success in this role.

What are the key skills and qualifications needed to thrive as an SRE (Site Reliability Engineer), and why are they important?

To thrive as an SRE, you need a solid background in computer science or engineering, strong programming/scripting abilities, and experience with systems administration and cloud platforms. Familiarity with tools such as Kubernetes, Docker, Prometheus, Terraform, and CI/CD pipelines, along with relevant certifications like AWS Certified Solutions Architect or Google Professional Cloud DevOps Engineer, is highly beneficial. Excellent problem-solving skills, effective communication, and a proactive approach to incident management and collaboration are vital soft skills. These competencies are crucial for maintaining reliable, scalable systems and ensuring rapid incident response in dynamic production environments.
More about Sre jobs
What cities are hiring for Sre jobs? Cities with the most Sre job openings:
What are the most commonly searched types of Sre jobs? The most popular types of Sre jobs are:
What states have the most Sre jobs? States with the most job openings for Sre jobs include:
Infographic showing various Sre job openings in the United States as of June 2026, with employment types broken down into 95% Full Time, and 5% Contract. Highlights an 78% Physical, 7% Hybrid, and 15% Remote job distribution, with an average salary of $132,583 per year, or $63.7 per hour.

Principal Site Reliability Engineer (SRE)

INFINITE CHOICE LLC

Dallas, TX โ€ข On-site

$180K - $210K/yr

Full-time

Posted 2 days ago


Job description

About the Role

We're seeking an exceptional Principal Site Reliability Engineer to architect, design, and build our SRE foundation from the ground up at InfiniteChoice. This is a rare greenfield opportunity to establish SRE practices, develop custom tooling, and create the reliability culture that will support our platform serving millions of users and billions in transaction volume.

As our Principal SRE, you'll combine deep technical expertise with strategic vision to build world-class monitoring, observability, and automation systems. You'll have the autonomy to define our SRE processes, select technologies, and create the framework that ensures our systems are reliable, scalable, and performant.

Location: Remote - US based

What You Will DoSRE Foundation & Process Development
  • Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics

  • Establish incident response procedures, on-call rotations, and post-mortem processes

  • Create reliability engineering standards and best practices across all engineering teams

  • Develop disaster recovery and business continuity strategies

  • Design and implement capacity planning and performance optimization frameworks

Architecture & Tool Development
  • Drive architecture decisions for comprehensive application and infrastructure monitoring solutions

  • Design and develop custom SRE tools for automated monitoring, alerting, and remediation

  • Build observability platforms that provide deep insights into system performance and user experience

  • Create automation frameworks for deployment, scaling, and incident response

  • Architect logging, metrics, and tracing systems for distributed microservices environments

Google Cloud Infrastructure Excellence
  • Leverage Google Cloud Platform services to build resilient, scalable infrastructure

  • Implement cloud-native monitoring using Stackdriver, Cloud Monitoring, and Cloud Logging

  • Design auto-scaling and self-healing systems using GKE, Cloud Functions, and managed services

  • Optimize cloud costs while maintaining high availability and performance standards

  • Establish security and compliance frameworks within GCP environments

Innovation & Continuous Improvement
  • Research and implement cutting-edge SRE tools and methodologies

  • Leverage AI and machine learning for predictive analytics, anomaly detection, and automated remediation

  • Create dashboards and reporting systems that provide actionable insights to engineering and business teams

  • Establish feedback loops for continuous improvement of reliability and performance

  • Stay current with industry best practices and emerging technologies in the SRE space

What You Must HaveSRE & Infrastructure Expertise
  • 12+ years of experience in Site Reliability Engineering or Infrastructure Engineering

  • 5+ years in lead SRE roles building and scaling SRE teams and processes

  • Proven track record designing and implementing monitoring and observability solutions at scale

  • Deep understanding of distributed systems, microservices architectures, and cloud-native patterns

  • Experience with infrastructure as code, configuration management, and deployment automation

Google Cloud Platform Proficiency
  • Hands-on experience with Google Cloud Platform is required

  • Expertise with GCP monitoring and observability stack (Cloud Monitoring, Cloud Logging, Cloud Trace)

  • Experience with GKE, Compute Engine, Cloud Functions, and other core GCP services

  • Knowledge of GCP networking, security, and compliance capabilities

  • Understanding of GCP cost optimization and resource management

Technical Skills
  • Strong programming skills in Python, Go, Java, or similar languages

  • Experience with monitoring tools (Prometheus, Grafana, Datadog, New Relic, or similar)

  • Proficiency with containerization (Docker, Kubernetes) and orchestration platforms

  • Knowledge of CI/CD pipelines, automated testing, and deployment strategies

  • Understanding of database performance tuning and optimization (SQL and NoSQL)

AI & Automation
  • Familiarity with AI-driven development tools and methodologies is a huge plus

  • Experience with machine learning for operations (AIOps), anomaly detection, or predictive analytics

  • Knowledge of automated incident response and self-healing systems

  • Understanding of AI/ML tools for log analysis, pattern recognition, and intelligent alerting

Problem-Solving & Mindset
  • Strong analytical and troubleshooting skills for complex distributed systems

  • Experience with high-pressure incident response and crisis management

  • Detail-oriented with commitment to operational excellence and continuous improvement

  • Comfortable with ambiguity and building processes in a fast-growing environment

  • Passion for reliability, automation, and engineering best practices

  • Demonstrated experience building SRE programs and processes from the ground up is a HUGE plus

Education
  • Bachelor's degree in Computer Science, Engineering, or equivalent professional experience

  • Industry certifications (Google Cloud Professional, SRE or related certifications preferred)

What We Offer
  • Ground-floor opportunity to build SRE practices and culture from scratch

  • Full autonomy to define processes, select technologies, and establish best practices

  • Direct impact on platform reliability serving millions of users

  • Opportunity to create lasting engineering culture and operational excellence

  • Remote-first culture with in-person meeting in Dallas, TX on need basis

  • Collaborative environment with smart, passionate engineers and cross-functional teams

  • Access to cutting-edge technologies and AI-driven development tools

  • Competitive compensation, equity participation, and comprehensive benefits

Ready to Build World-Class Reliability?

Join us in creating the SRE foundation that will power InfiniteChoice's next phase of growth. If you're passionate about reliability engineering, love building systems from scratch, and want to establish the operational excellence that scales with our business, we'd love to hear from you.

About InfiniteChoice

InfiniteChoice was founded to help people find the experiences they want simply and effortlessly. We leverage a new type of business model and platform that uniquely applies automation and technology to solve the challenges of scale and complexity in experience discovery.


Existing business and marketing technologies can no longer handle the demands of connecting millions of consumers with vast inventories of experiences across a fragmented, global marketplace of people, partners, and providers.


Our mission is to disrupt this status quo by creating seamless connections between consumers and experiences. We're just at the beginning of this journey, but our approach is working: we've helped over 275 million visitors connect to millions of experiences, generating over $2 billion in revenue for our brands and partners.