Site Reliability Engineer
$64.25 - $85.50/hr
... to improve service reliability and performance · Support backlog refinement and reliability engineering initiatives · Document runbooks, procedures, and knowledge articles · Contribute to ...
Millersville, MD · On-site +1
$55.50 - $73.50/hr
Participate in incident response activities, service restoration efforts, and post-incident ... engineering, observability, automation, and reliability practices through hands-on work and ...
Reston, VA · On-site
$59.50 - $79/hr
This role will span from the OpenShift platform to services provided by Azure. We're proud of the ... Engineering, Site Reliability Engineering, Platform Engineering or other similar role, with ...
The SRE will champion the overall health of OF core technical platforms, lead the response to ... This role will span from the OpenShift platform to services provided by Azure. We're proud of the ...
Respond to and resolve system outages, impairments, and service disruptions while coordinating with ... Expert knowledge of site reliability engineering practices, system monitoring, incident management ...
Millersville, MD · On-site +1
$55.50 - $73.50/hr
Define, track, and report service level indicators (SLIs), service level objectives (SLOs), and error budgets to guide engineering decisions and service improvements. Automation, CI/CD ...
... Services contract. Responsibilities : • Monitor and maintain system reliability, availability ... Required : • Expert knowledge of site reliability engineering practices, system monitoring ...
Bethesda, MD · On-site
$61 - $81/hr
Site Reliability Engineer (SRE) / Service Availability Manager Location: 7750 Wisconsin Avenue, Bethesda, MD 20814 Duration: Fulltime/Contract Required Qualifications: 5+ years of experience in an ...
$118K - $177K/yr
The Reliability Systems Engineer will be required to interact with other OTECH personnel and the ... We develop products and services for use throughout the lifecycle of an offshore oilfield, from ...
Alexandria, VA · Hybrid
$60.75 - $81/hr
RiVidium is seeking a Site Reliability Engineer / Platform Reliability Engineer to support our ... Modernization & Innovation and helps deliver mission-focused outcomes for service members, families ...
$116K - $146K/yr
Ardent is seeking a Reliability Engineer to join our team. This is an onsite role in Ashburn, VA ... Hands-on experience with Amazon Web Services (AWS) and cloud-based monitoring tools. * Ability to ...
Vienna, VA · On-site
$57.25 - $76/hr
The AWS Site Reliability Engineer (SRE) is responsible for the operational health, availability ... You will define and track Service Level Objectives (SLOs) to balance reliability with innovation as ...
Washington, DC · On-site
$64.25 - $85.50/hr
... Service Level Objectives (SLOs) and Service Level Agreements (SLAs). Qualifications : Required : • Bachelor's degree in Computer Science, Engineering, or a related technical discipline. • 5 or ...
Washington, DC · On-site
$158K - $178K/yr
Koniag Management Solutions, LLC (KMS), a Koniag Government Services (KGS) company, is hiring a Site Reliability Engineer (SRE). Position requires an active Top Secret/SCI clearance with ability to ...
Arlington, VA · On-site
$65.75 - $87.25/hr
Responsibilities : • Define, implement, and maintain site reliability engineering practices for mission-critical applications and shared services, with emphasis on uptime, resiliency ...
Alexandria, VA · On-site
$61 - $81/hr
Work with cloud and development teams on observability, automation, and service health improvements. * Contribute to performance tuning, incident follow-up, and reliability reporting. Basic ...
Washington, DC · On-site
$64.50 - $85.75/hr
Koniag Management Solutions, LLC (KMS), a Koniag Government Services (KGS) company, is hiring a Site Reliability Engineer (SRE). Position requires an active Top Secret/SCI clearance with ability to ...
| Salary range (per year) | Share of jobs |
|---|---|
| $69.1K - $77.3K | 0% |
| $77.3K - $85.6K | 2% |
| $85.6K - $93.8K | 3% |
| $93.8K - $102K | 8% |
| $102K - $110.3K | 7% |
| $110.3K - $118.5K | 5% |
| $118.5K - $126.7K | 4% |
| $126.7K - $135K | 3% |
| $135K - $143.2K | 2% |
| $143.2K - $151.5K | 63% |
| $151.5K - $159.7K | 2% |

$118.5K is the 25th percentile. Wages below this are outliers.
The median wage is $145.2K / yr.
Chart axis markers: $69.1K, $133.6K, $159.7K.

| Aspect | Service Reliability Engineer | Site Reliability Engineer |
|---|---|---|
| Credentials | Typically requires experience in software engineering, cloud platforms, and monitoring tools | Similar credentials, often with a focus on software development and systems engineering |
| Work Environment | Works closely with development and operations teams to ensure service reliability | Works on maintaining and improving system reliability, often in cloud or data center environments |
| Industry Usage | Common in tech companies focusing on service uptime and customer experience | Widely used in tech, especially in cloud and large-scale infrastructure companies |

Both roles focus on system reliability and typically call for similar skills and certifications. The main difference is terminology and organizational focus; in practice, the two perform comparable functions in maintaining high service availability.

$64.25 - $85.50/hr
Other
Posted 12 days ago
Job Title: Site Reliability Engineer (SRE)
Location: Washington, DC (Onsite)
Clearance: TS/SCI
Position Overview
Seeking a highly motivated Site Reliability Engineer (SRE) to support mission-critical enterprise applications and infrastructure in a high-availability environment. The SRE will be responsible for ensuring system reliability, performance, scalability, and operational efficiency through proactive monitoring, automation, and rapid incident response.
This role bridges development and operations, partnering closely with engineering teams to ensure new capabilities are delivered without compromising production stability. The ideal candidate brings strong Linux expertise, automation skills, and hands-on experience with cloud-native and containerized environments.
Key Responsibilities
Monitoring & Performance
· Monitor system health, availability, and performance using enterprise observability tools
· Analyze metrics and logs to proactively detect and remediate issues
· Tune alerting to reduce noise and prioritize mission impact
Incident Management & Reliability
· Respond to and resolve production incidents across distributed environments
· Perform root cause analysis and lead post-incident reviews
· Implement corrective and preventive actions to improve resilience
· Participate in on-call rotation for outages, upgrades, and urgent activities
Automation & DevOps Enablement
· Automate repetitive operational tasks to improve efficiency and reduce human error
· Support CI/CD pipelines and automated deployment workflows
· Develop scripts and tooling to improve reliability and repeatability
Platform & Infrastructure Support
· Maintain Linux/Unix systems and containerized workloads
· Support Kubernetes/Docker environments and microservices architectures
· Assist with configuration management and environment standardization
· Ensure secure and compliant system configurations
Collaboration & Continuous Improvement
· Partner with development teams to improve service reliability and performance
· Support backlog refinement and reliability engineering initiatives
· Document runbooks, procedures, and knowledge articles
· Contribute to continuous service improvement efforts
Required Qualifications
Education & Experience
· Bachelor’s degree in Computer Science, Engineering, or related technical field
· Minimum 5 years of relevant technical experience
· At least 3 years of systems programming or SRE/DevOps experience
Technical Skills
· Strong proficiency in Python, Bash, or similar scripting languages
· Hands-on experience with Linux/Unix administration
· Experience with Kubernetes and Docker
· Familiarity with cloud platforms (AWS, Azure, or Google Cloud)
· Experience with monitoring and logging tools (e.g., Grafana, Kibana, Prometheus, ELK)
· Working knowledge of CI/CD tools (e.g., GitLab, Jenkins, ArgoCD)
· Understanding of microservices architecture and DevOps practices
· Experience with Git-based workflows
Infrastructure & Networking
· Knowledge of networking fundamentals, load balancers, and firewalls
· Experience with identity and access management (IAM, SSH, VPN, security groups)
· Experience deploying to on-premises or data center environments
Professional Skills
· Strong analytical and troubleshooting abilities
· Excellent time management and ability to work independently
· Effective written and verbal communication skills
· Experience using Jira and Confluence in an Agile environment
Preferred Qualifications
· Experience defining or working with SLIs, SLOs, and error budgets
· Familiarity with Helm and Kubernetes deployment pipelines
· Experience supporting high-availability or mission-critical systems
· Knowledge of security best practices and compliance frameworks