Linux Site Reliability Engineer Jobs in Silver Spring, MD

We are seeking an experienced Site Reliability Engineer (SRE) to blend software engineering and systems administration practices to ensure the reliability, availability, and performance of ...

Site Reliability Engineer (SRE)

Vienna, VA · On-site

$57.25 - $76/hr

Up to 2 years in duration MUST HAVES: • Minimum of 8 years of experience as a Site Reliability Engineer with a strong understanding of SRE principles for highly scalable and reliable systems • ...

Red Hat/OpenShift SRE

Upper Marlboro, MD · On-site

$56.50 - $75/hr

Experience with Linux administration (Red Hat Enterprise Linux preferred) * Proficiency in ... Familiarity with SRE practices: monitoring, alerting, incident response, and blameless post-mortems

Staff Site Reliability Engineer

Reston, VA

$59.25 - $78.75/hr

The Site Reliability Engineering team drives reliability strategy, elevates engineering standards ... Deep Linux expertise - from kernel internals and system performance tuning to hardening and ...

Staff Site Reliability Engineer

Reston, VA · On-site

$59.25 - $78.75/hr

The Site Reliability Engineering team drives reliability strategy, elevates engineering standards ... Deep Linux expertise - from kernel internals and system performance tuning to hardening and ...

Site Reliability Engineer

Washington, DC · On-site

$114K - $190K/yr

ManTech seeks a motivated, career- and customer-oriented Site Reliability Engineer (SRE) for a new initiative. This effort supports the rapid design, deployment, operation, and sustainment of ...

Senior Site Reliability Engineer

Reston, VA · On-site

$59.25 - $78.75/hr

As a Senior Site Reliability Engineer, you will be responsible for: * System Design and Operation ... Design and manage distributed Unix-based systems, particularly Oracle Linux. * Implement auto ...

Site Reliability Engineer

Reston, VA

$59.25 - $78.75/hr

Site Reliability Engineer - The Site Reliability Engineer (i.e., SRE) role is responsible for the optimization and reliability of core technical platforms and platform services, and exerting ...

Linux Site Reliability Engineer information

How much do Linux Site Reliability Engineer jobs pay per hour?

As of Jun 8, 2026, the average hourly pay for a Linux Site Reliability Engineer in Silver Spring, MD is $65.89, according to ZipRecruiter salary data. Most workers in this role earn between $56.63 and $75.29 per hour, depending on experience, location, and employer.

What are some common challenges faced by Linux Site Reliability Engineers when scaling infrastructure, and how can they be addressed?

Linux Site Reliability Engineers often encounter challenges related to maintaining system stability and performance as infrastructure scales. Issues such as configuration drift, automation bottlenecks, and monitoring gaps can arise when managing numerous servers or services. Addressing these challenges typically involves implementing robust configuration management tools, investing in automated deployment pipelines, and enhancing observability through comprehensive monitoring and alerting solutions. Collaboration with development and operations teams is essential to ensure that scalability solutions align with business needs and technical requirements.

What are the key skills and qualifications needed to thrive as a Linux Site Reliability Engineer, and why are they important?

To thrive as a Linux Site Reliability Engineer, you need deep expertise in Linux system administration, scripting (such as Bash or Python), and a solid understanding of networking concepts, usually backed by a computer science degree or equivalent experience. Familiarity with configuration management tools (like Ansible, Puppet, or Chef), containerization (Docker, Kubernetes), and cloud platforms (AWS, GCP, or Azure) is typically required, along with relevant certifications like RHCE or AWS Certified SysOps Administrator. Strong problem-solving skills, effective communication, and the ability to work under pressure are crucial soft skills for this role. These competencies ensure the reliability, scalability, and security of complex infrastructure, minimizing downtime and supporting seamless operations.

What is the difference between a Linux Site Reliability Engineer and a Linux DevOps Engineer?

Aspect | Linux Site Reliability Engineer | Linux DevOps Engineer
Credentials | Linux certifications, SRE-specific training | Linux certifications, DevOps tools certifications
Work Environment | Focus on system reliability, monitoring, incident response | Focus on automation, CI/CD pipelines, deployment
Employer & Industry | Tech companies, cloud providers, large enterprises | Startups, tech firms, software development teams
Search & Comparison Intent | Understanding reliability roles, incident management | Automation, deployment, continuous integration

While both roles involve Linux expertise, a Linux Site Reliability Engineer primarily focuses on maintaining system reliability, monitoring, and incident response. In contrast, a Linux DevOps Engineer emphasizes automation, continuous integration, and deployment processes. Both roles require Linux skills and often overlap, but their core responsibilities differ based on organizational needs.

What is a Linux Site Reliability Engineer?

A Linux Site Reliability Engineer (SRE) is an IT professional responsible for ensuring the reliability, scalability, and performance of systems running on the Linux operating system. They bridge the gap between software development and operations by automating processes, monitoring infrastructure, and managing incidents. Linux SREs focus on system availability, building tools for deployment and monitoring, and improving system robustness through best practices and automation. Their work helps organizations deliver reliable online services and quickly recover from outages or system failures.

Site Reliability Engineer (SRE) (TS)

kgs

Washington, DC

$64.50 - $85.75/hr

Other

Medical, Dental, Vision, Retirement, PTO

Posted 21 days ago


Job description

Koniag Management Solutions, LLC (KMS), a Koniag Government Services (KGS) company, is hiring a Site Reliability Engineer (SRE). This position requires an active Top Secret/SCI clearance and the ability to meet additional security requirements. Please do not apply if you do not possess the required Top Secret clearance.
We offer competitive compensation and an extraordinary benefits package including health, dental, and vision insurance, 401(k) with company matching, flexible spending accounts, paid holidays, three weeks of paid time off, and more.
Position Summary:

We are seeking an experienced Site Reliability Engineer (SRE) to blend software engineering and systems administration practices to ensure the reliability, availability, and performance of mission-critical applications. This role focuses on automation, observability, and incident response while upholding strict Service Level Objectives (SLOs). The SRE will help build resilient systems that scale, automate manual processes, manage fleet-wide configurations, and ensure robust system monitoring. The selected candidate will support operations at Joint Base Anacostia–Bolling and must maintain an active TS/SCI clearance.

Key Responsibilities:

  • Ensure application reliability, performance, and availability through automation, monitoring, and systems engineering.
  • Develop infrastructure-as-code (IaC) solutions using Terraform, Ansible, and Desired State Configuration (DSC).
  • Build and manage containerized workloads using Kubernetes, Rancher, Docker, Helm, and related ecosystem tools.
  • Support service mesh and networking constructs such as Cilium, load balancing, ingress management, and distributed storage.
  • Engineer and maintain storage and object systems including Rook, Ceph, MinIO, and S3-compatible platforms.
  • Implement and maintain comprehensive observability platforms (metrics, logging, tracing) to support SLO monitoring and incident response.
  • Lead and participate in incident response activities, postmortem analysis, and reliability engineering improvements.
  • Develop automations, scripts, and tools using Python, PowerShell, and shell scripting.
  • Support CI/CD pipelines and cloud-native deployment methodologies.
  • Collaborate with development and operations teams to embed SRE practices into the application lifecycle.

Required Technical Certifications (at least two):

  • Security+
  • Cloud Associate (such as AWS Solutions Architect Associate, Azure AZ-104, or Google Cloud Associate Cloud Engineer)
  • Terraform Associate
  • Cloud Professional/Architect (such as AWS Solutions Architect Professional or Azure Solutions Architect Expert)
  • CKA (Certified Kubernetes Administrator)

Preferred Certifications (Plus):

  • CKA (if not used to meet required cert)
  • RHCSA
  • AWS DevOps Engineer or AZ-400
  • CCSP
  • Advanced observability certifications (Datadog, New Relic, Dynatrace, etc.)
  • Formal incident management or SRE-focused training

Required Technical Knowledge:

Strong understanding of the following technologies:

  • Kubernetes, Rancher, Helm, Docker
  • Cilium, Rook, Ceph, MinIO, S3, Portworx
  • Load balancing, ingress, and service networking
  • Ansible, Terraform, Desired State Configuration
  • Python, PowerShell, and scripting/automation
  • Distributed systems, cloud computing, and microservices architecture
  • Monitoring/observability practices and tools
  • Incident response frameworks and SLO-based operations

Preferred Experience:

  • Building scalable, fault-tolerant cloud-native systems across hybrid or multi-cloud environments.
  • Developing or supporting enterprise CI/CD pipelines.
  • Managing complex Kubernetes clusters across on-prem and cloud platforms.
  • Implementing enterprise observability stacks (e.g., Prometheus, Loki, Grafana, ELK, OpenTelemetry).
  • Supporting large-scale infrastructure within DoD or Intelligence Community environments.

Requirements:

A TS/SCI security clearance is required; candidates without one will not be considered.


Our Equal Employment Opportunity Policy:
 

The company is an equal opportunity employer. The company shall not discriminate against any employee or applicant because of race, color, religion, creed, ethnicity, sex, sexual orientation, gender or gender identity (except where gender is a bona fide occupational qualification), national origin or ancestry, age, disability, citizenship, military/veteran status, marital status, genetic information or any other characteristic protected by applicable federal, state, or local law. We are committed to equal employment opportunity in all decisions related to employment, promotion, wages, benefits, and all other privileges, terms, and conditions of employment.

 
The company is dedicated to seeking all qualified applicants. If you require an accommodation to navigate or to apply to a position on our website, please contact Heaven Wood via e-mail at accommodations@koniag-gs.com or by calling 703-488-9377 to request accommodations.

About our Company:
 

Koniag Government Services (KGS) is an Alaska Native-owned corporation supporting the values and traditions of our native communities through an agile employee and corporate culture that delivers Enterprise Solutions, Professional Services, and Operational Management to Federal Government Agencies. As a wholly owned subsidiary of Koniag, we apply our proven commercial solutions to a deep knowledge of Defense and Civilian missions to provide forward-leaning technical, professional, and operational solutions. KGS enables successful mission outcomes for our customers through solution-oriented business partnerships and a commitment to exceptional service delivery. We ensure long-term success with a continuous improvement approach while balancing the collective interests of our customers, employees, and native communities. For more information, please visit www.koniag-gs.com.

Equal Opportunity Employer/Veterans/Disabled. Shareholder Preference in accordance with Public Law 88-352.
