Observability Site Reliability Engineer Jobs in Indiana

Site Reliability Engineer II

South Bend, IN

$55.75 - $74.25/hr

The SRE II sits at the intersection of software engineering and ... Reliability & Observability * Define, instrument, and enforce SLIs and SLOs in partnership with ...

Director, DevSecOps & SRE

Carmel, IN · On-site

$57 - $75.75/hr

Create and own an Observability Platform to track and support application health * Hire, mentor, and guide SRE and cloud infrastructure engineers. * Determine technical objectives and manage software ...

Director, DevSecOps & SRE

Carmel, IN

$57 - $75.75/hr

Create and own an Observability Platform to track and support application health * Hire, mentor, and guide SRE and cloud infrastructure engineers. * Determine technical objectives and manage software ...

Site Reliability Engineer

Denver, IN · Hybrid

$81K - $142K/yr

As a Site Reliability Engineer reporting to Director, System Operations, you'll play a critical role in the delivery, integration, and support of complex, distributed, high-availability solutions for ...

Scope spans four major capabilities and platform components Core Capabilities & Responsibilities * SRE & DevOps integration, Monitoring and observability * Quality Engineering & Test Engineering ...

Site Reliability Program Support: Support and contribute to the site's reliability program while ... Bachelor's Degree in Engineering (Mechanical Engineering preferred). * Required Experience:

$170K - $200K/yr

Drive modernization across cloud connectivity, observability, security, and AI-assisted development without compromising legacy stability. * Uncompromising Quality & SRE: Define and enforce rigorous ...

Senior DevOps Engineer

Indianapolis, IN · On-site

$124K - $159K/yr

Implement observability solutions using metrics, logging, and tracing to enable proactive issue ... Apply Site Reliability Engineering (SRE) practices such as SLIs/SLOs, error budgets, capacity ...

Senior DevOps Engineer

Indianapolis, IN

$124K - $159K/yr

Implement observability solutions using metrics, logging, and tracing to enable proactive issue ... Apply Site Reliability Engineering (SRE) practices such as SLIs/SLOs, error budgets, capacity ...

You'll partner closely with Product, Infrastructure, Data, Security, and SRE teams to deliver ... Experience with observability, monitoring, and incident response * Ability to own features and ...

Senior DevOps Engineer

Carmel, IN · On-site

$129K - $166K/yr

... observability, and security of our cloud platforms. Hybrid: At Allegion, we are driven by a bold ... Proactively improve site reliability and operational SLAs (e.g., uptime, performance, time to ...

Senior DevOps Engineer

Carmel, IN

$129K - $166K/yr

... observability, and security of our cloud platforms. Hybrid: At Allegion, we are driven by a bold ... Proactively improve site reliability and operational SLAs (e.g., uptime, performance, time to ...

Regional Reliability Engineer

Gary, IN · On-site

$102K - $128K/yr

This includes factory and site acceptance testing that will assure adherence to functional ... Collaborate with Project Management and Engineering on new projects and carry out design reviews on ...

Observability Site Reliability Engineer information

What is the difference between an Observability Site Reliability Engineer and a Monitoring Engineer?

| Aspect | Observability Site Reliability Engineer | Monitoring Engineer |
| --- | --- | --- |
| Focus | Ensuring system reliability through observability, automation, and incident response | Implementing and managing monitoring tools and dashboards |
| Skills | Cloud platforms, scripting, incident management, observability tools | Monitoring tools, alerting systems, data analysis |
| Work Environment | DevOps teams, cloud infrastructure, large-scale systems | Operations teams, infrastructure monitoring |

While both roles involve system health, the Observability Site Reliability Engineer focuses on comprehensive system reliability using observability practices, whereas Monitoring Engineers primarily manage monitoring tools and alerts. The SRE role emphasizes automation, incident response, and system resilience, making it broader in scope.

Infographic showing Observability Site Reliability Engineer job openings in Indiana as of June 2026: 100% of openings are full-time and 100% are in-person.
Site Reliability Engineer II

Kastle Systems

South Bend, IN

$55.75 - $74.25/hr

Other

Medical, Dental, Vision, Retirement

Posted 17 days ago


Kastle Systems rating

Company rating: 9.2 out of 10

Based on 6 frontline employees who took The Breakroom Quiz

3rd of 102 rated security


Job description

Overview

Join the leader in providing smarter solutions for a safer world.

The property technology space is growing rapidly, and Kastle Systems is leading the way. Kastle Systems is the leader in managed security, with a track record of introducing innovative technologies to serve over 460M square feet of real estate globally. Clients span the commercial and multifamily real estate, education, and construction industries and the customers they serve. Delivering a world-class customer experience drives everything we do, and Kastle’s mission is to be our customers’ best service provider and to ensure that their security is the most effective, efficient, and convenient. Kastle's integrated security solution, including access control, video, and remote video monitoring, significantly reduces costs and improves the critically important 24x7 performance for building owners, developers, and tenants.

Site Reliability Engineer II

The SRE II sits at the intersection of software engineering and platform operations. You will own the reliability, scalability, and operational hygiene of Kastle’s core infrastructure – engineering away toil, hardening deployment pipelines, and partnering with product engineering teams to make new services production-ready from day one.

This is a mid-level individual contributor role. You are expected to execute technical work independently, drive reliability improvements end-to-end, and participate meaningfully in architecture discussions. You will carry on-call responsibilities as part of a shared rotation with a well-defined escalation model and a strong blameless post-incident review culture.

The team is in the middle of a meaningful platform evolution: formalizing multi-tier release pipelines (Dev → QA → Integration → UAT → Prod) with ArgoCD-based approval gates, building out SLI/SLO frameworks, and migrating toward full GitOps. You will be a hands-on contributor to all of it.

Key Responsibilities:
Release Engineering & GitOps
  • Own and evolve the multi-stage deployment pipeline using ArgoCD, including approval gates, promotion policies, and rollback mechanisms.
  • Maintain trunk-based branching discipline and enforce release governance standards across the engineering organization.
  • Manage feature flag lifecycle – from creation and gradual rollout to deprecation – in coordination with product and QA teams.
  • Build and maintain CI/CD pipelines that enable safe, frequent, and auditable deployments.
Infrastructure as Code & Cloud Operations
  • Provision and manage Azure infrastructure using Terraform or OpenTofu, maintaining drift-free state aligned with GitOps principles.
  • Own Kubernetes cluster operations including workload scheduling, resource optimization, RBAC, network policy, and cost governance.
  • Identify and act on infrastructure cost optimization opportunities (compute rightsizing, storage tier selection, idle resource elimination).
  • Support Crossplane or similar operator patterns for Kubernetes-native infrastructure management where applicable.
Reliability & Observability
  • Define, instrument, and enforce SLIs and SLOs in partnership with product engineering teams.
  • Build and maintain observability infrastructure – metrics, logs, and distributed traces – using Prometheus, Grafana, OpenTelemetry, or equivalent tooling.
  • Conduct proactive capacity planning and performance tuning across multi-tenant, distributed environments.
  • Establish and maintain runbooks, dashboards, and alerting policies that reduce cognitive overhead during incidents.
Incident Management
  • Participate in shared on-call rotation covering core platform and infrastructure services; on-call load is balanced across the team with structured handoff practices.
  • Lead mitigation of live production incidents with a focus on minimizing MTTR and clear stakeholder communication under pressure.
  • Facilitate blameless post-incident reviews and drive preventative engineering to closure – not just documentation.
Engineering Partnership
  • Embed with product engineering teams during design and architecture phases to establish reliability, scalability, and security requirements before code is written.
  • Maintain clear, comprehensive documentation for infrastructure architecture, operational procedures, and onboarding guides.
  • Push back constructively when proposed designs compromise reliability or operability, proposing alternatives rather than just raising concerns.

In addition to a great work environment, we provide excellent benefits (Medical/Dental/Vision, 401K, Tuition/Training Assistance, BrightHorizons Lifestyle Assistance, Wellness Program, etc.) and we're proud to be a Certified Great Place to Work! For more information about what it's like to work with us, please visit Kastle Careers.


Requirements
  • Experience: 4–6 years in an SRE, Platform Engineering, or Infrastructure Engineering role, with demonstrated ownership of production systems.
  • Cloud – Azure: Hands-on experience managing production infrastructure in Azure: AKS, Azure Container Registry, Azure Monitor, Cosmos DB, Key Vault, Azure Front Door, or equivalent services. AWS/GCP backgrounds considered with clear willingness to operate in Azure.
  • Kubernetes: Deep operational experience with Kubernetes in production: resource management, network policies, RBAC, HPA/VPA, persistent volumes, and debugging live workload issues.
  • GitOps & Release Tooling: Experience with ArgoCD, Flux, or equivalent GitOps deployment tools. Familiarity with multi-stage progressive delivery and approval gate patterns is a strong plus.
  • Infrastructure as Code: Proven track record with Terraform, OpenTofu, or Pulumi in a production GitOps context – not just writing HCL, but maintaining drift-free state and managing state backends safely.
  • Observability: Hands-on configuration of Prometheus, Grafana, OpenTelemetry, and/or ELK/OpenSearch. Ability to go from symptom to instrumentation to dashboard without hand-holding.
  • Programming & Scripting: Proficiency in Python or Go for automation and tooling; strong Bash scripting. Ability to read and reason about application code when debugging production issues. Proficiency in C# and SQL for reviewing deliverables and participating in triage.
  • Linux & Networking: Solid understanding of Linux internals, TCP/IP, DNS, TLS, and HTTP semantics. Comfortable debugging at the network and OS layer.

Qualifications
  • Experience with Crossplane or other Kubernetes-native infrastructure operators.
  • Familiarity with feature flag platforms (LaunchDarkly, Flagsmith, or similar) and gradual rollout strategies.
  • Background in IoT, physical security, access control, or other latency-sensitive, event-driven domains.
  • Comfort with async collaboration across distributed time zones (US + India team structure).
  • Experience with AI-assisted development tooling and an appetite to incorporate it into engineering workflows.
  • Knowledge of CMMC 2.0, SOC 2, or FedRAMP compliance postures as they apply to infrastructure and access control.

Equal Opportunity Statement

At Kastle, we believe that diversity makes us stronger - at work and in the world. Kastle Systems International, LLC is an Equal Opportunity / Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, marital status, pregnancy or any other basis protected by applicable federal or state laws.
