Improve telemetry and observability with intent. Build alerts as structured payloads with runbook ... direct hands-on competence where it isn't. * Incident Response: Serve as an escalation point for ...
Improve telemetry and observability with intent. Build alerts as structured payloads with runbook ... direct hands-on competence where it isn't. * Incident Response: Serve as an escalation point for ...
Senior Network Reliability Engineer
Zionsville, IN · On-site +1
Improve telemetry and observability with intent. Build alerts as structured payloads with runbook ... direct hands-on competence where it isn't. * Incident Response: Serve as an escalation point for ...
Senior Network Reliability Engineer
Zionsville, IN · On-site +1
Improve telemetry and observability with intent. Build alerts as structured payloads with runbook ... direct hands-on competence where it isn't. * Incident Response: Serve as an escalation point for ...
Artificial Intelligence Specialist
Fort Wayne, IN · On-site +1
$109K - $203K/yr
... promptfoo); observability (LangSmith, Phoenix); coursework or project experience in machine ... GovCloud) Direct experience with AI-assisted software development tools and agentic coding ...
Artificial Intelligence Specialist
Fort Wayne, IN · On-site +1
$109K - $203K/yr
... promptfoo); observability (LangSmith, Phoenix); coursework or project experience in machine ... GovCloud) Direct experience with AI-assisted software development tools and agentic coding ...
Senior Software Engineer
Austin, IN · On-site
$117K - $154K/yr
Director of Engineering Location: Austin (in-person) Build the infrastructure powering the future ... Design systems with observability, monitoring, and failure recovery in mind * Debug complex ...
Senior Software Engineer
Austin, IN · On-site
$117K - $154K/yr
Director of Engineering Location: Austin (in-person) Build the infrastructure powering the future ... Design systems with observability, monitoring, and failure recovery in mind * Debug complex ...
ConvergeProsperity - Product Forward Deployed Engineer, Insurance Growth Suite (Manager) - Innova...
This is a high-impact, high-visibility role with direct influence on product direction. You are ... observability * Proficiency across multi-cloud environments , with AWS as a primary platform ...
ConvergeProsperity - Product Forward Deployed Engineer, Insurance Growth Suite (Manager) - Innova...
This is a high-impact, high-visibility role with direct influence on product direction. You are ... observability * Proficiency across multi-cloud environments , with AWS as a primary platform ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Software Engineer III - Java Full Stack Developer
$50.25 - $64.75/hr
Communicate regularly with Engagement Managers (Directors), project team members, and ... Establish and enforce standards for code quality, testing, CI/CD, documentation, and observability ...
Software Engineer III - Java Full Stack Developer
$50.25 - $64.75/hr
Communicate regularly with Engagement Managers (Directors), project team members, and ... Establish and enforce standards for code quality, testing, CI/CD, documentation, and observability ...
Advanced Integrations Engineer
$99K - $133K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
$99K - $133K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Senior React Native Software Engineer
Austin, IN · On-site
$127K - $159K/yr
Director of Engineering Location: Austin, TX About the Role inKind is seeking a Senior React Native ... Improve application observability, reliability, and scalability * Help shape frontend architecture ...
Senior React Native Software Engineer
Austin, IN · On-site
$127K - $159K/yr
Director of Engineering Location: Austin, TX About the Role inKind is seeking a Senior React Native ... Improve application observability, reliability, and scalability * Help shape frontend architecture ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
ML Engineer
Indianapolis, IN · On-site +1
Collaborate on model serving, monitoring, logging, and observability. * Assist with infrastructure ... Direct influence on product architecture and technical strategy. * Remote-first work environment.
Quick apply
ML Engineer
Indianapolis, IN · On-site +1
Collaborate on model serving, monitoring, logging, and observability. * Assist with infrastructure ... Direct influence on product architecture and technical strategy. * Remote-first work environment.
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Cloud Platform Delivery Lead - AWS: Senior Manager
$60.75 - $83.25/hr
Direct the team through complexity, demonstrating composure through ambiguous, challenging and ... observability tooling. Demonstrates the ability to design, implement, and validate architecture ...
Cloud Platform Delivery Lead - AWS: Senior Manager
$60.75 - $83.25/hr
Direct the team through complexity, demonstrating composure through ambiguous, challenging and ... observability tooling. Demonstrates the ability to design, implement, and validate architecture ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Advanced Integrations Engineer
Carmel, IN · On-site
$98K - $132K/yr
Implement proactive monitoring, alerting, and observability for interfaces and FHIR services; drive ... Able to work independently and be self-directed * Excellent analytical, programming ...
Artificial Intelligence Specialist
Fort Wayne, IN · On-site
$109K - $203K/yr
... promptfoo); observability (LangSmith, Phoenix); coursework or project experience in machine ... GovCloud) Direct experience with AI-assisted software development tools and agentic coding ...
Artificial Intelligence Specialist
Fort Wayne, IN · On-site
$109K - $203K/yr
... promptfoo); observability (LangSmith, Phoenix); coursework or project experience in machine ... GovCloud) Direct experience with AI-assisted software development tools and agentic coding ...
Director Observability information
What is the difference between Director Observability vs Site Reliability Engineer?
| Aspect | Director Observability | Site Reliability Engineer |
|---|---|---|
| Primary Focus | Oversees observability strategies, tools, and teams to ensure system visibility and performance | Builds and maintains reliable systems, automates deployment, and manages incident response |
| Credentials | Typically requires advanced knowledge of monitoring, cloud platforms, and leadership experience | Often has software engineering background, with skills in scripting, automation, and systems engineering |
| Work Environment | Leads teams in tech companies, focusing on monitoring and analytics tools | Works closely with development and operations teams to ensure system reliability |
While both roles focus on system performance and reliability, the Director Observability primarily manages observability strategies and teams, whereas the Site Reliability Engineer is hands-on, building and maintaining reliable systems. The roles complement each other in ensuring optimal system performance and uptime.
What are the key skills and qualifications needed to thrive as a Director of Observability, and why are they important?
How does a Director of Observability typically collaborate with engineering and operations teams to drive organizational goals?
What does a Director of Observability do?
Full-time
Medical, Dental, Vision, Life, Retirement
Posted 14 days ago
Group1001 rating
9.5
Based on 8 frontline employees who took The Breakroom Quiz
9th of 261 rated insurance
Job description
Group 1001is a consumer-centric, technology-driven family of insurance companies on a mission to deliver outstanding value and operational performance by combining financial strength and stability with deep insurance expertise and a can-do culture. Group1001's culture emphasizes the importance of collaboration, communication, core business focus, risk management, and striving for outcomes. This goal extends to how we hire and onboard our most valuable assets - our employees.
Why This Role Matters:
The Platform Engineering Services team at Group 1001 is building a Site Reliability Engineering practice with a network scope. We're hiring an Sr. Network Reliability Engineer who embodies Innovation and Excellence, and will apply SRE principles - code-as-source-of-truth, SLOs and error budgets, alerting on symptoms rather than causes, failure-mode-first design, and the elimination of toil - to the firm's network platform from carrier edge through cloud fabric to Kubernetes pod boundary. This is not a "keep the lights on" role. You will systematically engineer the lights-on work out of existence, build the abstractions that let other engineering teams express network intent in code, and treat the network as a single engineered system rather than a collection of vendor consoles. You will operate inside a DevSecOps practice spanning multi-cloud, multi-region environments, and you will partner closely with Cloud and Data Platforms, the NOC/SOC, and Cyber Security to extend reliability practice across the firm.
How You'll Contribute:
- Treat reliability as an engineered property. Define SLOs and error budgets for the network platform - DNS resolution, edge availability, mesh ingress success, cross-region path health - and use them to gate changes, not just to color dashboards. Lead postmortems with a focus on permanent remediation, not pattern-recognition. Alert on symptoms users feel, not on causes that may or may not produce impact.
- Move network state into code. Use Terraform (or Pulumi), Ansible, and Python to replace CLI-driven configuration with declarative, version-controlled, peer-reviewed change running through Infra CI/CD. This applies equally to the edge tier (Cloudflare), security platforms (Zscaler ZIA/ZPA, ZTNA policies, next-gen firewalls), the cloud network fabric (Transit Gateway, Cloud WAN, VPCs, Route53, IPAM), and increasingly the Kubernetes and service-mesh layer.
- Build network policy as intent, not rule lists. Express what flows are permitted, what segments are isolated, what egress is inspected, what zones share DNS - and engineer the compilers that turn that intent into per-vendor configuration. Use Policy as Code (OPA/Rego, Sentinel, Cilium NetworkPolicy) to catch invariant violations at plan time, not apply time.
- Infrastructure as Code (IaC): Design, deploy, and manage network infrastructure using Terraform or Ansible, moving the firm away from manual configuration to a code-first approach.
- Engineer the cloud network platform. Operate and extend our multi-account AWS Landing Zone - Cloud WAN segmentation, Transit Gateway peering, IPAM-driven CIDR allocation, shared private DNS, cross-account telemetry pipelines. Build the platform abstractions that make a new account or service land correctly with policy and connectivity composed from declarative inputs.
- Extend platform thinking into the container tier. Kubernetes networking, service mesh (Istio, Linkerd, Consul Connect), eBPF-based observability and policy (Cilium, Hubble), and the integration points where mesh-level authz meets cloud-tier identity. Recognize that an "internal" service is one logical hop on a chain of policy enforcement points and engineer for that explicitly.
- Improve telemetry and observability with intent. Build alerts as structured payloads with runbook links, suspected blast radius, and dependency-aware suppression. Author both system-health dashboards for operators and end-user monitoring dashboards that reflect actual user experience. Use Grafana, Elastic, Open Telemetry where each fits.
- Mentor and grow the team. Provide technical guidance to junior engineers, foster a culture of learning, and work out loud across Platform Engineering so the patterns you build cross-pollinate to adjacent domains.
- Handle hardware when required. Provide maintenance and configuration support for routers, switches, and firewalls at data centers and offices when needed - bringing code-first practices to physical hardware where possible (templating, change validation, zero-touch provisioning) and direct hands-on competence where it isn't.
- Incident Response: Serve as an escalation point for network issues, some complex and some basic but not yet covered by runbooks. Troubleshooting with a focus on root cause analysis and permanent remediation with a documentation-first mindset.
- Reduce toil and hand off cleanly. Repetitive operational tasks are scoped engineering problems with measurable payoff. Author runbooks and SOPs that the NOC can execute confidently; package routine work for L1/L2 handoff so engineering interrupt drops over time. Coordinate across Data Platforms, NOC/SOC, and Cyber Security so reliability practices spread instead of staying siloed.
What We're Looking For:
- Network Engineering: Deep understanding of TCP/IP, BGP, OSPF, VPNs, and SD-WAN architecture.
- Automation: Proven experience with Terraform (state management, modules) and Ansible (playbooks, roles) - or similar - in a production environment. Proficiency in Python for automation and API interaction, or similar.
- Security Platforms: Hands-on experience with Cloudflare, zScaler, and/or enterprise firewalls.
- Observability: Experience configuring monitoring tools (e.g., Datadog, Prometheus, Grafana) to create meaningful alerts and dashboards.
Nice to Have
- Service mesh experience (Istio, Linkerd, Consul Connect, Cilium).
- eBPF-based observability (Hubble, Pixie).
- AWS Multi-account landing zone tooling experience (AFT, Control Tower, or equivalent).
- Policy as Code experience (OPA/Rego, Sentinel, Cilium NetworkPolicy).
- Professional Attributes
- Documentation First: A strong belief that a job isn't done until the documentation in written.
- Toil Reduction: A mindset that actively seeks to automate repetitive tasks.
- Hybrid Capability: Willingness to handle physical hardware tasks when required while maintaining a software-centric engineering mindset.
Compensation:
Our compensation reflects the cost of labor across several U.S. geographic markets. The base pay for this position ranges from $135,000/year in our lowest geographic market up to $190,000/year in our highest geographic market. Pay is based on factors such as market location, job-related skills, and experience.
Benefits Highlights:
Employees who meet benefit eligibility guidelines and work 30 hours or more weekly, have the ability to enroll in Group 1001's benefits package. Employees (and their families) are eligible to participate in the Company's comprehensive health, dental, and vision insurance plan options. Employees are also eligible for Basic and Supplemental Life Insurance, Short and Long-Term Disability. All employees (regardless of hours worked) have immediate access to the Company's Employee Assistance Program and wellness programs-no enrollment is required. Employees may also participate in the Company's 401K plan, with matching contributions by the Company.
Group 1001, and its affiliated companies, is strongly committed to providing a supportive work environment where employee differences are valued. Diversity is an essential ingredient in making Group 1001 a welcoming place to work and is fundamental in building a high-performance team. Diversity embodies all the differences that make us unique individuals. All employees share the responsibility for maintaining a workplace culture of dignity, respect, understanding and appreciation of individual and group differences.
#LI-REMOTEAbout Group1001
Sourced by ZipRecruiter
Company size
201 - 500 Employees
Headquarters location
Indianapolis, IN, US
Year founded
2013