Senior Observability Engineer Jobs in Delaware (NOW HIRING)

Sr. DevOps Platform Engineer

Wilmington, DE

$126.10K - $162.10K/yr

As a Senior DevOps Platform Engineer, you will play a critical role in ensuring the reliability ... Implement and evolve monitoring, alerting, and observability solutions, including AIOps ...

... observability, automation, and reliability across both cloud and on‑premises environments ... E teams. - Identify and address performance, scalability, and reliability bottlenecks across ...

New

Sr. DevOps Platform Engineer

Wilmington, DE · On-site +1

$51.25 - $70/hr

... observability, automation, and reliability across both cloud and on‑premises environments ... E teams. - Identify and address performance, scalability, and reliability bottlenecks across ...

New

Sr. DevOps Platform Engineer

Wilmington, DE · On-site

$51.25 - $70/hr

Implement and evolve monitoring, alerting, and observability solutions, including AIOps ... E teams. * Identify and address performance, scalability, and reliability bottlenecks across ...

Sr. DevOps Platform Engineer

Wilmington, DE

$51.25 - $70/hr

Implement and evolve monitoring, alerting, and observability solutions, including AIOps ... E teams. * Identify and address performance, scalability, and reliability bottlenecks across ...

Apply cloud best practices--security, cost awareness, performance, and operational efficiency--under guidance of senior engineers. * Contribute to observability through metrics, logging, tracing ...

New

Senior Observability Engineer information

What are the key skills and qualifications needed to thrive as a Senior Observability Engineer, and why are they important?

To thrive as a Senior Observability Engineer, you need expertise in monitoring, logging, and tracing systems, with a solid background in computer science or a related field. Employers typically require familiarity with tools such as Prometheus, Grafana, the ELK stack, and cloud platforms, along with certifications such as AWS Certified DevOps Engineer. Strong problem-solving, collaboration, and communication skills are critical for diagnosing and resolving complex infrastructure issues. Together, these skills support reliable system performance, rapid incident response, and continuous improvement of the technology environment.

How does a Senior Observability Engineer typically collaborate with development and operations teams?

A Senior Observability Engineer works closely with both development and operations teams to ensure robust monitoring, logging, and tracing solutions are in place across all applications and infrastructure. They often participate in architecture discussions to advise on best practices for instrumenting code and systems for observability. By analyzing metrics and alerting patterns, they help teams proactively resolve issues and optimize system performance. This role also involves mentoring engineers on observability tools and fostering a culture of transparency and accountability in incident response.

What is a Senior Observability Engineer?

A Senior Observability Engineer is a seasoned IT professional responsible for designing, implementing, and maintaining systems that monitor and provide insights into the performance, health, and reliability of software applications and infrastructure. They utilize tools for logging, monitoring, tracing, and alerting to ensure that systems are observable and any issues can be quickly detected and resolved. In addition to technical expertise, they often collaborate with development and operations teams to establish best practices, improve incident response, and optimize system performance. Their work is crucial for maintaining uptime, enhancing customer experiences, and supporting the scalability of technology platforms.

What is the difference between a Senior Observability Engineer and a Site Reliability Engineer?

Aspect | Senior Observability Engineer | Site Reliability Engineer
Credentials | Experience with monitoring tools, scripting, cloud platforms | Same as Senior Observability Engineer, often with SRE certifications
Work Environment | Focus on monitoring, logging, and tracing systems | Focus on system reliability, automation, and incident response
Industry Usage | Used in tech companies emphasizing system observability | Common in large-scale tech and cloud services
Search/Comparison Intent | Often compared for monitoring roles | Compared for reliability and system stability roles

While both roles require expertise in cloud platforms and scripting, the Senior Observability Engineer primarily focuses on designing and maintaining monitoring, logging, and tracing systems to ensure system visibility. In contrast, a Site Reliability Engineer emphasizes system reliability, automation, and incident management to maintain service uptime. Both roles are vital in tech environments but serve different core functions related to system health and stability.

Infographic: Senior Observability Engineer job openings in Delaware as of May 2026. By employment type: 89% Full Time, 5% Part Time, and 6% Contract. By work arrangement: 91% Physical, 4% Hybrid, and 5% Remote.

Sr. DevOps Platform Engineer

Berkley Technology Services

Wilmington, DE • On-site

$126.10K - $162K/yr

Full-time

Posted 27 days ago


Job description

Job Summary:
Berkley Technology Services (BTS) provides technology solutions for W. R. Berkley Corporation, a Fortune 500 commercial lines insurance company. BTS is seeking a Senior DevOps Platform Engineer to ensure the reliability, scalability, security, and performance of Berkley’s software systems, collaborating closely with product engineering, infrastructure, and architecture teams to build and operate an enterprise DevOps platform.
Responsibilities:
• Maintain a strong understanding of the entire technology stack (networking, storage, OS, virtualization, databases, development frameworks, and applications) to design, observe, troubleshoot, and automate systems across the Berkley environment.
• Design, build, and mature enterprise CI/CD pipelines and shared DevOps platform services, enabling secure, reliable, and scalable software delivery for multiple teams.
• Define, implement, and track reliability and observability OKRs, including SLIs and SLOs, to guide reliability engineering, deployment practices, and operational decision-making.
• Implement and evolve monitoring, alerting, and observability solutions, including AIOps capabilities, to proactively assess system health, detect anomalies, enable self-healing, and support rapid incident response.
• Drive automation initiatives to eliminate operational toil, streamline platform and pipeline workflows, reduce manual intervention, and improve efficiency for product engineering and SRE teams.
• Identify and address performance, scalability, and reliability bottlenecks across applications, infrastructure, and delivery pipelines to improve system efficiency and user experience.
• Partner with incident management and operations teams to respond to, resolve, and prevent system outages or degradation, minimizing downtime and customer impact.
• Collaborate actively with development, operations, and platform teams to embed resiliency, observability, security, and reliability requirements into system design, CI/CD pipelines, and runtime environments.
• Lead cross-functional coordination with product, development, infrastructure, and architecture teams to perform capacity planning, anticipate growth, and ensure systems scale reliably with business demand.
• Continuously improve platform resilience by identifying and closing gaps in architecture, tooling, processes, and operational practices.
• Modernize and strengthen disaster recovery capabilities for both on-premises and cloud-based Berkley solutions, ensuring recoverability, resilience, and compliance with enterprise standards.
Qualifications:
Required:
• 5+ years of experience in DevOps and Site Reliability Engineering, with hands-on ownership of infrastructure, CI/CD platforms, and software delivery in enterprise environments.
• Strong software engineering and automation skills, including proficiency in Python, Go, Bash, or JavaScript, and experience building production-grade automation.
• Proven expertise in enterprise CI/CD, GitOps, and containerized platforms, including Kubernetes, Helm, and cloud-native delivery patterns.
• Deep experience with reliability and observability, including monitoring, alerting, logging, and tracing platforms (e.g., Dynatrace, Datadog, ELK), and defining SLIs, SLOs, and reliability metrics.
• Strong understanding of cloud, on-prem, and hybrid architectures, including high availability, disaster recovery, capacity planning, and scalability.
• Hands-on experience with infrastructure as code and configuration management (e.g., Terraform, Ansible, GitHub Actions) to reduce operational toil and enable self-service.
• Solid knowledge of security and networking fundamentals, including applying industry-standard security frameworks in enterprise environments.
• Demonstrated ability to lead technical initiatives, influence system design decisions, mentor engineers, and collaborate effectively across product, engineering, infrastructure, and security teams.
• Bachelor’s degree with an emphasis in a related field, or equivalent experience.
Company:
Berkley Technology Services offers networking, software development, UI/UX design, project management, and IT shared services. Founded in 2001, the company is headquartered in Wilmington, USA, with a team of 201-500 employees. The company is currently in the growth stage.