Remote Observability Engineer Jobs in Raleigh, NC

Senior ITSMA Observability Engineer

Raleigh, NC · On-site +1

$101K - $139K/yr

HedgeServ supports employees through a variety of offerings, including remote and hybrid working ... The Senior ITSMA Observability Engineer is responsible for the design and development of the ...

Senior Platform Engineer

Apex, NC · On-site +1

$80K - $109K/yr

Senior Platform Engineer (AWS / Kubernetes) Remote (United States) Security Journey is hiring a ... Drive observability improvements using DataDog, including dashboards, alerting, and incident ...

Principal AWS Cloud Engineer

Raleigh, NC · On-site +1

$54.25 - $72.50/hr

We are open to hiring on a remote basis in the United States, with the understanding that the ... Experience with observability and SRE practices, including monitoring, logging, alerting, incident ...

Lead DevOps Engineer

Raleigh, NC · Remote

$54 - $74/hr

Drive adoption of observability tools like DataDog and establish logging standards * Coordinate ... Remote * Contract or B2B arrangement Our values We are a company that seeks the best for both our ...

DevOps Engineer

Raleigh, NC · Remote

$54 - $74/hr

Help implement and maintain observability solutions for monitoring system performance and ... We offer a hybrid work schedule to perfectly combine the benefits of remote work and the essential ...

Software Engineer

Raleigh, NC · On-site +1

$135K - $154K/yr

Monitor and optimize system observability and performance using Splunk and Grafana, executing load ... For positions with Remote-US locations, the actual salary range for the position may differ based ...

Senior AI/ML Engineer

Raleigh, NC · On-site +1

$101K - $139K/yr

Remote/Hybrid: This role is based remotely but if you live within a 50-mile radius of Sunnyvale, CA ... Experience with A/B testing and telemetry/observability systems to measure impact and reliability.

Senior AI Systems Engineer

Raleigh, NC · On-site +1

$92K - $126K/yr

Maintain observability across AI systems through logging, metrics, performance monitoring, alerting ... This position may be performed fully remote, hybrid, or onsite at an ARA office. Preference will be ...

Machine Learning & Operations Engineer

Durham, NC · Remote

$71K - $96K/yr

This is a fully remote position, working cross-functionally with research and engineering teams ... management, and observability. * Improve system reliability, performance, and security.

Machine Learning & Operations Engineer

Durham, NC · Remote

$67K - $90K/yr

This is a fully remote position, working cross-functionally with research and engineering teams ... management, and observability. * Improve system reliability, performance, and security.

Machine Learning & Operations Engineer

Durham, NC · Remote

$67K - $90K/yr

This is a fully remote position, working cross-functionally with research and engineering teams ... management, and observability. * Improve system reliability, performance, and security.

Senior Database Engineer

Raleigh, NC · Remote

$130K - $155K/yr

This role is a remote position open to applicants based in Canada and USA. What You'll Do ... Familiarity with observability and monitoring: * Metrics, alerting, and performance dashboards ...

Manager, Data Engineer (Remote)

Raleigh, NC · Remote

$100K - $174K/yr

You will also guide engineers and reinforce strong delivery practices, while advancing the team ... Data quality and observability practices (testing, reconciliation, monitoring) Analytics & AI ...

Remote Observability Engineer information

Raleigh, NC salary details: $36.9K, $112.6K (average), $186.2K

How much do remote observability engineer jobs pay per year?

As of Jun 12, 2026, the average yearly pay for remote observability engineers in Raleigh, NC is $112,630, according to ZipRecruiter salary data. Most workers in this role earn between $80,700 and $147,300 per year, depending on experience, location, and employer.

What are the typical collaboration patterns for a Remote Observability Engineer working with distributed teams?

Remote Observability Engineers frequently collaborate with software developers, DevOps teams, and IT operations to ensure systems are monitored effectively and issues are detected early. Working remotely, you'll often use communication tools like Slack, Jira, and video conferencing to coordinate incident response, discuss monitoring strategies, and review system health dashboards. Regular sync meetings and asynchronous updates are common, and you'll likely contribute to documentation and knowledge sharing to keep all stakeholders informed. Building strong communication habits is important, as much of the troubleshooting and improvement work hinges on clear coordination with multiple teams.

What are the key skills and qualifications needed to thrive as a Remote Observability Engineer, and why are they important?

To thrive as a Remote Observability Engineer, you need strong expertise in monitoring, logging, and tracing systems, along with a background in computer science or related technical fields. Familiarity with tools like Prometheus, Grafana, ELK Stack, Datadog, and cloud platforms is typically required, as well as relevant certifications such as AWS Certified Cloud Practitioner or Google Cloud Professional DevOps Engineer. Excellent problem-solving abilities, communication skills, and a proactive mindset help you detect and resolve issues before they impact users. These competencies ensure system reliability, enable rapid incident response, and support seamless collaboration in distributed environments.

What is the difference between a Remote Observability Engineer and a Site Reliability Engineer?

Aspect | Remote Observability Engineer | Site Reliability Engineer
Credentials | Knowledge of monitoring tools, scripting, cloud platforms | Same as Observability Engineer, plus SRE certifications often preferred
Work Environment | Focus on monitoring, logging, and tracing systems remotely | Broader scope including system reliability, incident response, and automation
Industry Usage | Primarily in tech, SaaS, cloud services | Widely in tech, finance, and large-scale online services

The Remote Observability Engineer specializes in monitoring and analyzing system performance remotely, focusing on signals such as logs and metrics. In contrast, the Site Reliability Engineer has a broader role, ensuring overall system reliability, automation, and incident management. While both roles require similar technical skills, SREs often have additional responsibilities related to system resilience and scalability.

What is a Remote Observability Engineer?

A Remote Observability Engineer is a professional responsible for designing, implementing, and maintaining systems that monitor the health, performance, and reliability of software applications and infrastructure from a remote location. They use observability tools to collect and analyze logs, metrics, and traces, helping organizations quickly detect and resolve issues. Their work ensures that distributed systems are transparent, reliable, and efficient, often collaborating with development, operations, and security teams. Remote Observability Engineers often work from anywhere, leveraging cloud-based tools and platforms to manage complex IT environments.

Senior ITSMA Observability Engineer

HedgeServ

Raleigh, NC • On-site, Remote

$101K - $139K/yr

Other

Posted 14 days ago


Job description

At HedgeServ, we're redefining what's possible in fund administration. With more than $700 billion in assets under administration, we partner with the world's most forward-thinking investment managers - across private equity, private credit, endowments, hedge funds and more - to deliver seamless, tech-enabled solutions that drive performance.

Our proprietary platform, enhanced by machine learning and robotic process automation, gives clients real-time insights and unmatched control over their operations. Alongside our technology, we offer award-winning service through our team-based approach, led by a deeply experienced team of industry experts. Our solutions span the full investment lifecycle, including fund accounting, middle office, risk, compliance, tax, and investor services.

We're a future-focused company, empowering our people through a robust career development framework, clear career trajectories with structured learning paths, training, and progression plans. We invest in leadership development and in our collaborative culture, creating space for talent to grow. Our corporate values - Relationships, Support, Innovation, and Expertise - create a sense of shared purpose and belonging, and we recognize our employees sit at the core of our success. We continue to innovate and evolve through our employees, working together to achieve our shared vision and mission.

HedgeServ supports employees through a variety of offerings, including remote and hybrid working arrangements, and fully paid comprehensive health and well-being benefits. We've been recognized as an employer of choice, earning a top 100 workplaces designation.

Founded in 2008, HedgeServ has grown into a global organization with over 2,000 experts across the globe, with offices in the United States, Grand Cayman, Ireland, Poland, Bulgaria, Luxembourg, the Philippines, and Australia. We've earned numerous accolades, including Top Overall Administrator, along with #1 rankings for providing alternative asset services in Accounting, Technology, Client Service, Investor Services, Alternative Fund Expertise, Reporting, and Regulatory Expertise.

Job Description

The Senior ITSMA Observability Engineer is responsible for the design and development of the Elastic and Prometheus stacks, as well as the AWS observability tools, that monitor and manage critical applications and infrastructure at HedgeServ. As an important member of the ITSMA Monitoring and Analytics Team, the Senior Engineer will be responsible for the operation and design of the portfolio of tools, which includes alerting mechanisms and escalation, dashboards, and the overall framework to support the management of HedgeServ's infrastructure, systems, and applications. Additionally, this role entails leading IT infrastructure monitoring projects, managing vendors, and handling daily operations with SME (Subject Matter Expert) escalation support as needed. The successful applicant should be able to collaborate with various IT teams to gather requirements and develop solutions using existing monitoring capabilities or customized monitors (scripts).

Role Responsibilities

The Senior ITSMA Observability Engineer will collaborate with the ITSMA Monitoring and Analytics Team to design, build, secure, maintain, optimize, and document solutions utilizing Elastic Cloud Stack and AWS-managed Prometheus.

  • Proficiency with Elasticsearch, Logstash, Kibana, Beats, APM with X-Pack, Prometheus, Grafana, AWS CloudWatch, and other observability tools
  • Experience with OTEL Collectors
  • Engage closely with application owners, engineers, and development teams to evaluate requirements, architect, and support an Elasticsearch Stack solution, as well as structure queries to enhance system performance and efficiency
  • Design and configure ETL data pipelines using Elastic Common Schema for onboarding application logs and metrics
  • Configure index templates and manage data lifecycle (ILM) for effective data retention
  • Develop Ansible playbooks for automated deployment of Beat agents across on-premises and AWS systems; utilize Terraform for safe management of production infrastructure, employing methodologies such as Infrastructure as Code within AWS environments
  • Create Elastic alerting solutions via Watcher and Kibana Alerts integrated with existing ticketing tools and MS Teams
  • Develop Machine Learning jobs to dynamically monitor and provide alerts based on specific metrics and KPIs
  • Build Elastic and AWS observability AI solutions that enable infrastructure engineering and operations teams to address production issues efficiently
  • Adhere to lifecycle processes for transitioning solutions from Development to QA to Production
  • Actively participate in collaborative group sessions, attend agile sprint daily meetings, and share progress to ensure solution development aligns with organizational requirements

Pre-Requisite Knowledge, Skills and Experience

  • Technical Degree in Information Technology
  • Experience with Elastic Cloud and AWS Managed Prometheus
  • Knowledge of installation, system tasks, data collection, network troubleshooting, data pipelines, and cluster administration
  • Proficient in Python, Bash, PowerShell, Painless, and other scripting languages
  • Extensive ELK Stack expertise, including Elasticsearch, Logstash, Kibana, Beats, Machine Learning, APM, X-Pack, and REST API integration
  • Skilled in evaluating and tuning Elastic clusters, configurations, indexing, search performance, security, and administration
  • Proficient with Prometheus, Grafana, AWS observability tools, and their performance, security, and management
  • Experienced with security integrations (Windows SAML, LDAP, Kerberos) in Elasticsearch
  • Adept with AWS services and container tooling: CloudWatch, CloudTrail, Lambda, Kubernetes, Docker
  • Experienced in integrating Elastic alerting with third-party ticketing tools
  • Experienced in implementing and integrating observability AI agents and frameworks for automated analysis, incident detection, and proactive resolution across complex systems