
Elasticsearch Observability Engineer Jobs (NOW HIRING)

Site Reliability Engineer Tech Lead

Dallas, TX · On-site

$56.75 - $75.25/hr

... (SRE) who will blend software engineering with IT operations to ensure the reliability ... Extensive experience with observability and monitoring platforms (Elasticsearch Observability ...

Site Reliability Engineer Tech Lead

Dallas, TX · On-site

$56.50 - $75/hr

... (SRE) who will blend software engineering with IT operations to ensure the reliability ... Extensive experience with observability and monitoring platforms (Elasticsearch Observability ...

Design, deploy, and maintain Elastic Stack environments, including Elasticsearch, Kibana, Logstash ... Solid understanding of observability, logging, metrics, and distributed systems. * Experience ...



Elasticsearch Observability Engineer information

How does an Elasticsearch Observability Engineer typically collaborate with development and operations teams?

As an Elasticsearch Observability Engineer, you frequently partner with both development and operations teams to design and implement monitoring solutions using the Elastic Stack. You'll help developers instrument applications for better traceability and support operations in troubleshooting and optimizing system performance. Regular communication is key, as you'll often lead workshops, create dashboards, and respond to incidents collaboratively. This cross-functional teamwork ensures observability solutions align with organizational goals and provide actionable insights.

What does an Elasticsearch Observability Engineer do?

An Elasticsearch Observability Engineer is responsible for designing, implementing, and maintaining observability solutions using the Elastic Stack (Elasticsearch, Logstash, Kibana, and Beats). Their primary role is to ensure that systems are monitored effectively, logs and metrics are collected and analyzed, and issues are detected and diagnosed quickly. They collaborate with development and operations teams to build dashboards, set up alerts, and optimize performance for monitoring infrastructure and applications. These engineers play a key role in improving system reliability and supporting incident response.

What are the key skills and qualifications needed to thrive as an Elasticsearch Observability Engineer, and why are they important?

To thrive as an Elasticsearch Observability Engineer, you need expertise in Elasticsearch, log management, and data analysis, often supported by a degree in computer science or a related field. Familiarity with observability tools such as Kibana, Logstash, and Beats, along with experience with cloud platforms and scripting languages like Python or Bash, is typically required. Strong problem-solving abilities, attention to detail, and effective communication skills help you stand out in this role. These competencies are vital for ensuring system reliability, quickly detecting issues, and delivering actionable insights for continuous improvement.

What is the difference between an Elasticsearch Observability Engineer and an Elasticsearch Developer?

Aspect | Elasticsearch Observability Engineer | Elasticsearch Developer
Primary Focus | Monitoring, logging, and observability of Elasticsearch clusters and related systems | Developing, customizing, and optimizing Elasticsearch applications and integrations
Skills & Certifications | Knowledge of Elasticsearch, Prometheus, Grafana, scripting, and monitoring tools | Proficiency in Elasticsearch APIs, Java, REST, and development frameworks
Work Environment | Operations teams, DevOps, SREs, cloud environments | Development teams, software engineers, backend developers

While both roles require expertise in Elasticsearch, the Elasticsearch Observability Engineer focuses on system monitoring and ensuring Elasticsearch health, whereas the Elasticsearch Developer concentrates on building and customizing Elasticsearch-based applications. Their skills and daily tasks differ, but both are essential in Elasticsearch-centric environments.

More about Elasticsearch Observability Engineer jobs
Infographic showing various Elasticsearch Observability Engineer job openings in the United States as of June 2026, with employment types broken down into 100% Full Time. Highlights an 84% Physical, 5% Hybrid, and 11% Remote job distribution.
Site Reliability Engineer Tech Lead

Freddie Mac

Dallas, TX • On-site

$56.75 - $75.25/hr

Other

Posted 14 days ago


Key responsibilities

  • Design, implement, and maintain automated solutions to ensure high availability, resiliency, and scalability of applications and services.

  • Collaborate with stakeholders to respond to production incidents, develop protocols to minimize downtime, conduct postmortems, and implement preventive measures.

  • Set up monitoring systems to track performance metrics, address potential issues, and meet system health and performance targets.


Job description

At Freddie Mac, our mission of Making Home Possible is what motivates us, and it's at the core of everything we do. Since our charter in 1970, we have made home possible for more than 90 million families across the country. Join an organization where your work contributes to a greater purpose.
Position Overview:
At Freddie Mac, you will do important work to build a better housing finance system, and you'll be part of a team helping to make rental housing more accessible and affordable across the nation.
The Technology & Operational Risk department within the Multifamily (MF) division is seeking a Site Reliability Engineer (SRE) who will blend software engineering with IT operations to ensure the reliability, availability, scalability, and performance of key systems, services, and environments.
Your Impact:
  • System Reliability: Design, implement, and maintain automated solutions to ensure high availability, resiliency, and scalability of applications and services.
  • Incident Management: Collaborate with stakeholders to respond to production incidents, develop protocols to minimize downtime, conduct postmortems, and implement preventive measures to avoid recurrence.
  • Monitoring & Observability: Set up monitoring systems to track performance metrics, meeting system health and performance targets and addressing potential issues before they impact users.
  • Performance Optimization: Analyze system performance, identify bottlenecks, and optimize for speed, scalability, and resource utilization.
  • Automation: Leverage automation tools to reduce manual interventions in application management tasks and ensure efficiency, repeatability, and minimal human error.
  • Collaboration: Work closely with stakeholders to support new features, deployments, and compliance initiatives.
  • Capacity Planning: Forecast resource needs and plan for future growth to ensure system stability and scalability.
  • Documentation: Create and maintain up-to-date documentation for systems, processes, and troubleshooting procedures.
  • Continuous Improvement: Exhibit the intellectual curiosity to continuously learn emerging technologies and practices in order to design and deliver best-of-breed solutions for MF Technology.

Qualifications:
  • Proven expertise in designing, developing, and maintaining automation frameworks for application operations, including infrastructure provisioning, deployment pipelines, monitoring, and incident response, using tools such as Ansible, Terraform, Jenkins, and related technologies.
  • Extensive experience with observability and monitoring platforms (Elasticsearch Observability, Elasticsearch APM, OpenTelemetry), with a focus on automating system health checks, alerting, and root cause analysis.
  • Strong proficiency in programming and scripting languages (e.g., Python, Go, Bash, Java), with a track record of automating repetitive operational tasks and building self-healing solutions.
  • Hands-on experience with cloud infrastructure (AWS, Azure, Google Cloud Platform) and container orchestration (Docker, Kubernetes, EKS), including automated provisioning, scaling, and recovery of resources.
  • Demonstrated ability to lead and implement transformative initiatives that reduce manual toil, streamline operational workflows, and drive continuous improvement in reliability and efficiency.
  • Experience with CI/CD tools and configuration management for fully automated build, test, and deployment pipelines.
  • Deep understanding of SRE principles such as SLIs, SLOs, error budgets, and applying automation to enforce and improve these metrics.
  • Experience with data management platforms and automation of data workflows (e.g., MongoDB, Snowflake, SQL, Dremio, Qlik Replicate).
  • Familiarity with enterprise job schedulers (Autosys, Control-M) and automation of batch processes and job orchestration.
  • Solid foundation in networking, databases, and distributed systems, with experience automating troubleshooting and recovery procedures.
  • Experience with agile and DevOps cultures, driving adoption of automation best practices across teams.
  • Track record of championing automation-first initiatives that modernize legacy application operations and deliver measurable improvements in reliability, scalability, and team productivity.
  • Ability to mentor and guide teams in adopting automation tools and practices, fostering a culture of continuous improvement and operational excellence.
  • Relevant certifications in cloud, automation, or SRE/DevOps (e.g., AWS DevOps Engineer, Google SRE) are a plus.
  • Bachelor's degree in computer science, information technology, or related field (or equivalent experience).

Keys to Success in this Role:
  • Demonstrate a sense of accountability and ownership to identify and drive areas of improvement.
  • Focus on achieving results, influencing and collaborating with stakeholders to independently deliver desired outcomes.
  • Cultivate and maintain trusted relationships with Multifamily and Enterprise teams.
  • Ability to exhibit clear and persuasive communication skills, capable of conveying complex information and vision for excellence to stakeholders.
  • Ability to work independently, persistently, and collaboratively in a fast-paced environment.
  • Ability to work evenings and weekends as needed.

Current Freddie Mac employees please apply through the internal career site.
We consider all applicants for all positions without regard to gender, race, color, religion, national origin, age, marital status, veteran status, sexual orientation, gender identity/expression, physical and mental disability, pregnancy, ethnicity, genetic information or any other protected categories under applicable federal, state or local laws. We will ensure that individuals are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
A safe and secure environment is critical to Freddie Mac's business. This includes employee commitment to our acceptable use policy, applying a vigilance-first approach to work, supporting regulatory mandates, and using best practices to protect Freddie Mac from potential threats and risk. Employees exercise this responsibility by executing against policies and procedures and adhering to privacy & security obligations as required via training programs.
CA Applicants: Qualified applications with arrest or conviction records will be considered for employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.
Notice to External Search Firms: Freddie Mac partners with BountyJobs for contingency search business through outside firms. Resumes received outside the BountyJobs system will be considered unsolicited and Freddie Mac will not be obligated to pay a placement fee. If interested in learning more, please visit and register with our referral code: MAC.
Time-type: Full time
FLSA Status: Exempt
Freddie Mac offers a comprehensive total rewards package to include competitive compensation and market-leading benefit programs. Information on these benefit programs is available on our Careers site.
This position has an annualized market-based salary range of $145,000 - $217,000 and is eligible to participate in the annual incentive program. The final salary offered will generally fall within this range and is dependent on various factors including but not limited to the responsibilities of the position, experience, skill set, internal pay equity and other relevant qualifications of the applicant.


About Freddie Mac

Sourced by ZipRecruiter

Today, Freddie Mac makes home possible for one in four home borrowers and is one of the largest sources of financing for multifamily housing. Join our smart, creative and dedicated team and you'll do important work for the housing finance system and make a difference in the lives of others.

Industry

Finance and insurance

Company size

5,001 - 10,000 Employees

Headquarters location

McLean, VA, US

Year founded

1970