1

Linux Site Reliability Engineer Jobs in Wisconsin

Engineer Senior Lead, Cloud Infrastructure

Milwaukee, WI · On-site +1

$106.90K - $145.30K/yr

Knowledge of Site Reliability Engineering (SRE) and resilience engineering practices. * Experience with multicloud or hybrid cloud environments. * Familiarity with cloud security frameworks and ...

Background in developer tooling, platform engineering, or SRE/DevOps with an understanding of reliability principles applied to non-deterministic systems. Familiarity with multiple LLM providers and ...

AgenticOps Engineer

WI · On-site +1

... or SRE/DevOps with an understanding of reliability principles applied to non-deterministic systems. • Familiarity with multiple LLM providers and models; able to reason about trade-offs in ...

AI Platform Engineer

Madison, WI · Remote

$60 - $85/hr

Experience: 7+ years of experience in platform engineering, DevOps/SRE, cloud infrastructure, or software engineering, with at least 3 years supporting AI/ML platforms. Technical Skills: Hands-on ...

Firmware Embedded Linux Engineer

Waukesha, WI · On-site

$103.10K - $141.10K/yr

Firmware Embedded Linux Engineer Job Location: Waukesha, WI - Locals only Interview Mode- F2F Job ... Develop software which meets rigorous quality, reliability, performance, and testability ...

next page

Showing results 1-20

Linux Site Reliability Engineer information

What are the key skills and qualifications needed to thrive as a Linux Site Reliability Engineer, and why are they important?

To thrive as a Linux Site Reliability Engineer, you need deep expertise in Linux system administration, scripting (such as Bash or Python), and a solid understanding of networking concepts, usually backed by a computer science degree or equivalent experience. Familiarity with configuration management tools (like Ansible, Puppet, or Chef), containerization (Docker, Kubernetes), and cloud platforms (AWS, GCP, or Azure) is typically required, along with relevant certifications like RHCE or AWS Certified SysOps Administrator. Strong problem-solving skills, effective communication, and the ability to work under pressure are crucial soft skills for this role. These competencies ensure the reliability, scalability, and security of complex infrastructure, minimizing downtime and supporting seamless operations.

What are some common challenges faced by Linux Site Reliability Engineers when scaling infrastructure, and how can they be addressed?

Linux Site Reliability Engineers often encounter challenges related to maintaining system stability and performance as infrastructure scales. Issues such as configuration drift, automation bottlenecks, and monitoring gaps can arise when managing numerous servers or services. Addressing these challenges typically involves implementing robust configuration management tools, investing in automated deployment pipelines, and enhancing observability through comprehensive monitoring and alerting solutions. Collaboration with development and operations teams is essential to ensure that scalability solutions align with business needs and technical requirements.

What is a Linux Site Reliability Engineer?

A Linux Site Reliability Engineer (SRE) is an IT professional responsible for ensuring the reliability, scalability, and performance of systems running on the Linux operating system. They bridge the gap between software development and operations by automating processes, monitoring infrastructure, and managing incidents. Linux SREs focus on system availability, building tools for deployment and monitoring, and improving system robustness through best practices and automation. Their work helps organizations deliver reliable online services and quickly recover from outages or system failures.

What is the difference between Linux Site Reliability Engineer vs Linux DevOps Engineer?

AspectLinux Site Reliability EngineerLinux DevOps Engineer
CredentialsLinux certifications, SRE-specific trainingLinux certifications, DevOps tools certifications
Work EnvironmentFocus on system reliability, monitoring, incident responseFocus on automation, CI/CD pipelines, deployment
Employer & IndustryTech companies, cloud providers, large enterprisesStartups, tech firms, software development teams
Search & Comparison IntentUnderstanding reliability roles, incident managementAutomation, deployment, continuous integration

While both roles involve Linux expertise, a Linux Site Reliability Engineer primarily focuses on maintaining system reliability, monitoring, and incident response. In contrast, a Linux DevOps Engineer emphasizes automation, continuous integration, and deployment processes. Both roles require Linux skills and often overlap, but their core responsibilities differ based on organizational needs.

What are popular job titles related to Linux Site Reliability Engineer jobs in Wisconsin? For Linux Site Reliability Engineer jobs in Wisconsin, the most frequently searched job titles are:
What job categories do people searching Linux Site Reliability Engineer jobs in Wisconsin look for? The top searched job categories for Linux Site Reliability Engineer jobs in Wisconsin are:
What cities in Wisconsin are hiring for Linux Site Reliability Engineer jobs? Cities in Wisconsin with the most Linux Site Reliability Engineer job openings:
Elastic Observability Engineer - PLEX

Elastic Observability Engineer - PLEX

Rockwell Automation

Milwaukee, WI • Hybrid

Full-time

Medical, Dental, Vision, Retirement, PTO

Posted 2 days ago


Rockwell Automation rating

7.9

Company rating: 7.9 out of 10

Based on 32 frontline employees who took The Breakroom Quiz

154th of 415 rated machine equipment manufacturers


Job description

Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile. With more than 28,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility -our people are energized problem solvers that take pride in how thework we do changes the world for the better.

We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work. And if that's you we would love to have you join us!

Job Description

Position Summary:

The Elastic Observability Engineer is an important member of our Cloud Operations team, building a world-class Application Performance Monitoring solution to support observability-driven development. You will handle alert response, triage, maintenance, expansion and development to ensure monitoring systems remain healthy and reliable.

Your Responsibilities:

Operational Support & Triage

  • Respond to alerts, ingestion issues, and userreported incidents.
  • Perform initial diagnosis, document findings, and resolve or escalate as needed.
  • Monitor and troubleshoot Elasticsearch clusters, pipelines, Elastic Agent, Logstash, and Fleet.
  • Participate in the oncall rotation.

Maintenance & Routine Operations

  • Perform service restarts, ingestion validation, and configuration updates.
  • Apply Linux/Unix patches and perform basic OS maintenance.
  • Conduct regular cluster health, capacity, and LVM/storage checks.
  • Support dashboard upkeep and proactive monitoring.

Data Quality & Pipeline Support

  • Verify log, metric, and trace data completeness and ECS alignment.
  • Perform periodic data validation.
  • Document known issues, parsing gaps, and operational patterns.

Dashboards, Visualization & Search

  • Use and maintain Kibana dashboards for operational visibility.
  • Create dashboards and visualizations to support platform monitoring.
  • Use KQL or Lucene queries to validate data and investigate issues.

Collaboration

  • Provide actionable escalation details to SRE, DevOps, and Security teams.
  • Maintain runbooks, SOPs, and troubleshooting guides.
  • Communicate effectively during incidents and followups.
The Essentials - You Will Have:
  • Bachelor's Degree or equivalent years of relevant work experience
  • Legal authorization to work in the US is required- we will not sponsor individuals for employment visas, not now or in the future, for this job opening
The Preferred - You Might Also Have:
  • Typically requires 2+ years of relevant experience in observability, monitoring, logging, or IT operations.
  • Linux/Unix: Handson administration, patching, networking basics, LVM, and troubleshooting.
  • Windows: Basic OS administration experience.
  • Elastic Stack: Working knowledge of Elasticsearch, Kibana, Fleet, Elastic Agent, APM, Logstash, architecture basics, and core troubleshooting.
  • Observability: Understanding of monitoring practices, observability concepts, dashboards, and visualization creation.
  • Scripting & Automation: Bash scripting, familiarity with Ansible, and experience with Python or PowerShell.
  • Soft Skills: problemsolving, ability to work in a fastpaced environment, and selfstarter mindset.
  • Linux/Unix engineering experience; Windows engineering experience.
  • OpenTelemetry (OTEL) or APM experience.
  • Knowledge of Ansible, Terraform, Rabbit MQ, Docker, Kubernetes, AZDO and CI/CD.
  • Guestlevel experience with VMware or Azure.
  • Project management exposure.
What We Offer:
  • Health Insurance including Medical, Dental and Vision
  • 401k
  • Paid Time off
  • Parental and Caregiver Leave
  • Flexible Work Schedule where you will work with your manager to enjoy a work schedule that can be flexible with your personal life.
  • To learn more about our benefits package, please visit at www.raquickfind.com.

This position is part of a job family. Experience will be the determining factor for position level and compensation.

At Rockwell Automation we are dedicated to building a diverse, inclusive and authentic workplace, so if you're excited about this role but your experience doesn't align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right person for this or other roles.

#LI-MG4

#LI-Hybrid

#LifeAtROK

We are an Equal Opportunity Employer including disability and veterans.

If you are an individual with a disability and you need assistance or a reasonable accommodation during the application process, please contact our services team at +1 (844) 404-7247.

Rockwell Automation's hybrid policy aligns that employees are expected to work at a Rockwell location at least Mondays, Tuesdays, and Thursdays unless they have a business obligation out of the office.


What Rockwell Automation employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Rockwell Automation logo

About Rockwell Automation

Sourced by ZipRecruiter

Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile. With more than 25,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.

Industry

Industrial automation equipment manufacturing

Company size

10,000+ Employees

Headquarters location

Milwaukee, WI, US

Year founded

1903

Social media