Senior DevOps Engineer
$134K - $172K/yr
... site reliability, system observability, and operational excellence across the platform. Primary ... JSON, Protobuf, Avro [Required] SQL and NoSQL databases, in-memory data stores [Required] Java ...
New
$134K - $172K/yr
... site reliability, system observability, and operational excellence across the platform. Primary ... JSON, Protobuf, Avro [Required] SQL and NoSQL databases, in-memory data stores [Required] Java ...
New
$134K - $172K/yr
... site reliability, system observability, and operational excellence across the platform. Primary ... JSON, Protobuf, Avro [Required] SQL and NoSQL databases, in-memory data stores [Required] Java ...
New
Riverwoods, IL · Hybrid
$54.75 - $75/hr
... lag Database Operations & Reliability • Manage and validate Automated backups and snapshots ... engineers • Review changes for platform standards and best practices • Collaborate with ...
Riverwoods, IL · Hybrid
$54.75 - $75/hr
... lag Database Operations & Reliability • Manage and validate Automated backups and snapshots ... engineers • Review changes for platform standards and best practices • Collaborate with ...
... queues, and database capacities for high-frequency workspaces. * Monitor critical custom ... Work closely with Engineering and SRE teams to drive rapid remediation of identified issues
... queues, and database capacities for high-frequency workspaces. * Monitor critical custom ... Work closely with Engineering and SRE teams to drive rapid remediation of identified issues
Chicago, IL · Hybrid
The Database Administrator works closely with software developers, cybersecurity personnel ... reliability, scalability, and data integrity. The System Database Administrator will assist in ...
Chicago, IL · Hybrid
The Database Administrator works closely with software developers, cybersecurity personnel ... reliability, scalability, and data integrity. The System Database Administrator will assist in ...
... new reliability engineering capabilities. The ideal candidate has broad and deep technical ... Leverage SQL and NoSQL databases to store, query, and analyze incident data at scale using Azure ...
... new reliability engineering capabilities. The ideal candidate has broad and deep technical ... Leverage SQL and NoSQL databases to store, query, and analyze incident data at scale using Azure ...
$126K - $166K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
$126K - $166K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
Chicago, IL · On-site
$118K - $185K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
Chicago, IL · On-site
$118K - $185K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
Chicago, IL · On-site
$126K - $166K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
Chicago, IL · On-site
$126K - $166K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
$126K - $166K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
$126K - $166K/yr
... NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka. * Experience with distributed computing frameworks, such as Apache Spark. * Experience in implementing SRE ...
$54.50 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
$54.50 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
$54.50 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
$54.50 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
Chicago, IL · On-site
$54.50 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
Chicago, IL · On-site
$54.50 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
$54.25 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
$54.25 - $74.50/hr
... with SRE principles as defined by Google SRE practices (error budgets, toil elimination ... JSON, Protobuf, Avro * [Required] SQL and NoSQL databases, in-memory data stores * [Required] Java ...
Chicago, IL · Hybrid
$121K - $147K/yr
This position is vital for ensuring the security, reliability, and high availability of our ... Write maintainable, testable, efficient code that is clear to engineers unfamiliar with the system ...
Chicago, IL · Hybrid
$121K - $147K/yr
This position is vital for ensuring the security, reliability, and high availability of our ... Write maintainable, testable, efficient code that is clear to engineers unfamiliar with the system ...
Evanston, IL · On-site
$115K - $132K/yr
ITS/82 This will be an SRE role with a focus on maintaining and improving operations of the edge ... Familiarity with basic cloud infrastructure concepts such as time series databases (ex. InfluxDB ...
Evanston, IL · On-site
$115K - $132K/yr
ITS/82 This will be an SRE role with a focus on maintaining and improving operations of the edge ... Familiarity with basic cloud infrastructure concepts such as time series databases (ex. InfluxDB ...
Identify system deficiencies and implement solutions that improve reliability and efficiency ... Experience with Azure Data Factory, Azure Data Lakes, and Azure DevOps preferred * Familiarity with ...
New
Identify system deficiencies and implement solutions that improve reliability and efficiency ... Experience with Azure Data Factory, Azure Data Lakes, and Azure DevOps preferred * Familiarity with ...
New
Chicago, IL · On-site
$132K - $177K/yr
What You'll Need: * 2+ years of full-time, professional experience in a DevOps, SRE, or similar ... Experience with containerization technologies, database administration, and observability best ...
Chicago, IL · On-site
$132K - $177K/yr
What You'll Need: * 2+ years of full-time, professional experience in a DevOps, SRE, or similar ... Experience with containerization technologies, database administration, and observability best ...
$132K - $177K/yr
What You'll Need: * 2+ years of full-time, professional experience in a DevOps, SRE, or similar ... Experience with containerization technologies, database administration, and observability best ...
$132K - $177K/yr
What You'll Need: * 2+ years of full-time, professional experience in a DevOps, SRE, or similar ... Experience with containerization technologies, database administration, and observability best ...
Chicago, IL · On-site
$110K - $145K/yr
... reliability efforts. * Familiarity with supporting Postgres and MySQL servers, database management, and query syntax. * Experience with Linux system administration and troubleshooting. Technical ...
Chicago, IL · On-site
$110K - $145K/yr
... reliability efforts. * Familiarity with supporting Postgres and MySQL servers, database management, and query syntax. * Experience with Linux system administration and troubleshooting. Technical ...
... and reliability across channels • Mentor engineers through pairing, design reviews, and ... databases, ensuring data integrity and performance. • Implement secure authentication and ...
... and reliability across channels • Mentor engineers through pairing, design reviews, and ... databases, ensuring data integrity and performance. • Implement secure authentication and ...
$59.1K - $66.2K
0% of jobs
$66.2K - $73.2K
2% of jobs
$73.2K - $80.3K
3% of jobs
$80.3K - $87.3K
8% of jobs
$87.3K - $94.3K
7% of jobs
$101.4K is the 25th percentile. Wages below this are outliers.
$94.3K - $101.4K
5% of jobs
$101.4K - $108.4K
4% of jobs
$108.4K - $115.5K
3% of jobs
$115.5K - $122.5K
2% of jobs
The median wage is $124.2K / yr.
$122.5K - $129.6K
63% of jobs
$129.6K - $136.6K
2% of jobs
$59.1K
$114.3K
$136.6K
A Database Reliability Engineer (DBRE) is responsible for ensuring the reliability, scalability, and performance of database systems. They apply software engineering and SRE principles to database management, focusing on automation, monitoring, and incident response. DBREs work closely with development and operations teams to optimize queries, design resilient architectures, and prevent downtime. Their goal is to create efficient database systems that support business-critical applications with minimal disruptions.
To thrive as a Database Reliability Engineer, you need expertise in database architectures, performance tuning, and troubleshooting, often supported by a degree in computer science or related fields. Familiarity with database systems like MySQL, PostgreSQL, MongoDB, as well as automation tools and cloud platforms, is highly valued, and certifications such as AWS Certified Database – Specialty can be advantageous. Attention to detail, proactive problem-solving, and effective communication skills are crucial for success in this role. These abilities ensure reliable data infrastructure, minimize downtime, and facilitate smooth collaboration across technical teams.
As a Database Reliability Engineer, your daily tasks will often include monitoring database performance, proactively identifying and resolving issues, automating routine maintenance, and ensuring system security and backups. You'll collaborate closely with software developers, DevOps engineers, and IT teams to troubleshoot incidents and implement improvements that enhance database reliability and scalability. Your day may also involve creating documentation, planning for disaster recovery, and participating in on-call rotations. This dynamic role requires both hands-on technical work and teamwork to maintain seamless database operations in fast-paced environments.
Job Title: Lead Associate Principal, Software Engineering: DevOps
Position Type: Fulltime
Location: Chicago, IL (Onsite from Day 1, Hybrid Model 3 Days Onsite)
What You'll Do
Successful candidate will collaborate with various product, infrastructure, operations, security, and production control teams to elicit and fulfill technical requirements, while driving site reliability, system observability, and operational excellence across the platform.
Primary Duties and Responsibilities:
Guides the implementation using CI/CD pipelines in Kubernetes environment
Directs review, configuration, and execution of Terraform and Ansible automation pipelines delivered by product teams
Guides the setup of common infrastructure platforms like multi-region Kubernetes and Kafka clusters
Elicits requirements for application deployment and sizing to manage expected workloads
Defines and enforces Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets in collaboration with product teams
Leads blameless post-mortems and drives resolution of action items to reduce repeat incidents
Designs and implements observability frameworks covering metrics, logs, and distributed tracing across all platform services
Drives toil reduction initiatives by identifying and automating repetitive operational work
Partners with product teams to embed reliability requirements and non-functional requirements (NFRs) early in the software development lifecycle
Monitors application performance and tunes systems working with product teams
Confers with product team leads and practitioners to create deployment and reliability plans
Confers with Enterprise Architecture and Renaissance architecture teams to devise implementation architecture
Promotes standards across application configuration towards the highest security posture
Collaborates with access management and security teams on setting up roles and permissions using least privilege strategies
Collaborates with integration/performance testing teams to leverage integrated release testing in the Release Acceptance environment
Collaborates with production controls teams on monitoring, failover, logging, and alerting strategies
Owns and continuously improves incident response runbooks, on-call rotations, and escalation procedures
Conducts capacity planning and load forecasting to proactively address scalability needs
Implements and validates infrastructure failover scenarios
Confers with Network team on all connectivity plans and issue resolution (including between on-premises and AWS)
Follows and enables program-level agile practices for efficient collaboration and delivery
Develops documentation for ORT technical infrastructure, architecture, and reliability support
Qualifications:
[Required] Understanding of Kanban and/or Agile methodologies
[Required] Familiarity with SRE principles as defined by Google SRE practices (error budgets, toil elimination, reliability hierarchy)
[Required] Able to succeed in a fast-paced environment with frequent changes
[Required] Comfortable communicating with both technical and non-technical audiences
[Required] Self-starter takes initiative to research, learn, and deliver; anticipates the play
[Required] Team player humble, collaborative, and focused on making the entire team succeed
Technical Skills & Background
[Required] AWS EC2, Kubernetes, Kafka, Jenkins, Terraform, Ansible, Hashicorp Vault
[Required] Observability tooling such as Prometheus, Grafana, OpenTelemetry, Datadog, or equivalent
[Required] Incident management platforms and on-call tooling (e.g., PagerDuty, OpsGenie)
[Required] Microservices and streaming data-intensive application architecture
[Required] Application architecture, networking, and security in the cloud
[Required] Setting up platforms in AWS for high-performance requirements
[Required] Broad experience in API-based development
[Required] Git and Artifactory for sourcing artifacts
[Required] Multi-AZ, multi-region failover architecture
[Required] Chaos engineering principles and tooling (e.g., Chaos Monkey, Gremlin, LitmusChaos)
[Required] Fluent with different data formats and structures: JSON, Protobuf, Avro
[Required] SQL and NoSQL databases, in-memory data stores
[Required] Java/Python/Scala/Golang software development
[Required] Two or more of the following: web/mobile application development, Unix/Linux environments, event-driven systems, transaction processing systems, distributed and parallel systems, large software system development, security software development, public-cloud platforms
[Required] Fluent in industry best practices, software patterns, and architecture principles
[Required] Enterprise architecture frameworks such as TOGAF
[Required] Ability to define and document architecture strategies, designs, and requirements across all enterprise architecture domains
[Required] Ability to define service-based, component architectures and demonstrate visualization of enterprise architecture concepts
Certifications
[Preferred] AWS Certified Solutions Architect / DevOps Engineer
[Preferred] Kubernetes, Kafka certification
[Preferred] Google Cloud Professional Site Reliability Engineer or equivalent SRE-focused certification
[Preferred] Project/program management certifications
Education & Training
[Required] BS degree in Computer Science, similar technical field, or equivalent experience
[Required] 7+ years of experience building large-scale, data-centric solutions
[Required] 7+ years of recent experience participating on a DevOps or SRE team, or as product owner for such a team