1

Etcd Jobs (NOW HIRING)

Senior XMPP DevOps Engineer

Tempe, AZ · On-site

$126K - $162K/yr

Experience with Nginx, ETCD in production deployment and troubleshooting * Familiar with AWS technology including Elastic Search, Elastic cache, DynamoDB, SQS and S3 * Understand gitops and familiar ...

Senior XMPP DevOps Engineer

Tempe, AZ · On-site

$126K - $162K/yr

Experience with Nginx, ETCD in production deployment and troubleshooting * Familiar with AWS technology including Elastic Search, Elastic cache, DynamoDB, SQS and S3 * Understand gitops and familiar ...

Deep understanding of Kubernetes architecture, control plane, etcd, networking, and storage. * Experience designing and managing high availability Kubernetes environments. * Strong Linux ...

Senior Web DevOps Engineer

Tempe, AZ · On-site

$126K - $162K/yr

Experience with Nginx, ETCD in production deployment and troubleshooting * Experience working with AWS services, such as Dynamodb, RDS, S3, Route53, etc. * Experience with Source Code Management ...

Senior Web DevOps Engineer

Tempe, AZ · On-site

$126K - $162K/yr

Experience with Nginx, ETCD in production deployment and troubleshooting * Experience working with AWS services, such as Dynamodb, RDS, S3, Route53, etc. * Experience with Source Code Management ...

Java Architect

New York, NY

$69 - $93/hr

... Etcd, Consul, Zookeeper, Curator, Eureka etc preferred. => Experience in working with Docker container, Kubernetes preferred. => Experience utilizing IaaS and PaaS from Amazon AWS or Google Cloud ...

next page

Showing results 1-20

Etcd information

See salary details

$18

$60

$104

How much do etcd jobs pay per hour?

As of Jun 5, 2026, the average hourly pay for etcd in the United States is $60.15, according to ZipRecruiter salary data. Most workers in this role earn between $48.08 and $72.84 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as an Etcd Administrator, and why are they important?

To thrive as an Etcd Administrator, you need a solid understanding of distributed systems, networking, and proficiency in managing Linux environments, often backed by a degree in computer science or relevant certifications. Familiarity with container orchestration platforms like Kubernetes, command-line tools, and monitoring systems such as Prometheus is typically required. Strong problem-solving skills, attention to detail, and effective communication are critical soft skills for this role. These abilities are essential to ensure high availability, consistency, and reliability of the distributed key-value store that underpins many critical infrastructure components.

What are some common challenges faced by professionals working with etcd in a production environment?

Professionals managing etcd in production often face challenges related to maintaining cluster health and ensuring data consistency, especially as the system scales. Network partitions, slow disk I/O, and improper configuration can lead to issues such as split-brain scenarios or degraded performance. Regular monitoring, understanding best practices for cluster sizing, and being prepared for disaster recovery are crucial for smooth operations. Collaboration with DevOps, infrastructure, and application teams is also essential to align deployment strategies and incident response plans.

What is an Etcd administrator and what do they do?

An Etcd administrator is responsible for managing and maintaining Etcd clusters, which are distributed key-value stores commonly used for configuration management and service discovery in cloud-native environments like Kubernetes. Their duties include setting up Etcd clusters, monitoring performance, performing backups and restores, ensuring high availability, and securing access to the data. They may also troubleshoot issues, upgrade cluster versions, and optimize the store for scalability and reliability. Etcd administrators play a crucial role in ensuring the stability and consistency of systems that depend on reliable data storage and retrieval.

What is the difference between Etcd vs Kubernetes Administrator?

AspectEtcdKubernetes Administrator
Primary RoleDistributed key-value store for configuration data and service discoveryManaging, deploying, and maintaining Kubernetes clusters
Required SkillsKnowledge of distributed systems, etcd architecture, security, and troubleshootingKubernetes architecture, cluster management, networking, and security
Work EnvironmentDevOps, cloud infrastructure, containerized environmentsDevOps, cloud platforms, container orchestration
CertificationsNone specific, but related to cloud and DevOps certificationsKubernetes certifications (CKA, CKAD)

While both roles are essential in cloud-native environments, Etcd focuses on maintaining a reliable distributed key-value store, whereas a Kubernetes Administrator manages entire Kubernetes clusters. Understanding Etcd is crucial for Kubernetes Administrators, but their responsibilities extend beyond Etcd to include cluster deployment, scaling, and security.

Infographic showing various Etcd job openings in the United States as of May 2026, with employment types broken down into 88% Full Time, 5% Part Time, and 7% Contract. Highlights an 76% Physical, and 24% Remote job distribution, with an average salary of $125,110 per year, or $60.1 per hour.
Senior Staff+ Software Engineer, Kubernetes Platform

Senior Staff+ Software Engineer, Kubernetes Platform

Anthropic

San Francisco, CA

$144K - $190K/yr

Other

Posted 28 days ago


Job description

About the role

Anthropic runs some of the largest Kubernetes clusters in the industry. We have fleets of hundreds of thousands of nodes across multiple cloud providers and datacenters to train, research, and serve frontier AI models. The Kubernetes Platform team owns the Kubernetes control plane that makes those clusters work.

We are operating at a scale where the defaults stop working. We own the scheduler and extend it to place topology-sensitive ML workloads across thousands of accelerators at once. We scale the control plane itself - apiserver, etcd, controllers - so it stays responsive as object counts and node counts grow by orders of magnitude. And we build the core cluster services every workload depends on, like service discovery, so they hold up under the same pressure.

We make sure the control plane is fast, correct, and always available. Your work will directly determine whether Anthropic can keep reliably and safely training frontier models as our compute footprint continues to grow.

Key responsibilities
  • Own, operate, and extend the Kubernetes scheduler for Anthropic's accelerator fleets, including custom scheduling plugins and policies for gang scheduling, topology awareness, and preemption
  • Scale the Kubernetes control plane (apiserver, etcd, controller-manager) to support clusters far beyond typical limits, and find the next bottleneck before it finds us
  • Design, build, and operate core cluster services such as service discovery that every workload in the fleet depends on
  • Build and maintain custom controllers, operators, and CRDs
  • Partner with research, training, and inference to understand workload shapes and turn their requirements into platform capabilities
  • Collaborate with cloud providers on required features and escalations
  • Participate in on-call, lead incident response, and design processes (postmortems, runbooks, SLOs) that help the team avoid repeating failures
Minimum qualifications
  • Significant software engineering experience building and operating production distributed systems
  • Proficiency in at least one systems-appropriate language (e.g., Go, Python, Rust, or C++)
  • Deep, hands-on Kubernetes experience (well beyond "user of") into scheduler, controllers, apiserver, or operating large multi-tenant clusters
  • Demonstrated ability to debug complex issues across the stack, from API behavior down to node and network-level root causes
  • A track record of designing for reliability, correctness, and clear failure semantics in systems other engineers depend on
  • Strong written and verbal communication; comfort building consensus with internal stakeholders
Preferred qualifications
  • Experience with Kubernetes internals or contributions: kube-scheduler / scheduling framework, apiserver, etcd, client-go, controller-runtime, or similar
  • Experience building or operating cluster schedulers or batch systems (e.g., Kueue, Volcano, Slurm, or in-house equivalents)
  • Background scaling control planes or coordination systems (etcd, ZooKeeper, Consul, or large DNS/service-mesh deployments)
  • Familiarity with ML infrastructure: GPUs, TPUs, or Trainium; gang scheduling; topology-aware placement; collective networking such as NCCL
  • Experience with GCP and/or AWS, including GKE/EKS internals and Infrastructure as Code
  • Low-level systems experience such as Linux kernel tuning, cgroups, or eBPF
  • 12+ years of relevant industry experience, including time leading large, ambiguous infrastructure projects