Job Summary:
Base Power Company is America's next-generation power company focused on transforming the centralized electricity grid into a resilient system through distributed batteries. They are seeking an Infrastructure Engineer to design and operate the infrastructure that coordinates a network of computing nodes, enabling device communication and task management at scale.
Responsibilities:
• Design and build the core orchestration and coordination layer that manages device fleet operations (task distribution, state synchronization, health monitoring) with >99.9% availability.
• Build backend systems that reliably handle device-to-cloud communication at scale, including message routing, acknowledgment, retry logic, and conflict resolution for concurrent updates.
• Develop APIs and services that allow product teams to query device state, push updates, and execute commands on thousands of devices simultaneously without bottlenecks or data consistency issues.
• Design architectures that scale horizontally from hundreds to millions of devices without re-architecture, while optimizing compute, storage, and network costs.
• Implement monitoring, alerting, and operational runbooks that allow the team to understand and troubleshoot distributed system behavior in production.
• Build reliable async communication patterns using message queues and event streaming, handling ordering guarantees, deduplication, and exactly-once semantics.
• Own the database and storage layer decisions that support both operational and analytical workloads, knowing when to use relational databases, NoSQL stores, or specialized systems.
• Partner with hardware and device teams to understand their needs and translate them into scalable, reliable backend services.
• Write infrastructure-as-code that is maintainable, tested, and reproducible, enabling safe and rapid iteration.
Qualifications:
Required:
• 5+ years building backend infrastructure or distributed systems, preferably at scale
• Strong experience in Go, Python, Java, or equivalent backend languages
• Deep understanding of distributed systems concepts: eventual consistency, state synchronization, failure handling
• Experience building APIs and services that handle high scale and high concurrency
• Familiarity with message queues or event streaming (Kafka, RabbitMQ, SQS, or similar)
• Solid understanding of databases and data modeling, including knowing when to use relational vs. NoSQL vs. specialized stores
• Comfort with infrastructure-as-code and cloud platforms (AWS or GCP)
• Proven ability to own complex systems end-to-end: design, implementation, deployment, and operational support
Preferred:
• Experience building device management or IoT backend systems
• Familiarity with Kubernetes and container orchestration
• Background in energy, utilities, or other operational technology (OT) domains
• Experience with distributed tracing and observability at scale (Datadog, Honeycomb, etc.)
• Knowledge of fleet management, device provisioning, or OTA update systems
• Exposure to consensus algorithms (Raft, Paxos) or distributed coordination (etcd, ZooKeeper)
• Experience with stream processing frameworks (Kafka Streams, Flink, etc.)
• Experience operating production systems with clear, well-maintained operational runbooks
• Experience with data center orchestration systems and baseboard management controllers
Company:
Base is building the future of American power. The grid is the largest, most complex machine in the world. Founded in 2023, the company is headquartered in Austin, USA, with a team of 201-500 employees. The company is currently at the growth stage.