About This Role
We deliver mission-critical IT/OT infrastructure, in cloud and on-prem, for industrial customers that can't afford downtime.
Small team. Hard problems. Practical solutions. No bureaucracy. No blame. No egos.
We ship it, own it, and make it better: blameless but accountable, shoulder to shoulder. We work hard. We stay human. We trust each other. We figure it out.
If you know what to do, delight in building it, and feel the ownership to support it, keep reading.
What You'll Do
Customer Delivery
- Design complex IT/OT architectures, in cloud and on-prem, that are secure, recoverable, and sized appropriately
- Work directly with customers to understand their environment and estimate effort
- Own customer solutions end-to-end: requirements, design, build, support
- Build or use reusable modules when it makes sense; build bespoke when it doesn't
- Deploy and manage Kubernetes-based infrastructure and stateful applications across diverse customer environments
Incident Response & Ownership
- Participate in the on-call rotation alongside the rest of the team; everyone here supports what we ship
- Own incidents through resolution, then drive root cause analysis that eliminates the class of problem, not just the symptom
- Build the runbooks, alerts, and automation that make the next incident less likely or less painful
Infrastructure & Automation
- Work with Infrastructure-as-Code tools to provision and manage diverse customer environments
- Implement and maintain GitOps workflows for in-cluster deployments
- Ensure all infrastructure and application changes are declarative and version-controlled
- Automate self-healing and system updates to reduce manual intervention and keep environments current
Observability & Reliability
- Build and maintain monitoring, alerting, and dashboards using Prometheus, Loki, and Grafana
- Define SLIs and SLOs that reflect what actually matters to customers
- Surface real problems, reduce noise, and continually improve reliability and team efficiency
Shape the Future
- We don't have everything figured out. You'll help build, create, and shape how we operate
- Contribute to standards, patterns, and processes that make us better, not bureaucracy for its own sake
- Bring the SRE mindset: automate toil, prefer boring/stable systems, and relentlessly improve
What We're Looking For
- 5+ years in SRE, DevOps, or Infrastructure Engineering
- Strong Kubernetes skills in production environments: you'll troubleshoot real clusters, not just tutorials
- Experience with GitOps tooling (ArgoCD, Rancher Fleet, FluxCD, or similar)
- Solid understanding of Infrastructure-as-Code concepts (Terraform, Pulumi, Crossplane, or similar)
- Real incident response experience: you've been on-call, stayed calm, and fixed things under pressure
- Comfort with heterogeneous environments: every customer site is a little different, and you need to adapt
- Clear communication skills: you can write a useful runbook, gather requirements on a customer call, and document what you learned
- Ability to operate in ambiguity: we're building clarity, not waiting for it
Strong Plus
- Azure experience (our primary cloud)
- Experience with the SUSE ecosystem (SLE Micro, RKE2, Rancher, Longhorn)
- Industrial, manufacturing, or OT environment experience
- Familiarity with Inductive Automation's Ignition platform and MQTT
- Experience in a startup or small-team environment where you wore many hats
The SRE Mindset
This matters here. We need someone who:
- Sees repetitive manual work as a problem to automate, not a fact of life
- Prefers stable, predictable, "boring" production over clever and fragile
- Supports what they create; no throwing things over the wall
- Treats incidents as opportunities for systemic improvement
- Works well on a small team where everyone carries weight
- Stays current with SRE practices, emerging technologies, and cloud/edge trends
A Few Honest Words
This is a startup. Hours can be demanding. Priorities shift. You won't have a team of 30 backing you up.
What you will have: the autonomy to make real decisions, teammates who own their work, and customers who genuinely depend on what we build. We work hard because the work matters, and we have fun doing it.
If you want a structured 9-5, predictability, and a clear ladder, this probably isn't the right fit.
If you want to build, learn, and be part of something that's actually going somewhere, let's talk.
What We Offer
- Comprehensive benefits (Medical, Dental, Vision, 401K)
- Fully remote: work from anywhere in the world
- A team where it's safe to be honest, learn from mistakes, and get better together
Additional Information
We are committed to the principle of equal employment opportunity for all employees and to providing a work environment free from discrimination and harassment.