Compensation: Competitive base salary and meaningful equity
Benefits: Health & dental insurance, gym reimbursement, daily team meals, commuter benefits
We're an applied AI lab building coding agents. Julius executes
~1M lines of code every 36 hours for
1M+ users and has generated
3M+ visualizations. All code runs in code sandboxes (
isolated remote containers) that we manage. We're revenueโgenerating and backed by AI Grant, YCombinator, Bessemer Venture Partners and the founders from Vercel, Notion, Perplexity, Palantir, Replit, Zapier, Intercom, and Dropbox.
The RoleBuild and scale the
codeโexecution sandboxes that power Julius across cloud environments (AWS and GCP). We orchestrate
500k+ containers/month and growing. You'll own reliability, performance, and security for multiโtenant compute.
What You'll Do- Design and operate secure, multiโtenant container infrastructure with fast startup and smart autoscaling.
- Ship cloud deployments (Helm/Terraform) with SSO, network controls, and audit logging.
- Drive observability (metrics, traces, logs) with clear SLOs; lead incident response.
- Optimize images, scheduling, networking, and cost ; build fairโuse and rateโlimiting controls.
What You Bring- Production Kubernetes and container internals (Docker/containerd); strong networking fundamentals.
- Cloud (AWS/GCP/Azure) and IaC (Terraform/Helm).
- Monitoring/Logging (Prometheus, Grafana, OpenTelemetry, ELK/Vector).
- Security best practices for containerized, multiโtenant systems.
Nice to Have- gVisor/Kata/Firecracker; Cilium/eBPF; GPU scheduling; serverless autoscaling (KEDA/Knative/Karpenter).
- You've built an AI side project and enjoy tinkering with LLMs.
Why JuliusSmall, senior team; massive impact surface; hard infra problems at meaningful scale.