Upstart

Senior DevOps Engineer

Upstart

OR • On-site, Remote

$129K - $166K/yr

Other

Posted 4 days ago


Job description

The Team

Upstart's Cloud Platform team sits within the Reliability organization and is responsible for building and operating the shared cloud infrastructure that powers all product and machine learning workloads. The team owns core platform components across Kubernetes (EKS), AWS infrastructure, service mesh, identity, and developer tooling, enabling reliability, scalability, and security across the business.

As a Senior DevOps Engineer at Upstart, you will help evolve this platform to support increasing scale and complexity. You'll partner closely with SRE, Delivery, InfoSec, and product/ML teams to improve reliability, developer experience, and cost efficiency across a platform used by nearly every engineering team.

How you'll make an impact

  • Design and operate a fleet of Kubernetes (EKS) clusters across production, staging, and ephemeral environments, ensuring reliability and high availability
  • Evolve AWS infrastructure and network architecture (VPCs, subnets, IAM, account structure) to support scalable, multi-team workloads
  • Build and maintain infrastructure-as-code and GitOps workflows using tools such as Terraform, CDK, and ArgoCD
  • Improve platform reliability and performance by defining and driving SLOs, analyzing incidents, and implementing systemic fixes
  • Participate in and help improve the on-call rotation, leading incident response and post-incident reviews to drive systemic platform improvements
  • Partner with SRE, Delivery, InfoSec, and product/ML teams to land high-impact infrastructure changes and platform standards
  • Drive improvements in developer experience by simplifying platform usage, reducing toil, and enabling faster product and ML development
  • Contribute to cost efficiency initiatives by optimizing resource utilization across Kubernetes and cloud infrastructure

Minimum Qualifications 

  • Bachelor's degree in Computer Science, Engineering, Mathematics, or a related field (or equivalent practical experience) plus 4+ years of experience
  • Experience operating Kubernetes in production environments, including cluster networking, storage, and RBAC
  • Proficiency with AWS infrastructure, including VPC design, networking, and IAM
  • Proven expertise in implementing infrastructure-as-code using tools such as Terraform or AWS CDK
  • Experience implementing GitOps workflows using tools such as ArgoCD or similar
  • Ability to influence technical decisions across teams and drive adoption of platform standards

Preferred Qualifications

  • Knowledge of service mesh technologies such as Istio or Envoy
  • Experience designing or operating multi-cluster Kubernetes architectures
  • Experience with cloud networking at scale, including ingress/egress or edge platforms (e.g., Cloudflare)
  • Knowledge of cloud security, identity, and compliance frameworks (e.g., IAM, SOC 2, CIS benchmarks)

Position location

This role is available in the following locations: Remote

Time zone requirements

The team operates in East and West Coast time zones.

Travel requirements

As a digital-first company, the majority of your work can be accomplished remotely. The majority of our employees can live and work anywhere in the U.S. but are encouraged to still spend high-quality time collaborating in person via regular onsites. The cadence of in-person sessions varies by team and role; most teams meet once or twice per quarter for 2-4 consecutive days at a time.

#LI-REMOTE

#LI-MidSenior