Cloud Ops Jobs (NOW HIRING)

Systems Engineer - Cloud Ops

Memphis, TN · On-site

$54.25 - $72.25/hr

As a Systems Engineer on the Cloud Operations team, you will be responsible for deploying, managing, and optimizing our cloud-based infrastructure on Google Cloud Platform (GCP). You will work with ...

Cloud Practice Leader

Piscataway, NJ · On-site

$66.25 - $84.25/hr

... NOC, Cloud Ops, SecOps, L1/L2/L3) Collaborate with Sales and Presales in discovery workshops and customer presentations Develop high-level architecture diagrams, scope definitions, SLAs, and ...

Senior Financial Analyst - SaaS

Lehi, UT

$80K - $100K/yr

This individual will own the financial relationship with Cloud Ops and R&D leadership, lead AWS budgeting and forecasting in partnership with the FinOps team, and deliver reporting and dashboards ...

Cloud Ops Senior Engineer

Cambridge, MA · On-site

$61 - $81.50/hr

We are looking for a Senior Engineer to be a key member of the Cloud Operations team. You will be working with engineering to design and implement new automation and tools that accelerate the ...

Cloud Architect

Dallas, TX · On-site

$64.25 - $82/hr

This role ensures secure, scalable, and compliant cloud operations, emphasizing IaaS and PaaS models, automation, and collaboration with Cloud-Ops and enterprise data teams. Key Responsibilities: · ...

Configuration DevOps Engineer

Palo Alto, CA · On-site

$62.25 - $85.25/hr

Senior Dev Ops Engineer (Cloud Ops) · Organization: Operations · Reports To: Sr. Manager, Cloud Operations · Location: Palo Alto · Ariba, Inc. is the leading provider of collaborative business commerce ...

Sr. DevOps Engineer

Cupertino, CA · On-site

$60 - $65/hr

Minimum Qualifications * 5+ years in Cloud Ops / SRE / DevOps roles. * Experience with GitOps + CI/CD (Jenkins, ArgoCD, etc.). * Strong in observability: Prometheus, Grafana, Splunk, etc.

Cloud Engineer

Morristown, NJ · On-site

$57.25 - $76.75/hr

Work closely with 3rd party support providers on Cloud OPS details * Responsible for Backup and Recovery operations * Perform Tier 1 through Tier III support * Azure Cloud Services Experience a Plus ...

Mphasis - Java junior architect

West Pittsburg, PA · On-site

$56.50 - $76.25/hr

An excellent techie with strong experience in Core Java, Spring, Spring Boot, Splunk, Appd, XML, Cloud Ops, etc. * Meeting with technology managers and the design team to discuss the goals and needs ...

Cloud Architect

Salt Lake City, UT · On-site

$62.75 - $80/hr

Work closely with our Cloud Ops team to guide & assist the successful implementation of cloud architect initiatives * Participate as a decision-maker in the Architecture Coalition process, assessing ...

Cloud Ops information

Hourly pay scale: $23 · $62 · $87

How much do cloud ops jobs pay per hour?

As of Jun 25, 2026, the average hourly pay for cloud ops in the United States is $62.89, according to ZipRecruiter salary data. Most workers in this role earn between $53.61 and $71.63 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in the Cloud Ops position, and why are they important?

To thrive as a Cloud Ops professional, you need expertise in cloud infrastructure management, networking, security, and scripting languages, often supported by a degree in computer science or a related field. Familiarity with platforms like AWS, Azure, or Google Cloud, experience with automation tools such as Terraform or Ansible, and certifications like AWS Certified SysOps Administrator are highly valued. Strong problem-solving skills, effective communication, and the ability to work under pressure are essential soft skills for success. These competencies ensure reliable cloud operations, minimize downtime, and enable seamless collaboration across cross-functional teams.

What does a cloud ops team do?

A cloud operations (cloud ops) team manages and maintains cloud infrastructure and services, ensuring their availability, security, and performance. They handle tasks such as deploying applications, monitoring system health, automating processes with scripts and cloud platform tooling, and troubleshooting issues to support reliable cloud-based environments.

What is a Cloud Ops job?

A Cloud Ops (Cloud Operations) job involves managing, monitoring, and optimizing cloud infrastructure to ensure reliability, security, and performance. Cloud Ops professionals handle tasks like incident response, automation, system updates, and cost management. They work with cloud platforms such as AWS, Azure, or Google Cloud to maintain seamless operations. Their goal is to ensure high availability, scalability, and security of cloud-based applications and services.

What jobs in the US pay 300,000 a year?

In the field of Cloud Operations, senior roles such as Cloud Architects, Cloud Engineering Managers, and Solutions Architects can reach or exceed $300,000 annually, especially with extensive experience, AWS or Azure certifications, and leadership responsibilities. High-level positions in cloud infrastructure and strategic planning often command this level of compensation.

Is cloud operations a good career?

Cloud operations is a growing field that involves managing and maintaining cloud infrastructure on platforms like AWS, Azure, or Google Cloud. It offers strong job demand, competitive salaries, and opportunities for specialization in areas such as security, automation, and DevOps. Success typically requires technical skills, certifications, and continuous learning to keep up with evolving technologies.

What does a typical day look like for a Cloud Ops professional?

A typical day for a Cloud Ops professional involves monitoring cloud systems and infrastructure, responding to alerts or incidents, performing scheduled maintenance, and deploying updates or security patches. Much of the day is spent collaborating with development, security, and IT teams to ensure systems are running optimally and to troubleshoot any issues that arise. You may also work on automating repetitive tasks using configuration management or scripting tools and participate in planning for scaling or new project rollouts. This dynamic environment requires adaptability and ongoing learning to keep pace with evolving technologies and best practices.

What engineers make $500,000?

Senior Cloud Operations engineers with extensive experience, specialized skills in cloud infrastructure, automation, and security, and often certifications like AWS Certified Solutions Architect or Google Cloud Professional Cloud Architect can reach or exceed $500,000 in total compensation, especially in high-cost-of-living areas or large organizations. These roles typically involve leadership responsibilities, complex system management, and strategic planning.
More about Cloud Ops jobs
Infographic showing Cloud Ops job openings in the United States as of June 2026: 100% full-time and 100% in-person, with an average salary of $130,802 per year, or $62.9 per hour.
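
The yearly and hourly averages above line up if a full-time year is taken as 2,080 paid hours (40 hours × 52 weeks). That figure is an assumption, not something the salary data states. A minimal Python sketch of the conversion, with hypothetical function names:

```python
# Assumption: a full-time year is 40 hours/week * 52 weeks = 2,080 paid hours.
HOURS_PER_YEAR = 40 * 52


def annual_to_hourly(annual_salary: float) -> float:
    """Convert a yearly salary to an hourly rate under the 2,080-hour assumption."""
    return annual_salary / HOURS_PER_YEAR


def hourly_to_annual(hourly_rate: float) -> float:
    """Convert an hourly rate to a yearly salary under the same assumption."""
    return hourly_rate * HOURS_PER_YEAR


if __name__ == "__main__":
    # Average yearly salary quoted in the infographic text: $130,802.
    print(round(annual_to_hourly(130_802), 2))  # 62.89
    # Posted range for the Systems Engineer role: $54.25 - $72.25/hr.
    print(hourly_to_annual(54.25), hourly_to_annual(72.25))  # 112840.0 150280.0
```

Under the same assumption, the posted $54.25 - $72.25/hr range for the Systems Engineer role works out to roughly $112,840 - $150,280 per year. Actual pay depends on hours worked, overtime, and benefits.
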
Systems Engineer - Cloud Ops

AutoZone

Memphis, TN • On-site

$54.25 - $72.25/hr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 24 days ago


AutoZone rating

Company rating: 5.3 out of 10

Based on 1,868 frontline employees who took The Breakroom Quiz

35th of 39 rated national retailers


Job description

As a Systems Engineer on the Cloud Operations team, you will be responsible for deploying, managing, and optimizing our cloud-based infrastructure on Google Cloud Platform (GCP). You will work with technologies such as Terraform, Kubernetes (GKE), GitOps/ArgoCD, CI/CD pipelines, and observability tools to ensure reliable, secure, and scalable platform operations.

You will also contribute to our AI/ML platform initiatives, supporting infrastructure for LLM-based applications and AI-powered automation tools that enhance developer productivity and operational efficiency.

You will collaborate with development teams, SREs, and platform architects to ensure seamless deployment and delivery of applications while maintaining the highest standards of reliability, security, and performance.
 

Since opening our first store in 1979, AutoZone has grown into a leading retailer and distributor of automotive parts and accessories across the Americas. Our customer-first mindset and commitment to Going the Extra Mile define who we are, for both our customers and AutoZoners. Working at AutoZone means being part of a team that values dedication, teamwork, and growth. Whether you're helping customers or building your career, we provide tools and support to help you succeed and drive your future.

Benefits at AutoZone
AutoZone offers thoughtful benefits programs with one-on-one benefits guidance designed to improve AutoZoners' physical, mental and financial well-being.

All AutoZoners (Full-Time and Part-Time):
  • Competitive pay
  • Unrivaled company culture
  • Medical, dental and vision plans
  • Exclusive discounts and perks, including an AutoZone in-store discount
  • 401(k) with company match and Stock Purchase Plan
  • AutoZoners Living Well Program for free mental health support
  • Opportunities for career growth
Additional Benefits for Full-Time AutoZoners:
  • Paid time off
  • Life, and short- and long-term disability insurance options
  • Health Savings and Flexible Spending Accounts with wellness rewards
  • Tuition reimbursement
Minimum age requirements may apply. Eligibility and waiting period requirements may apply; benefits for AutoZoners in Puerto Rico, Hawaii, or the U.S. Virgin Islands may differ. Learn more about all that AutoZone has to offer at Careers.AutoZone.com.

We proudly support Veterans, Active-duty Service Members, Reservists, National Guard and Military Families. Your experience is highly valued, and we encourage you to apply to join our team.

Online Application:
An online application is required. Click the Apply button to complete your application. For step-by-step instructions on how to apply visit careers.autozone.com/candidateresources.

AutoZone and its subsidiary, ALLDATA, are equal opportunity employers. All applicants will be considered for employment without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status, or any other legally protected categories.

Kubernetes Expertise (Essential):

  • 3+ years hands-on experience with Kubernetes in production environments
  • Deep understanding of Kubernetes architecture: API server, etcd, scheduler, controller manager, kubelet
  • Experience with GKE (Standard and Autopilot modes), including cluster creation, upgrades, and maintenance
  • Proficiency in troubleshooting workloads: analyzing pod logs, events, describe outputs, and container states
  • Strong understanding of resource management: requests, limits, QoS classes, and resource quotas
  • Experience with Kubernetes networking: Services (ClusterIP, NodePort, LoadBalancer), Ingress, Network Policies
  • Knowledge of Kubernetes storage: PersistentVolumes, PersistentVolumeClaims, StorageClasses, dynamic provisioning
  • Experience with Helm charts for application packaging and deployment
  • Familiarity with Kubernetes security: RBAC, Pod Security Standards, Secrets management, Workload Identity
  • Understanding of Kubernetes observability: metrics-server, kubectl top, container resource monitoring
  • Experience debugging common issues: ImagePullBackOff, CrashLoopBackOff, OOMKilled, Evicted pods, pending pods
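
For candidates reviewing the resource-management item above: Kubernetes derives a pod's QoS class (Guaranteed, Burstable, or BestEffort) from its containers' CPU and memory requests and limits. The sketch below is a simplified Python illustration and not part of the posting. The `qos_class` function and the dict layout are hypothetical, quantities are compared as plain strings (so "500m" and "0.5" would not match), and init containers and pod-level resources are ignored.

```python
from typing import Dict, List

# Hypothetical layout for one container's resources:
# {"requests": {"cpu": "500m", "memory": "256Mi"}, "limits": {...}}
ContainerResources = Dict[str, Dict[str, str]]


def qos_class(containers: List[ContainerResources]) -> str:
    """Return a simplified QoS class for a pod from its containers' resources."""
    any_set = False      # does any container set any CPU/memory request or limit?
    guaranteed = True    # does every container have limit == request for both?

    for container in containers:
        requests = container.get("requests", {})
        limits = container.get("limits", {})
        for resource in ("cpu", "memory"):
            limit = limits.get(resource)
            # When only a limit is set, Kubernetes defaults the request to the limit.
            request = requests.get(resource, limit)
            if request is not None or limit is not None:
                any_set = True
            if limit is None or request != limit:
                guaranteed = False

    if not any_set:
        return "BestEffort"
    return "Guaranteed" if guaranteed else "Burstable"


if __name__ == "__main__":
    equal = {"cpu": "500m", "memory": "256Mi"}
    print(qos_class([{"requests": equal, "limits": equal}]))
    print(qos_class([{"requests": {"cpu": "250m"}}]))
    print(qos_class([{}]))
```

The class matters during node memory pressure: BestEffort pods are typically evicted first and Guaranteed pods last, which is one reason the Evicted and OOMKilled cases listed above are often traced back to requests and limits.
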

Cloud & Infrastructure:

  • 3+ years of experience with Google Cloud Platform (GCP) services including GKE, Cloud Run, Cloud SQL, Memorystore, Pub/Sub, and Cloud Logging
  • Strong experience with Terraform for infrastructure as code (IaC)
  • Understanding of cloud networking: VPCs, subnets, firewall rules, Cloud NAT, Private Service Connect

CI/CD & GitOps:

  • Proficiency with GitLab CI/CD pipelines
  • Experience with ArgoCD or similar GitOps tools
  • Understanding of Helm charts and Kustomize for Kubernetes manifest management

Observability & Troubleshooting:

  • Experience with monitoring and APM tools (Dynatrace, Datadog, Prometheus, Grafana)
  • Ability to analyze logs, metrics, and traces to diagnose production issues
  • Familiarity with JVM troubleshooting (heap dumps, thread analysis, GC tuning, connection pool issues)

AI/ML Knowledge:

  • Basic understanding of LLM concepts, prompt engineering, and AI model deployment
  • Familiarity with AI coding assistants and their integration into development workflows
  • Interest in agentic AI systems and autonomous automation tools
  • Exposure to vector databases (Pinecone, Weaviate, pgvector) and RAG architectures is a plus

Systems & Networking:

  • Strong Linux administration skills
  • Understanding of networking concepts (DNS, load balancing, firewalls, TCP/IP)
  • Experience with service mesh (Istio) is a plus

General:

  • Excellent problem-solving and analytical skills
  • Strong written and verbal communication
  • Ability to work effectively in a collaborative, cross-functional environment
  • Experience working in an Agile/DevOps culture
  • Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience)

Cloud Infrastructure, Automation & Operations:

  • Design, build, and maintain cloud infrastructure using Terraform to automate provisioning, scaling, and lifecycle management of resources on GCP
  • Develop and maintain CI/CD pipelines using GitLab CI to automate build, test, and deployment workflows. Implement and maintain GitOps practices using ArgoCD for declarative, version-controlled application deployment
  • Monitor system performance using observability tools (Dynatrace, Cloud Monitoring, Prometheus/Grafana) and troubleshoot production issues
  • Participate in on-call rotation to provide 24/7 support for critical infrastructure incidents
  • Perform root cause analysis on incidents and implement preventive measures. Document runbooks, architecture decisions, and operational procedures

Kubernetes Platform Management:

  • Deploy, configure, and manage containerized applications on Google Kubernetes Engine (GKE), including GKE Autopilot and Standard clusters
  • Manage cluster lifecycle including upgrades, node pool configurations, and capacity planning
  • Troubleshoot pod failures, CrashLoopBackOff, OOMKilled events, and container resource issues
  • Configure and optimize resource requests/limits, Horizontal Pod Autoscaler (HPA), and Vertical Pod Autoscaler (VPA)
  • Manage Kubernetes networking including Services, Ingress controllers, Network Policies, and DNS configurations. Implement and manage service mesh (Istio) for traffic management, observability, and security
  • Manage secrets and configurations using Kubernetes Secrets, ConfigMaps, and external secret management tools. Implement pod security standards, RBAC policies, and workload identity configurations
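
As background for the Horizontal Pod Autoscaler item above, the core scaling rule documented for Kubernetes is desiredReplicas = ceil(currentReplicas × currentMetric / targetMetric), with no change when the ratio is within a tolerance of 1.0 (0.1 by default). The Python sketch below illustrates only that rule. The function name is hypothetical, and it leaves out min/max replica bounds, stabilization windows, and handling of missing metrics.

```python
import math


def desired_replicas(current_replicas: int,
                     current_metric: float,
                     target_metric: float,
                     tolerance: float = 0.1) -> int:
    """Apply the basic HPA ratio rule to get a desired replica count."""
    ratio = current_metric / target_metric
    # Within tolerance of the target: leave the replica count unchanged.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    # Otherwise scale proportionally and round up.
    return math.ceil(current_replicas * ratio)


if __name__ == "__main__":
    # 4 replicas averaging 90% CPU against a 60% target.
    print(desired_replicas(4, 90, 60))   # 6
    # 10 replicas averaging 30% CPU against a 60% target.
    print(desired_replicas(10, 30, 60))  # 5
    # 5 replicas at 63% against a 60% target: within tolerance, no change.
    print(desired_replicas(5, 63, 60))   # 5
```

Because utilization targets are measured against each container's requests, HPA tuning and request sizing tend to be adjusted together.
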

AI/ML Platform & Automation:

  • Support infrastructure for AI/ML workloads including LLM-based applications and model serving platforms
  • Deploy and manage AI-powered developer tools such as coding assistants (Claude Code, GitHub Copilot) and agentic AI systems. Explore and implement AI-assisted incident response and automated remediation workflows
  • Build and maintain infrastructure for Retrieval-Augmented Generation (RAG) pipelines and vector databases
  • Configure GPU-enabled node pools and optimize resource allocation for AI/ML workloads
  • Implement MCP (Model Context Protocol) servers and AI agent integrations for operational automation
  • Stay current with emerging AI technologies and evaluate their applicability for infrastructure automation
     

About AutoZone

Sourced by ZipRecruiter

AutoZone Inc (AutoZone) is a retailer and distributor of automotive replacement parts and accessories. The company provides new and remanufactured automotive hard parts, maintenance items, accessories, and non-automotive products. AutoZone sells automotive diagnostic and repair software through its subsidiary ALLDATA.

Industry

Motor vehicle and motor vehicle parts wholesalers

Company size

10,000+ Employees

Headquarters location

Memphis, TN, US

Year founded

1979