STACK Infrastructure
STACK Infrastructure

6 Stack Infrastructure Infrastructure Engineer Jobs Hiring Near You

STACK Infrastructure Jobs Information

What are the key skills and qualifications needed to thrive as an Infrastructure Engineer, and why are they important?

To thrive as an Infrastructure Engineer, you need a solid understanding of networking, operating systems, cloud platforms, and infrastructure architecture, often supported by a degree in computer science or related fields. Familiarity with tools like AWS, Azure, VMware, automation frameworks (e.g., Ansible, Terraform), and relevant certifications such as AWS Certified Solutions Architect or CompTIA Network+ is typical. Strong problem-solving, communication, and teamwork skills help you collaborate across IT and business units and respond effectively to incidents. These skills and qualities are crucial for ensuring system reliability, security, and scalability in complex technology environments.

What are some common challenges Infrastructure Engineers face when managing large-scale systems?

Infrastructure Engineers often encounter challenges such as ensuring system scalability, maintaining high availability, and minimizing downtime during updates or incidents. Managing complex environments requires balancing security, performance, and cost efficiency while supporting rapid growth or changes in business needs. Effective communication and collaboration with development, security, and operations teams are also crucial to address issues quickly and maintain seamless service delivery.

What are Infrastructure Engineers?

Infrastructure Engineers are IT professionals responsible for designing, building, managing, and maintaining the foundational technology systems that support organizations. This includes servers, networks, storage, virtualization, and cloud resources. Their main goal is to ensure a reliable, secure, and scalable infrastructure that enables business operations and supports application needs. They often collaborate with other IT teams to implement new technologies, troubleshoot issues, and optimize system performance.

What is the difference between Infrastructure Engineer vs Network Engineer?

AspectInfrastructure EngineerNetwork Engineer
CertificationsCompTIA Network+, Cisco CCNA, Cisco CCNPCompTIA Network+, Cisco CCNA, Cisco CCNP
Work EnvironmentData centers, cloud environments, enterprise IT infrastructureNetwork operations centers, enterprise networks, ISP environments
ResponsibilitiesDesigning, implementing, maintaining IT infrastructure, servers, cloud systemsDesigning, configuring, troubleshooting network hardware and connectivity
Industry UsageIT companies, cloud providers, large enterprisesTelecommunications, ISPs, large organizations with complex networks

While both roles require similar certifications and work in enterprise environments, Infrastructure Engineers focus on overall IT infrastructure including servers and cloud systems, whereas Network Engineers specialize in network hardware and connectivity. Understanding these differences helps in choosing the right career path or job search focus.

What are the most popular categories at Stack Infrastructure?
Infographic showing various Infrastructure Engineer job openings at Stack Infrastructure in the United States as of May 2026, with employment types broken down into 100% Full Time. Highlights an 100% Physical job distribution.
Cloud Infrastructure and Automation Engineer

Cloud Infrastructure and Automation Engineer

STACK Infrastructure

Plano, TX • On-site

$103.50K - $135.80K/yr

Full-time

Posted yesterday


Job description

Job Summary:
STACK Infrastructure is an award-winning industry leader providing digital infrastructure for innovative companies. The Cloud Infrastructure & Automation Engineer will own the cloud platform and operational infrastructure, ensuring the successful deployment and reliability of automation and data initiatives.
Responsibilities:
• Design, deploy, and manage Azure infrastructure across dual EA subscriptions (Dev/Non-Prod and Production) including Databricks workspaces, AI Search clusters, Cosmos DB instances, ADLS Gen2, Azure OpenAI Service endpoints, and Azure Functions.
• Implement Infrastructure-as-Code using Terraform, Bicep, or ARM templates with modular, version-controlled patterns enabling new workloads to deploy within hours.
• Configure Azure networking (VNets, Private Endpoints, NSGs, Private DNS) for secure, globally distributed platform environments across AMER, EMEA, and APAC.
• Build container-based deployment patterns (Azure Container Apps, AKS) for API serving, agent hosting, model inference, and automation execution.
• Provision and manage LLM/SLM serving infrastructure: Azure OpenAI deployments, model endpoints, token-based scaling, and multi-region failover.
• Design end-to-end CI/CD pipelines (Azure DevOps, GitHub Actions) for application deployment, model promotion, data pipeline orchestration, and automated testing with blue/green and canary patterns.
• Build MLOps pipelines for model registration, versioning, A/B testing, canary deployment, and automated rollback of LLM endpoints and RAG configurations.
• Deploy and manage automation runtime infrastructure: Azure Logic Apps, Power Automate, Azure Functions, Durable Functions, and event-driven triggers for intelligent workflows.
• Maintain agent hosting environments (Chainlit, FastAPI, Teams bots) for the HR PM Agent and future agentic solutions, with auto-scaling and health monitoring.
• Create reusable deployment accelerators (Terraform modules, Helm charts, pipeline templates) to reduce time-to-production for each successive initiative.
• Drive Azure cost optimization: commitment-tier analysis, right-sizing, automated shutdown policies, and token consumption tracking across LLM endpoints.
• Implement RBAC, managed identities, Key Vault integration, and least-privilege access across all platform components.
• Ensure SOX compliance, data residency, and governance using Microsoft Purview, Defender XDR, and Azure Policy.
• Manage secrets, certificates, API key rotation, and Entra ID integration for platform authentication across global regions.
• Produce monthly infrastructure cost and performance reports with spend trends, cost-per-query, and optimization metrics.
Qualifications:
Required:
• 7+ years of cloud infrastructure/DevOps experience with at least 2 years supporting AI/ML, automation, or data platform workloads at scale.
• Expert-level Azure skills: Databricks, Cosmos DB, Azure Functions, Logic Apps, ADLS Gen2, Azure AI Search, Azure OpenAI Service, Container Apps/AKS, and Azure Monitor.
• Strong IaC proficiency: Terraform (modules, state, workspaces), Bicep, or ARM templates with environment-templated patterns.
• Hands-on CI/CD engineering: Azure DevOps, GitHub Actions, container registries, Helm charts, and blue/green/canary deployment automation.
• Solid Python and Bash skills for infrastructure tooling, automation scripts, and deployment utilities.
• Deep understanding of Azure networking, security (RBAC, managed identities, Key Vault, Private Endpoints, Azure Policy), and cost management.
• Experience with containerization (Docker) and orchestration (AKS or Container Apps) for production workload and model serving.
• Familiarity with AI platform infrastructure: Databricks provisioning, Cosmos DB scaling, AI Search management, and LLM endpoint deployment.
Preferred:
• Experience deploying RAG platform infrastructure, vector search clusters, and LLM/SLM serving endpoints in production.
• Hands-on MLOps: model registries, experiment tracking, automated deployment pipelines, and A/B testing infrastructure.
• Background in enterprise IT environments with M365, Intune, and Entra ID.
• Azure certifications: AZ-104, AZ-400, AZ-305.
• FinOps certification or demonstrated cloud cost optimization experience delivering measurable savings.
• Experience supporting global operations across AMER, EMEA, and APAC with high-availability requirements.
Company:
STACK Infrastructure provides digital infrastructure solutions, focusing on data centers, colocation, and build-to-suit projects. Founded in 2019, the company is headquartered in Denver, USA, with a team of 1001-5000 employees. The company is currently Late Stage.