2

Remote Ai Infrastructure Engineer Jobs (NOW HIRING)

AI Infrastructure Engineer

Ann Arbor, MI · On-site +1

$170K - $210K/yr

The AI Infrastructure Engineer is responsible for designing, building, and owning the end-to-end ... fully remote candidates, with periodic travel expected for company retreats and key on-site ...

AI Infrastructure Engineer

Ann Arbor, MI · Remote

$170K - $210K/yr

The AI Infrastructure Engineer is responsible for designing, building, and owning the end-to-end ... fully remote candidates, with periodic travel expected for company retreats and key on-site ...

AI Infrastructure Engineer

Ann Arbor, MI · On-site +1

$170K - $210K/yr

The AI Infrastructure Engineer is responsible for designing, building, and owning the end-to-end ... fully remote candidates, with periodic travel expected for company retreats and key on-site ...

AI Infrastructure Engineer

New York, NY · Remote

$150K - $200K/yr

As an AI Infrastructure Engineer, your role will include: * Lead Technical Deployments: Drive end ... and we have a remote-first work culture. We are the leading platform for operating GPU ...

AI Infrastructure Engineer

$110K - $144K/yr

They are seeking an AI Infrastructure Engineer to own the infrastructure and operational reliability that powers their AI systems, focusing on defining infrastructure patterns and building ...

AI Lab Infrastructure Engineer

$110K - $144K/yr

About the Role As an AI Infrastructure Engineer, you will architect and build the virtual access ... Build scalable remote processing capabilities supporting 100,000+ documents per day * Create ...

Staff AI Infrastructure Engineer

Redwood City, CA · On-site +1

$131K - $172K/yr

Achieving that requires training frontier-scale AI biology models, and that demands reliable, high-performance compute infrastructure. This is production engineering work at a frontier AI lab, with ...

Infrastructure Engineer (Storage)

New York, NY · On-site +1

$117K - $154K/yr

Who We Are Lightning AI is the company behind PyTorch Lightning. Founded in 2019, we build an end ... This role can work hybrid out of one of our US-based hubs (Seattle, NYC, or SF) or fully remote ...

next page

Showing results 1-20

Remote Ai Infrastructure Engineer information

See salary details

$46.5K

$127.1K

$182K

How much do remote ai infrastructure engineer jobs pay per year?

As of Jun 8, 2026, the average yearly pay for remote ai infrastructure engineer in the United States is $127,066.00, according to ZipRecruiter salary data. Most workers in this role earn between $107,500.00 and $141,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Remote AI Infrastructure Engineer, and why are they important?

To thrive as a Remote AI Infrastructure Engineer, you need expertise in cloud computing, distributed systems, and software engineering, often supported by a degree in computer science or a related field. Familiarity with tools like Kubernetes, Docker, Terraform, and cloud platforms such as AWS, Azure, or GCP is typically required, along with knowledge of CI/CD pipelines and AI/ML frameworks. Strong problem-solving skills, self-motivation, and effective remote communication are essential soft skills for success in this role. These skills ensure robust, scalable AI infrastructure that supports rapid innovation and seamless collaboration across distributed teams.

What is a Remote AI Infrastructure Engineer?

A Remote AI Infrastructure Engineer is a professional who designs, builds, and maintains the systems and tools necessary to support artificial intelligence (AI) projects, all while working remotely. Their responsibilities often include developing and optimizing cloud or on-premise infrastructure, ensuring scalability, managing data pipelines, and supporting machine learning workflows. They work closely with data scientists and software engineers to ensure AI models can be efficiently trained, deployed, and monitored in production environments. The remote aspect allows them to perform these tasks from anywhere, using collaboration tools and cloud platforms.

What are some common challenges faced by Remote AI Infrastructure Engineers, and how can they be addressed?

Remote AI Infrastructure Engineers often encounter challenges such as managing distributed systems, ensuring robust data pipelines, and maintaining high system reliability across different time zones. Collaboration with cross-functional teams can require clear communication and effective use of remote tools. To address these challenges, it's important to establish strong documentation practices, schedule regular check-ins, and utilize automated monitoring and deployment solutions. Staying proactive and adaptable helps ensure seamless infrastructure performance and team alignment.
More about Remote Ai Infrastructure Engineer jobs
What cities are hiring for Remote Ai Infrastructure Engineer jobs? Cities with the most Remote Ai Infrastructure Engineer job openings:
What are the most commonly searched types of Ai Infrastructure Engineer jobs? The most popular types of Ai Infrastructure Engineer jobs are:
What states have the most Remote Ai Infrastructure Engineer jobs? States with the most job openings for Remote Ai Infrastructure Engineer jobs include:
Infographic showing various Remote Ai Infrastructure Engineer job openings in the United States as of May 2026, with employment types broken down into 86% Full Time, 11% Part Time, and 3% Contract. Highlights an 85% Physical, 5% Hybrid, and 10% Remote job distribution, with an average salary of $127,066 per year, or $61.1 per hour.
AI Infrastructure Engineer

AI Infrastructure Engineer

STN Incorporated

Pleasanton, CA • Remote

$145K - $195K/yr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 2 days ago


Job description

Location: Remote or Onsite – Pleasanton, California

At STN, we don't just adapt to the digital future, we engineer it. Our mission is to help organizations thrive in a rapidly evolving technology landscape through strategic insight, cutting-edge solutions, and a security-first mindset. We provide end-to-end services spanning cloud consulting, AI infrastructure, and enterprise security, enabling secure, scalable, and future-ready transformation.

As trusted advisors, we align IT investments with business outcomes that drive performance and growth, starting with deep strategic engagement and delivering tailored solutions built for long-term impact.

Our approach is innovation-led and rooted in cybersecurity, with a focus on leveraging the right technologies to solve real-world challenges. We invest in our people and foster a culture of growth, inclusion, and purpose because we believe empowered teams build transformative technology.

Overview
The AI Infrastructure Engineer will be responsible for designing, deploying, and maintaining robust infrastructure systems tailored for AI and machine learning operations. This role focuses on ensuring seamless performance, scalability, and reliability in distributed computing environments. You'll collaborate with data scientists, ML engineers, and DevOps teams to support large-scale AI training and inference pipelines.

Key Responsibilities

  • Design and implement AI infrastructure solutions, including cluster management, resource allocation, and workload orchestration for high-performance computing (HPC) environments.
  • Deploy, configure, and troubleshoot containerized applications using Kubernetes across various flavors (e.g., vanilla Kubernetes, Amazon EKS, Google GKE, Azure AKS, and on-premises setups).
  • Manage job scheduling and resource management using Slurm for efficient utilization of GPU clusters in AI training workflows.
  • Optimize Ubuntu-based systems for AI workloads, including kernel tuning, security hardening, and performance monitoring.
  • Integrate and maintain NVIDIA GPU technologies, ensuring compatibility with AI frameworks like TensorFlow, PyTorch, and CUDA.
  • Monitor system performance, identify bottlenecks, and implement automation scripts for infrastructure provisioning and scaling.
  • Collaborate on disaster recovery planning, security compliance, and cost optimization for cloud and on-premises AI infrastructure.
  • Stay updated on emerging technologies in AI infrastructure and contribute to best practices documentation.

Required Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • Proven expertise as an Ubuntu specialist, with hands-on experience in system administration, networking, and scripting (e.g., Bash, Python) on Ubuntu servers.
  • Extensive experience with Kubernetes in all major flavors, including cluster setup, scaling, networking (e.g., CNI plugins), and security (e.g., RBAC, Pod Security Policies).
  • Strong proficiency in Slurm for managing HPC clusters, including job submission, queue configuration, and integration with GPU resources.
  • 3+ years of experience in infrastructure engineering, preferably in AI/ML or HPC environments.
  • Familiarity with cloud platforms (AWS, GCP, Azure) and container orchestration tools.
  • Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment.

Preferred Qualifications

  • NVIDIA certifications (e.g., NVIDIA Certified Professional in Data Center GPU Management or CUDA Programming) are a strong plus.
  • Experience with other HPC schedulers (e.g., PBS, LSF) or AI-specific tools like Kubeflow.
  • Knowledge of infrastructure-as-code tools (e.g., Terraform, Ansible) and CI/CD pipelines.
  • Background in AI model deployment, monitoring tools (e.g., Prometheus, Grafana), or edge computing.

Compensation

  • Full-Time, Exempt
  • Salary: $145K-195K, DOE

Benefits

  • Health Coverage – Medical, Dental & Vision
  • FSA Health and Dependent Care available
  • 401(k) Plan
  • Unlimited Paid Time Off (PTO)
  • Observed Holidays Paid
  • Cell Phone Allowance
  • Collaborative, growth-driven culture

Applicants must be authorized to work in the U.S.  We are unable to provide sponsorship at this time.