1

Ml Infrastructure Jobs in Spring, TX (NOW HIRING)

Sr AI Computer Vision Engineer

Houston, TX · On-site

$116.90K - $154.20K/yr

... ML infrastructure, including Amazon SageMaker when appropriate • Collaborate closely with cross‑functional teams (AI platform, software engineering, domain experts) and mentor junior engineers ...

AI ML Operations Engineer

Houston, TX · On-site

$66.40K - $89.80K/yr

Designs production-grade ML systems end-to-end (data → training → deployment → monitoring ... Knowledge of Terraform and Ansible for automating infrastructure management * Knowledge of YAML for ...

AI ML Operations Engineer

Houston, TX

$66.40K - $89.80K/yr

Designs production-grade ML systems end-to-end (data training deployment monitoring) with ... Knowledge of Terraform and Ansible for automating infrastructure management * Knowledge of YAML for ...

Senior AI - Computer Vision Engineer

Houston, TX

$117K - $154.20K/yr

Work with GPUaccelerated environments (CUDAenabled frameworks) and AWSbased ML infrastructure, including Amazon SageMaker when appropriate * Collaborate closely with crossfunctional teams (AI ...

Senior AI - Computer Vision Engineer

Houston, TX · On-site

$99.80K - $137K/yr

Work with GPU-accelerated environments (CUDA-enabled frameworks) and AWS-based ML infrastructure, including Amazon SageMaker when appropriate * Collaborate closely with cross-functional teams (AI ...

... ML workloads, partnering with Infrastructure Engineering for cluster provisioning and governance • Extend existing CI/CD pipelines to support automated infrastructure changes and ML workflows • ...

AI Architect

Spring, TX · On-site

$137.25K - $202.45K/yr

Define scalable, secure, and cost-efficient AI frameworks for infrastructure automation and optimization. * Select appropriate AI/ML technologies, tools, and platforms (e.g., Azure ML, AWS SageMaker ...

next page

Showing results 1-20

Ml Infrastructure information

See Spring, TX salary details

$41.4K

$113.1K

$162K

How much do ml infrastructure jobs pay per year?

As of May 29, 2026, the average yearly pay for ml infrastructure in Spring, TX is $113,075.00, according to ZipRecruiter salary data. Most workers in this role earn between $95,700.00 and $125,500.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as an ML Infrastructure Engineer, and why are they important?

To thrive as an ML Infrastructure Engineer, you need a strong background in software engineering, cloud computing, and machine learning concepts, often supported by a degree in computer science or a related field. Proficiency with containerization tools (like Docker and Kubernetes), cloud platforms (such as AWS, GCP, or Azure), and CI/CD systems is critical. Excellent problem-solving, collaboration, and communication skills help you efficiently work with data scientists and DevOps teams. These skills and qualities are vital for building scalable, reliable ML systems that support rapid experimentation and deployment in production environments.

What are some common challenges faced by professionals working in ML Infrastructure roles?

Professionals in ML Infrastructure often encounter challenges related to scaling systems to handle large volumes of data, ensuring reliable deployment pipelines, and maintaining reproducibility across different environments. They must also collaborate closely with data scientists and engineers to streamline workflows and address issues like version control and model monitoring. Staying updated with rapidly evolving tools and best practices is essential, and balancing stability with innovation is a frequent aspect of the role.

What is ML Infrastructure?

ML Infrastructure refers to the underlying systems, tools, and processes that enable the development, deployment, and scaling of machine learning models. This includes data storage and management, computing resources, model training and serving environments, monitoring, and automation tools. ML Infrastructure ensures that data scientists and engineers can efficiently build, test, and maintain machine learning applications in a reliable and reproducible manner. It is a crucial foundation for organizations looking to operationalize AI and machine learning solutions at scale.

What is the difference between Ml Infrastructure vs Data Engineer?

AspectML InfrastructureData Engineer
Required CredentialsBachelor's in CS, Data Science, or related; knowledge of cloud platformsBachelor's in CS, Software Engineering, or related; experience with databases and ETL tools
Work EnvironmentFocus on deploying and maintaining ML systems, cloud environments, and infrastructure toolsDesigning, building, and managing data pipelines and storage solutions
Industry UsageUsed in AI/ML teams to support model deployment and scalabilityUsed across data-driven organizations for data management and analytics

ML Infrastructure specialists focus on deploying, scaling, and maintaining machine learning systems and infrastructure, while Data Engineers primarily build and manage data pipelines and storage solutions. Both roles require technical skills and often collaborate, but their core responsibilities differ in focus and tools used.

What job categories do people searching Ml Infrastructure jobs in Spring, TX look for? The top searched job categories for Ml Infrastructure jobs in Spring, TX are:
What cities near Spring, TX are hiring for Ml Infrastructure jobs? Cities near Spring, TX with the most Ml Infrastructure job openings:

Software Engineer, Machine Learning Infrastructure

Bot Auto

Houston, TX • On-site

$165.20K - $195.80K/yr

Other

Posted 19 days ago


Job description

Company Introduction

At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a start-up and the wisdom of seasoned experts, Bot Auto boasts a team that has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create miracles and propel the future of transportation. Join us and transform your dreams into reality.

We are seeking a highly skilled and motivated Software Engineer to design, develop, and scale our machine learning annotation, evaluation, and training infrastructure. This role is central to the quality and velocity of our perception and ML models - from curating and managing high-quality annotated datasets, to building robust evaluation pipelines that drive continuous model improvement. The ideal candidate combines strong systems engineering skills with a deep understanding of ML Workflows/Ops and large-scale data infrastructure.

Key Responsibilities

Machine Learning & Deep Learning Infrastructure

  • Evaluation Platform - Architect and own a scalable, end-to-end model evaluation platform for perception and prediction models central to autonomous driving. Define metrics, design for scale, and make results actionable for researchers.
  • Training Infrastructure - Partner with research scientists to optimize and scale distributed training workflows. Integrate experiment tracking and reproducibility into the model lifecycle from day one.
  • Dataset & Feature Store - Design and maintain a versioned, high-quality training data store that accelerates model development and supports rapid iteration.
  • ML Pipelines - Build automated pipelines spanning data preparation, model training, validation, and deployment - enabling fast experimentation and reproducible outcomes.
  • Annotation Platform - Contribute to tooling and infrastructure that powers high-throughput, high-accuracy data annotation at scale.
  • MLOps - Develop production ML services that treat models as products - with reliability, observability, and continuous improvement built in.

Data Infrastructure

  • Maintain and evolve a robust data storage and access layer (S3 data lake, Delta Lake) underpinning annotation, evaluation, and training workflows.
  • Build scalable, reliable data collection pipelines supporting diverse vehicle dispatch missions.
  • Develop foundational services and packages that provide clean, performant access to autonomous driving data across the stack.
Qualifications

Required:

  • Educational Background: Bachelor's or Master's in Computer Science, or equivalent practical experience.
  • Strong Programming Skills: Strong proficiency in Python; working knowledge of C++
  • ML/DL Infrastructure Experience - Demonstrated hands-on experience building or scaling at least one of the following in a production environment:
    • Evaluation platforms - automated model benchmarking, metric computation, and regression tracking across model versions.
    • Training infrastructure - distributed training pipelines, experiment tracking, and model lifecycle management (e.g. W&B, MLflow, ClearML).
    • Dataset curation & feature stores - versioned dataset management, data lineage, and tooling for high-quality training data at scale.
    • Annotation platforms - tooling or pipelines that support high-throughput, high-accuracy labeling workflows.
  • Distributed Systems - Strong experience with distributed computing and container orchestration - Kubernetes, Spark, or comparable frameworks.
  • Ability to operate independently: scope ambiguous problems, make sound architecture decisions, and drive them to completion.

Preferred:

  • C++ experience in performance-sensitive or safety-critical applications
  • Full-stack service development experience.
  • Prior work in autonomous driving or robotics.