Job Summary:
Bot Auto is revolutionizing the transportation of goods with cutting-edge autonomous trucks. They are seeking a highly skilled Software Engineer to design, develop, and scale machine learning infrastructure, focusing on model evaluation, training workflows, and data management.
Responsibilities:
• Architect and own a scalable, end-to-end model evaluation platform for perception and prediction models central to autonomous driving. Define metrics, design for scale, and make results actionable for researchers.
• Partner with research scientists to optimize and scale distributed training workflows. Integrate experiment tracking and reproducibility into the model lifecycle from day one.
• Design and maintain a versioned, high-quality training data store that accelerates model development and supports rapid iteration.
• Build automated pipelines spanning data preparation, model training, validation, and deployment, enabling fast experimentation and reproducible outcomes.
• Contribute to tooling and infrastructure that powers high-throughput, high-accuracy data annotation at scale.
• Develop production ML services that treat models as products, with reliability, observability, and continuous improvement built in.
• Maintain and evolve a robust data storage and access layer (S3 data lake, Delta Lake) underpinning annotation, evaluation, and training workflows.
• Build scalable, reliable data collection pipelines supporting diverse vehicle dispatch missions.
• Develop foundational services and packages that provide clean, performant access to autonomous driving data across the stack.
Qualifications:
Required:
• Educational Background: Bachelor's or Master's in Computer Science, or equivalent practical experience.
• Programming Skills: Strong proficiency in Python; working knowledge of C++.
• ML/DL Infrastructure Experience: Demonstrated hands-on experience building or scaling at least one of the following in a production environment:
  • Evaluation platforms: automated model benchmarking, metric computation, and regression tracking across model versions.
  • Training infrastructure: distributed training pipelines, experiment tracking, and model lifecycle management (e.g. W&B, MLflow, ClearML).
  • Dataset curation & feature stores: versioned dataset management, data lineage, and tooling for high-quality training data at scale.
  • Annotation platforms: tooling or pipelines that support high-throughput, high-accuracy labeling workflows.
• Distributed Systems: Strong experience with distributed computing and container orchestration (Kubernetes, Spark, or comparable frameworks).
• Ability to operate independently: scope ambiguous problems, make sound architecture decisions, and drive them to completion.
Preferred:
• C++ experience in performance-sensitive or safety-critical applications.
• Full-stack service development experience.
• Prior work in autonomous driving or robotics.
Company:
Transforming American Transportation with Autonomous Trucks. Founded in 2023, the company is headquartered in Houston, USA, with a team of 51-200 employees. The company is currently in the growth stage.