Data Pipeline Technician - Pennsauken, NJ
Salary Range $28.00 - $30.00 Hourly
Description
Summary
Data Pipeline Technicians execute and maintain the data pipeline processes designed by engineers, ensuring timely and accurate data delivery for AI development.
Responsibilities
- Plan and execute data collection activities, including log road trips to gather real-world driving data across varied environments.
- Operate and maintain specialized data collection vehicles and equipment.
- Ingest, clean, and prepare data for annotation and AI training.
- Perform maintenance of the AI fleet and verify quality of annotated data.
- Ensure timely and accurate delivery of data to engineering teams.
- Support operational needs of Engineering and other departments by providing reliable data and technical assistance.
Impact
Their reliability and attention to detail ensure the smooth operation of OEMs data-driven AI development, while also supporting data needs across the organization.
Required Skills
- Data Collection and Management – ability to plan and execute large-scale data collection activities, including real-world driving scenarios.
- Vehicle and Equipment Operation – skilled in operating and maintaining specialized data collection vehicles and hardware.
- Data Processing – Experience in ingesting, cleaning, and preparing data for annotation and AI training.
- Quality Assurance – ability to verify accuracy and completeness of annotated data.
- Technical Reliability – ensure timely and accurate delivery of data to engineering teams.
- Attention to Detail – maintain precision in data handling and equipment maintenance.
- Collaboration and Support – Provide technical assistance to engineering and other departments.
Preferred Skills
- Familiarity with AI development workflows and data annotation processes.
- Basic knowledge of data pipeline tools and scripting (e.g. Python, Bash).
- Understanding of vehicle systems and sensor technologies.
- Experience with Fleet management and preventative maintenance practices.
- Ability to troubleshoot hardware/software issues in data collection environments.