For more details contact at Ella.s@iserveworld.com
Job Title: AI Data Engineer
Location: Moline,IL
Pay Rate:USD$60.00 $65.00 /hr.with benefits
Key Responsibilities
Data Engineering: Design, develop, and manage data pipelines to handle and process large datasets with a focus on scalability and efficiency.
Langchain Integration: Utilize Langchain to build and optimize AI workflows, automate processes, and enhance AI model capabilities with data-driven features.
Cloud Data Infrastructure: Manage and scale data infrastructure on AWS, using services such as S3, EC2, Lambda, Redshift, and others.
SQL & Database Expertise: Write complex SQL queries for data extraction, transformation, and analysis. Optimize database performance and ensure data integrity.
Collaboration: Work closely with Data Scientists, AI Engineers, and other stakeholders to ensure data is properly structured for machine learning models and other analytics use cases.
Data Quality & Governance: Monitor data quality, create processes for error handling, and ensure compliance with data privacy regulations and best practices.
Performance Optimization: Continuously optimize and fine-tune data processing pipelines and database queries for better performance and scalability.
Documentation: Maintain detailed documentation on the architecture, data processes, and code to ensure smooth collaboration and knowledge transfer.
Requirements
3+ years of experience as a Data Engineer, with a focus on AI or machine learning data pipelines.
Hands-on experience with Langchain for building and optimizing AI workflows and automation.
Strong experience with large-scale data management, including working with distributed systems and processing massive datasets.
Proficient in SQL and experience with relational databases (e.g., PostgreSQL, MySQL, SQL Server) and NoSQL databases.
Extensive experience with AWS, including services like S3, EC2, Lambda, Redshift, and RDS.
What are the 3-4 non-negotiable requirements of this position?
Experience: 3+ years of experience as a Data Engineer, with a focus on AI or machine learning data pipelines. Hands-on experience with Langchain for building and optimizing AI workflows and automation. Strong experience with large-scale data management, including working with distributed systems and processing massive datasets. Proficient in SQL and experience with relational databases (e.g., PostgreSQL, MySQL, SQL Server) and NoSQL databases. Extensive experience with AWS, including services like S3, EC2, Lambda, Redshift, and RDS. Technical Skills: Proficient in Python Strong understanding of database design, data modeling, and query optimization. Familiarity with data warehousing concepts and tools. Exerience working with data pipelines and workflow orchestration tools like Apache Airflow, or similar. Knowledge of AI and machine learning concepts, particularly around data preprocessing and feature engineering. Education: Bachelor's or Master's degree in Computer Science, Data Engineering, Artificial Intelligence, or a related field.
What are the nice-to-have skills?
Preferred Qualifications: Experience with Apache Spark, Kafka, or other big data technologies. Familiarity with containerization (Docker, Kubernetes) for deploying scalable data solutions. Knowledge of AI-specific data workflows, including data preparation for natural language processing, computer vision, or other AI applications.