Job Summary:
Zappos is a company focused on transforming data into actionable insights, and it is seeking a Data Engineer to design, develop, and maintain its data infrastructure. The role involves working with cross-functional teams to ensure data is collected, processed, and made available for analysis and reporting, while also optimizing data pipelines and ensuring data quality.
Responsibilities:
• Design, build, and maintain robust data pipelines to acquire, process, and store data from various sources such as databases, APIs, and external data providers.
• Develop and optimize ETL (Extract, Transform, Load) processes to clean, enrich, and structure raw data into a usable format for analysis and reporting.
• Implement and manage data warehousing solutions to ensure efficient data storage, retrieval, and query performance.
• Establish data quality standards, perform data validation, and proactively identify and address data quality issues.
• Optimize data pipelines and storage solutions to handle large volumes of data while maintaining high performance and reliability.
• Ensure data privacy and security by implementing access controls, encryption, and compliance with data protection regulations.
• Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and provide the necessary data infrastructure to support their needs.
• Maintain comprehensive documentation for data pipelines, data models, and processes to facilitate knowledge sharing and troubleshooting.
• Implement monitoring solutions to proactively detect and address data pipeline failures or performance bottlenecks.
• Keep abreast of industry trends and emerging technologies in data engineering to recommend and implement improvements to our data infrastructure.
Qualifications:
Required:
• 3+ years of data engineering experience
• Experience with data modeling, data warehousing, and building ETL pipelines
• Knowledge of distributed systems as they pertain to data storage and computing
• Bachelor's degree
Preferred:
• Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
• Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
• Experience using generative AI tools to enhance workflow efficiency, with a willingness to learn effective prompting and evaluation practices.
• Ability to recognize opportunities where generative AI could enhance products, workflows, or customer experiences.
Company:
Zappos is a reference in Consulting and Development of Channels, Platforms, and Digital Products. Founded in 1999, the company is headquartered in Las Vegas, USA, and has 1001-5000 employees. It is currently a late-stage company.