Role Overview:The Loss Prevention Tech team as a Data Engineer is an engineer with deep expertise in designing, building, and maintaining scalable data pipelines and analytics platforms.
The role involves working closely with business stakeholders, analysts, and engineering teams to enable data driven decision making using large scale datasets and AWS cloud technologies.
This role is critical in delivering reliable, high performance data solutions across batch and streaming workloads.
Key Responsibilities:- Work with business stakeholders and analysts to understand data requirements and translate them into technical solutions
- Design, build, and maintain highly available and distributed data pipelines for ingestion, transformation, and processing of large datasets
- Develop and optimize ETL / ELT workflows using cloud native data engineering tools
- Write efficient SQL queries and support data warehousing solutions for analytical workloads
- Build reusable, scalable programs using Python, Unix shell scripting, Java, or similar languages to solve complex data problems
- Design and support analytical data infrastructure enabling ad hoc access to large datasets and compute resources
- Integrate data from multiple structured and unstructured sources using SQL and cloud big data technologies
- Collaborate with cross functional teams including developers, PMs, and architects on end to end solution design
- Support testing, defect resolution, and production issues in collaboration with QA and operations teams
Required Skills:- Strong proficiency in SQL
- Hands on experience with Python and/or Unix shell scripting
- Experience building and maintaining ETL / ELT pipelines
- Familiarity with ETL frameworks
- Experience with cloud based data platforms AWS (Athena, EMR, Redshift, S3, Glue, Kinesis, Lambda)
- Exposure to distributed data processing and large scale datasets
- Understanding of data warehousing concepts
- Ability to design datasets for reporting, dashboards, and analytics use cases
Preferred Skills and Qualifications:- Experience with streaming data pipelines
- Knowledge of data modelling and performance optimization
- Exposure to DevOps / CI CD practices for data pipelines
- Bachelor's degree in computer science or equivalent