Data Engineer - I
The Data Engineer - I will be responsible for building and maintaining optimal data pipeline and reporting architecture for the next-generation Data Warehouse, reporting, and data analytics environment. The individual will be part of a team responsible for building, managing, and monitoring all data transformation and data loads into the Data Lake hosted on AWS.
Key Responsibilities
- Identify, design, and implement data process improvements by automating manual processes, optimizing data delivery, and redesigning infrastructure for greater scalability.
- Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of structured and unstructured data sources using SQL and AWS 'big data' technologies.
- Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics.
- Work with internal stakeholders, including executives, to assist with data-related technical issues and support their data needs.
- Work with data and analytics experts to strive for greater functionality in our data systems.
- Assist in developing data model architecture for the next generation management reporting system.
- Perform ongoing maintenance of the data warehouse and reporting infrastructure.
Minimum Qualifications
- Bachelor's degree in Information Systems, Computer Science, Computer Engineering, Statistics, Mathematics or related areas.
- Preference will be given to candidates with one or more industry certifications: AWS Developer Associate, Certified Big Data Specialty, Tableau Desktop Certified Associate, Certified Associate in Python Programming (PCAP), or an Apache Spark certification.
- 1-2 years of programming experience in data engineering, analytics, data visualization, SQL, and related technologies.
Key skills, knowledge & experience
Deep understanding of and experience with the following technologies and programming environments:
- Apache Spark and PySpark, Hadoop
- SQL and any NoSQL and RDBMS platforms
- Python, R, Java, JavaScript, PL/SQL, XML, XSLT
- GitHub or similar shared code repository
- Tableau
- AWS cloud services
- DevOps practices
Optional skills, knowledge & experience
Individuals with skills, knowledge and experience in the following applications and services will be given special consideration:
- Salesforce CRM
- Workday HRM and ERP system
- Seismic content management
- Google AdWords, Google Analytics, Google Tag Manager
- A self-motivated problem solver who has built data transformation processes to support business intelligence applications in a large environment will be given preference.
- Excellent organizational, diagnostic, and problem-solving abilities, as well as strong written and verbal communication skills.
- Energetic and able to communicate ideas clearly and deliver results.