Job Description
Title: Data Developer
Location: Hybrid (Charlotte, NC)
Contract Duration: 12 Months
Summary:
We are seeking a Mid-Level Data Engineer (5–7 years of experience) to design, develop, and maintain scalable data solutions within a Microsoft Fabric environment. This role focuses on building robust lakehouse architectures, developing end-to-end data pipelines, and delivering high-quality, analytics-ready datasets. The ideal candidate will have strong experience with Microsoft Fabric, Apache Spark, Python, and modern data engineering practices, and will integrate data from multiple enterprise sources for reporting and analytics.
Responsibilities:
Lakehouse Architecture & Platform Engineering
- Design, build, and maintain scalable lakehouse architecture using Microsoft Fabric and OneLake
- Ensure high availability, performance, and reliability of the data platform
Data Pipelines & Ingestion
- Develop and maintain end-to-end data pipelines for data ingestion, transformation, and serving
- Build scalable ingestion frameworks for batch and real-time data sources (APIs, databases, event streams)
- Integrate data from enterprise systems such as Jira, ERP, CRM, flat files, and streaming platforms
Data Processing & Transformation
- Develop and execute large-scale Spark jobs (batch and streaming)
- Author and maintain notebooks using PySpark and SQL for data transformation and analysis
- Implement data cleansing, enrichment, and transformation logic
Data Quality & Integration
- Build ingestion pipelines for structured, semi-structured, and unstructured data
- Implement data validation rules and quality control mechanisms
Data Enablement
- Deliver analytics-ready datasets for Power BI and downstream reporting systems
Experience:
- 5–7 years of experience in Data Engineering or a related field
- Strong hands-on experience with Microsoft Fabric (Lakehouse, OneLake, Pipelines, Notebooks)
- Expertise in Apache Spark (PySpark, Spark SQL)
- Proficiency in SQL and Python
- Experience with Delta Lake / Delta Tables
- Strong understanding of ETL/ELT pipeline design and implementation
- Experience integrating APIs and streaming data sources
- Familiarity with enterprise data systems (ERP, CRM, Jira) is a plus
- Experience supporting analytics and reporting tools such as Power BI