Job Summary:
Building Services 32BJ Benefit Funds is responsible for administering various benefits to SEIU 32BJ members. The Data Engineer will play a key role in delivering data-driven products to support the organization's mission, focusing on data collection, movement, storage, and transformation.
Responsibilities:
• Works with Health Fund Analytics, Operations, and IT to design, develop, maintain, and optimize complex data pipelines supporting both on-premises and Azure cloud environments
• Migrates and integrates data from disparate internal and external sources into centralized Azure cloud and on-premises data warehouse solutions using established data engineering best practices
• Uses SQL, Azure Data Factory, Databricks, Python, and other data transformation tools to develop and automate scalable ETL/ELT processes for ingesting, transforming, and loading data from multiple vendors into centralized data platforms
• Designs and implements resilient ingestion pipelines capable of handling schema drift, missing or invalid fields, inconsistent vendor formats, and evolving source system structures
• Builds scalable, flexible, and extensible data models that support evolving business requirements, onboarding of new vendors, and downstream analytics/reporting needs
• Implements and maintains medallion architecture principles with clear separation of raw, refined, and curated data layers
• Diagnoses and resolves performance bottlenecks impacting pipeline efficiency, reporting processes, and downstream data consumers across SQL Server and Databricks environments
• Supports and optimizes data workflows across hybrid on-premises and cloud-based platforms during ongoing cloud migration initiatives
• Translates operational and business requirements into scalable, maintainable, and efficient data engineering solutions
• Anticipates and mitigates risks related to vendor data variability, schema evolution, and data quality issues to ensure data reliability and continuity
• Prioritizes and manages technical debt to improve platform stability, maintainability, scalability, and delivery efficiency
• Generates data subsets, semantic models, APIs, variables, and reusable datasets required for integration with internal applications, analytics tools, and public-facing platforms
• Works collaboratively with IT and Operations teams to evaluate, implement, and support scalable cloud-based solutions, including Azure and Dynamics 365 technologies
• Supports implementation and ongoing maintenance of enterprise Data Governance policies, standards, and data management best practices within assigned domains
• Supports data engineering operations through proactive monitoring, alerting, troubleshooting, debugging, and maintenance activities to minimize downtime and ensure data quality
• Interfaces with internal stakeholders and external vendor IT teams to resolve data quality issues and ensure HIPAA-compliant data handling, transfer, and storage practices
• Creates and maintains clear technical documentation, including data dictionaries, schemas, user guides, quick-start materials, and workflow/process documentation
• Provides training sessions, tutorials, and ongoing support to analysts and business users on data access, query development, reporting tools, and available data resources
• Serves as a subject matter expert on internal and external data sources, data architecture, and enterprise data management practices
Qualifications:
Required:
• 1–3 years of professional experience in data engineering, data integration, data warehousing, analytics engineering, or related technical disciplines
• Demonstrated ability to support scalable data pipelines, data transformation processes, and cloud-based data initiatives within collaborative technical environments
• Strong understanding of software engineering best practices applied to data engineering, including modular design, automated testing, CI/CD, version control, and idempotent processing
• Advanced SQL development skills, including stored procedures, functions, triggers, query optimization, indexing strategies, and performance troubleshooting across large-scale datasets
• Strong understanding of modern data engineering concepts and architecture, including ETL/ELT frameworks, pipeline orchestration, data modeling, schema evolution, batch ingestion patterns, and medallion architecture principles within Databricks/Delta Lake environments
• Proficiency in Python (preferred) for developing scalable, maintainable, and production-ready data pipelines within Databricks
• Experience working within the Azure ecosystem, especially Azure Databricks, Delta Lake, and cloud migration initiatives
• Experience working within Agile/DevOps delivery environments and cross-functional technical teams
• Prior experience working with healthcare claims data and understanding healthcare/benefits domain concepts, including claims, eligibility, providers, and benefit fund cost drivers
• Bachelor’s degree in Computer Science, Information Systems, Data Science, Engineering, or a related technical field, or equivalent combination of education and hands-on experience
• Strong verbal and written communication skills in English, including the ability to read, write, understand, and effectively communicate technical and business information
Preferred:
• Proficiency in Python for developing scalable, maintainable, and production-ready data pipelines within Databricks
Company:
The Building Service 32BJ Benefit Funds is the umbrella organization responsible for administering benefits, including Health, Pension, Retirement Savings, Training and Legal Services benefits to over 100,000 SEIU 32BJ members. Founded in 2015, the company is headquartered in New York, USA, with a team of 501-1000 employees. The company is currently Late Stage.