Job Summary:
New York Blood Center (NYBC) has been serving the tri-state area for over 60 years, delivering lifesaving blood products and services. As a Senior Data Engineer, you will design and deliver complex data engineering solutions, drive technical decisions, and mentor Data Engineers while ensuring the reliability and scalability of the data platform across integrated enterprise source systems.
Responsibilities:
• Architect, build, and own complex data pipelines for high-volume, high criticality workstreams across NYBCe's enterprise data platform.
• Lead the design and implementation of ELT/ETL frameworks using SQL, Python, Azure Data Factory, Databricks, and Azure Synapse Analytics.
• Establish pipeline reliability standards—monitoring, alerting, error handling, and recovery protocols—and ensure adherence across the team.
• Drive the design of scalable data models supporting dimensional warehousing, data lake architectures on Azure.
• Contribute to architectural decisions on data storage, partitioning, compute optimization, and consumption layer design.
• Lead migrations from legacy data solutions to modern cloud-native platforms, managing risk and business continuity throughout.
• Design and deliver feature pipelines and data preparation frameworks that support machine learning model development and deployment.
• Partner with Data Scientists to translate model requirements into production-grade data assets and feature stores.
• Collaborate with Analytics Engineers to ensure data models are optimized for analytical consumption and reporting performance.
• Define and implement data quality frameworks—validation rules, SLAs, anomaly detection, and automated testing for pipeline outputs.
• Lead data governance initiatives including metadata management, lineage tracking, data cataloging (Microsoft Purview), and access control.
• Ensure platform compliance with HIPAA, NYBCe data policies, and applicable regulatory requirements.
• Mentor Data Engineers—providing code reviews, technical guidance, and architectural feedback that elevates team capability.
• Contribute to DAPI's engineering standards, reusable frameworks, and technical documentation.
• Participate in Agile ceremonies and model strong engineering discipline—clear DevOps hygiene, sprint commitment, and delivery accountability.
Qualifications:
Required:
• Bachelor’s degree in computer science, Data Science, Information Technology, or a related quantitative field.
• 6+ years of progressive experience in data engineering with demonstrated ownership of complex, production-grade data platforms.
• Expert-level SQL (query optimization, indexing strategy, execution plans) and Python (PySpark, pipeline frameworks, testing).
• Deep hands-on experience with Azure data services: Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Azure Data Lake Storage.
• Proven experience designing dimensional data models and data lake architecture at enterprise scale.
• Experience building data pipelines that directly support machine learning feature engineering and model serving.
• Strong background in data quality engineering—automated validation, SLA enforcement, and lineage tracking.
• Experience with relational databases (SQL Server, Oracle) and migration from legacy to cloud-native platforms.
Preferred:
• Microsoft Certified: Azure Data Engineer Associate
• Databricks Certified Associate Developer for Apache Spark
Company:
New York Blood Center is a blood collection and distribution organization. Founded in 1964, the company is headquartered in New York, USA, with a team of 1001-5000 employees. The company is currently Late Stage.