1

Databricks Delta Lake Jobs (NOW HIRING)

Azure Databricks Engineer

Iselin, NJ · On-site

$61 - $79.25/hr

Strong expertise in Databricks (Delta Lake, Unity Catalog, Lakehouse Architecture, Table Triggers, Delta Live Pipelines, Databricks Runtime etc.) * Proficiency in Azure Cloud Services. * Solid ...

Sr. Data Engineer - Databricks

$117K - $140K/yr

Implement and manage data solutions using Databricks, Delta Lake, and Unity Catalog. * Ensure data quality, reliability, and performance across large-scale and complex datasets. * Collaborate with ...

Lead Data Engineer

Charlotte, NC · On-site

$100K - $131K/yr

... Databricks.  * Optimize ETL/ELT workflows, ensuring efficiency in data processing, storage, and retrieval.  * Leverage Apache Spark, Delta Lake, and Azure-native services to build high ...

Expertise in Databricks Delta Lake and Lakehouse Architecture . * Strong experience in designing and implementing scalable BI reports in Tableau . * Expert-level skills in architecting solutions on ...

Azure Databricks Engineer

Fairfax, VA · On-site

$59.50 - $77.50/hr

Design, build, and maintain scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and Apache Spark to process large volumes of structured and unstructured data. * Integrate data from ...

Databricks Lead/Architecture

$56.50 - $77.50/hr

Define and design end-to-end data Lakehouse architecture leveraging Databricks, Delta Lake, and cloud-native services. * Create reference architectures for batch, real-time, and streaming data ...

Azure Databricks Engineer

Fairfax, VA · On-site

$96K - $132K/yr

Design, build, and maintain scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and Apache Spark to process large volumes of structured and unstructured data. * Integrate data from ...

Data Engineer with AI - Remote

Boston, MA · On-site +1

$124K - $149K/yr

Design, build, and maintain batch/streaming pipelines in Python + PySpark on Databricks (Delta Lake, Autoloader, Structured Streaming). * Implement data models (Bronze/Silver/Gold), optimize with ...

Senior Databricks Platform Engineer

Arlington, VA · On-site

$120K - $165K/yr

Build and configure Databricks workspaces, clusters, notebooks, jobs, and Delta Lake storage, integrating with AWS services such as S3, IAM, and KMS. * Implement and maintain security controls ...

Build and configure Databricks workspaces, clusters, notebooks, jobs, and Delta Lake storage, integrating with AWS services such as S3, IAM, and KMS. * Implement and maintain security controls ...

Senior Databricks Platform Engineer

Arlington, VA · On-site

$120K - $165K/yr

Build and configure Databricks workspaces, clusters, notebooks, jobs, and Delta Lake storage, integrating with AWS services such as S3, IAM, and KMS. * Implement and maintain security controls ...

Work with cutting-edge tools like Databricks, Delta Lake, and PySpark to transform raw data into actionable insights for America's toughest challenges. Due to the nature of work performed within this ...

Data Engineer/ Python/Pyspark/ SQL/ ETL/ DataLake/Databricks/Delta Lake Position Summary: The Lakehouse Developer is responsible for designing, implementing, and maintaining data lakehouse solutions ...

... Delta Lake to transform raw (bronze) data into trusted curated (silver) and analytics-ready (gold) data layers. • Operationalize Databricks Workflows for orchestration, dependency management, and ...

next page

Showing results 1-20

Databricks Delta Lake information

See salary details

$21.5K

$50.1K

$73.5K

How much do databricks delta lake jobs pay per year?

As of Jun 8, 2026, the average yearly pay for databricks delta lake in the United States is $50,117.00, according to ZipRecruiter salary data. Most workers in this role earn between $38,500.00 and $59,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Databricks Delta Lake Engineer, and why are they important?

To thrive as a Databricks Delta Lake Engineer, you need expertise in big data engineering, SQL, data modeling, and cloud computing, typically supported by a background in computer science or data engineering. Familiarity with Databricks, Apache Spark, Delta Lake, and cloud platforms like AWS or Azure, along with relevant certifications such as Databricks Certified Data Engineer, is essential. Strong problem-solving abilities, communication skills, and adaptability help you collaborate with cross-functional teams and address evolving data challenges. These competencies ensure robust, scalable, and reliable data pipelines that support critical business analytics and decision-making.

What is the difference between Databricks Delta Lake vs Data Engineer?

AspectDatabricks Delta LakeData Engineer
Primary FocusManaging and optimizing data lakes with ACID transactions and scalable storageDesigning, building, and maintaining data pipelines and infrastructure
Required SkillsSQL, Spark, cloud platforms, data modeling, Delta Lake architectureSQL, ETL tools, programming (Python, Scala), data warehousing
Work EnvironmentCloud-based data platforms, big data environmentsData warehouses, cloud or on-premises data infrastructure

While Databricks Delta Lake focuses on managing scalable, reliable data lakes with ACID compliance, Data Engineers build and maintain the pipelines and infrastructure that enable data analysis. Both roles often collaborate but have distinct technical focuses within data management.

What is Databricks Delta Lake?

Databricks Delta Lake is an open-source storage layer that brings reliability, performance, and ACID (Atomicity, Consistency, Isolation, Durability) transactions to data lakes. It enables users to build robust and scalable data pipelines by providing features like schema enforcement, time travel, and unified batch and streaming data processing. Delta Lake helps organizations manage large volumes of data efficiently, ensuring data quality and consistency for analytics and machine learning workloads.

What are some common challenges faced by Databricks Delta Lake engineers when implementing data pipelines, and how can they be addressed?

Databricks Delta Lake engineers often encounter challenges related to data consistency, schema evolution, and handling large-scale streaming data. Ensuring ACID compliance while enabling efficient batch and streaming workloads can be complex, especially when dealing with frequent schema changes or high-volume data ingestion. To address these challenges, engineers typically use Delta Lake's built-in features such as schema enforcement, time travel, and optimized file management. Collaborating closely with data engineering and DevOps teams helps ensure robust pipeline monitoring, quick issue resolution, and continuous optimization.
Infographic showing various Databricks Delta Lake job openings in the United States as of May 2026, with employment types broken down into 1% Locum Tenens, 81% Full Time, 17% Part Time, and 1% Contract. Highlights an 94% Physical, 2% Hybrid, and 4% Remote job distribution, with an average salary of $50,117 per year, or $24.1 per hour.
Senior Data Lakehouse Architect (Databricks), Vice President

Senior Data Lakehouse Architect (Databricks), Vice President

State Street

Boston, MA • On-site

$73 - $97.75/hr

Full-time

Posted 23 days ago


Job description

Job Summary:
State Street is seeking a Senior Data Lakehouse Architect to design and lead the build-out of a Legal Data Lakehouse platform on AWS and Databricks. This role will drive the architecture, engineering, and governance of scalable, secure, and compliant data capabilities supporting legal operations, contract intelligence, eDiscovery, and AI/ML use cases.
Responsibilities:
• Define and implement the end-to-end Legal Data Lakehouse architecture using Databricks (Delta Lake, Unity Catalog, Workflows) on AWS
• Design multi-layered data architecture (Bronze, Silver, Gold) to support:
• Contract metadata and document ingestion
• Legal matter management data
• eDiscovery datasets
• External regulatory and compliance feeds
• Establish scalable ingestion frameworks (batch and streaming) for structured and unstructured legal data (PDFs, contracts, emails)
• Lead development of ETL/ELT pipelines using Databricks, Spark, and Python/SQL
• Integrate with enterprise platforms, including:
• Contract lifecycle management systems
• AI platforms and LLM pipelines
• Document repositories and enterprise content systems
• Design patterns for extracting structured data from unstructured legal documents and persisting into Delta Lake
• Enable downstream integration with enterprise data platforms, analytics tools, and AI/ML pipelines
• Implement data governance frameworks using Databricks Unity Catalog and AWS-native controls (IAM, KMS)
• Establish:
• Fine-grained access controls (row/column-level security)
• Data lineage and auditability
• Ensure compliance with:
• Data privacy regulations (e.g., GDPR)
• Internal security and audit requirements
• Partner with IAM teams to integrate with enterprise identity providers (e.g., Entra ID / Azure AD)
• Architect data models supporting:
• Contract analytics, clause extraction, and obligation tracking
• Legal AI use cases (contract review, litigation insights, compliance monitoring, legal spend analytics)
• Design search and retrieval architectures (RAG) for enterprise legal knowledge bases
• Enable entity extraction and knowledge graph frameworks
• Integrate with LLM/GenAI platforms to support capabilities such as document summarization, Q&A, and workflow automation
• Establish CI/CD pipelines and infrastructure-as-code (Terraform, Git-based workflows)
• Define standards for:
• Code quality and versioning
• Environment promotion (Dev / QA / Prod)
• Implement observability and alerting for platform health and reliability
• Partner with Legal and Technology leadership to define platform roadmap and priorities
• Provide architectural governance and design oversight
• Mentor data engineers and platform teams
• Translate business and legal requirements into scalable, enterprise-grade solutions
• Operate within a federated data and platform model, collaborating across engineering, security, and domain teams
Qualifications:
Required:
• 10+ years of experience in data architecture, engineering, or analytics platforms
• 5+ years of hands-on experience with Databricks and Apache Spark
• Strong experience with AWS-based data platforms
• Expertise in data governance, security, and compliance in regulated environments
• Experience working with unstructured data and NLP/document processing pipelines
• Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related technical discipline
• Relevant certifications strongly preferred: Databricks Certified Data Engineer / Architect, AWS Certified Solutions Architect (Associate or Professional)
• Define and implement the end-to-end Legal Data Lakehouse architecture using Databricks (Delta Lake, Unity Catalog, Workflows) on AWS
• Design multi-layered data architecture (Bronze, Silver, Gold) to support contract metadata and document ingestion, legal matter management data, eDiscovery datasets, external regulatory and compliance feeds
• Establish scalable ingestion frameworks (batch and streaming) for structured and unstructured legal data (PDFs, contracts, emails)
• Lead development of ETL/ELT pipelines using Databricks, Spark, and Python/SQL
• Integrate with enterprise platforms, including contract lifecycle management systems, AI platforms and LLM pipelines, document repositories and enterprise content systems
• Design patterns for extracting structured data from unstructured legal documents and persisting into Delta Lake
• Enable downstream integration with enterprise data platforms, analytics tools, and AI/ML pipelines
• Implement data governance frameworks using Databricks Unity Catalog and AWS-native controls (IAM, KMS)
• Establish fine-grained access controls (row/column-level security), data lineage and auditability
• Ensure compliance with data privacy regulations (e.g., GDPR) and internal security and audit requirements
• Partner with IAM teams to integrate with enterprise identity providers (e.g., Entra ID / Azure AD)
• Architect data models supporting contract analytics, clause extraction, and obligation tracking, legal AI use cases (contract review, litigation insights, compliance monitoring, legal spend analytics)
• Design search and retrieval architectures (RAG) for enterprise legal knowledge bases
• Enable entity extraction and knowledge graph frameworks
• Integrate with LLM/GenAI platforms to support capabilities such as document summarization, Q&A, and workflow automation
• Establish CI/CD pipelines and infrastructure-as-code (Terraform, Git-based workflows)
• Define standards for code quality and versioning, environment promotion (Dev / QA / Prod)
• Implement observability and alerting for platform health and reliability
• Partner with Legal and Technology leadership to define platform roadmap and priorities
• Provide architectural governance and design oversight
• Mentor data engineers and platform teams
• Translate business and legal requirements into scalable, enterprise-grade solutions
• Operate within a federated data and platform model, collaborating across engineering, security, and domain teams
Preferred:
• Strong hands-on experience with the Databricks Lakehouse platform, including Delta Lake, Unity Catalog, Workflows, and MLflow
• Deep expertise in AWS data platform services, including S3, Glue, EMR, Lambda, Redshift, and IAM
• Proven experience architecting and delivering enterprise-scale data lakehouse solutions on AWS using Databricks
• Advanced proficiency in Apache Spark (PySpark/Scala), SQL, and Python
• Strong understanding of data governance and security, including access controls, metadata management, and encryption (KMS, CMK/BYOK)
• Experience building end-to-end data pipelines (batch and streaming) and supporting AI/ML workloads within a lakehouse architecture
• Experience in Legal, Compliance, Financial Services, or other regulated industries
• Understanding of legal data constructs, including contracts, clauses, obligations, and matters
• Experience supporting legal AI use cases (contract analytics, document summarization, compliance monitoring)
• Experience handling sensitive data in highly regulated, audit-driven environments
Company:
State Street offers a range of financial services, including investment management, research and trading, as well as asset management. Founded in 1792, the company is headquartered in Boston, USA, with a team of 10001+ employees. The company is currently Late Stage.

State Street logo

About State Street

Sourced by ZipRecruiter

State Street is one of the largest custodian banks, asset managers and asset intelligence companies in the world. From technology to product innovation, we're making our mark on the financial services industry. For more than two centuries, we've been helping our clients safeguard and steward the investments of millions of people. We provide investment servicing, data & analytics, investment research & trading and investment management to institutional clients.

Industry

Finance and insurance

Company size

10,000+ Employees

Headquarters location

Boston, MA, US

Year founded

1792

Social media