1

Trainee Databricks Data Engineer Jobs (NOW HIRING)

Databricks Data Engineer

Manassas Park, VA

$113K - $135.70K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Wildwood, MO

$107.40K - $129K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Dallas, TX

$113.70K - $136.60K/yr

Databricks Data Engineer Position: Databricks Data Engineer Type: On-site Location: Dalls, Texas (Local or ready to relocate is fine) Client: Through CTS JD: Minimum 8 to 10+ years of working ...

Databricks Data Engineer

San Antonio, TX

$104.10K - $125K/yr

Databricks Data Engineer Position: Databricks Data Engineer Type: On-site Location: Dalls, Texas (Local or ready to relocate is fine) Client: Through CTS JD: Minimum 8 to 10+ years of working ...

Databricks Data Engineer

Manassas, VA · On-site

$114.50K - $137.50K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Manassas, VA · On-site

$114.50K - $137.50K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Manassas, VA · On-site

$114.50K - $137.50K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Manassas, VA

$114.50K - $137.50K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Manassas, VA

$114.50K - $137.50K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Marlborough, NH · On-site

$113.20K - $136K/yr

This role is for a Databricks Data Engineer with experience in designing, developing, and deploying robust, scalable batch and streaming data pipelines using PySpark, Spark SQL, and Delta Live Tables.

Databricks Data Engineer

Vienna, VA

$114.90K - $138K/yr

Who we are looking for We are seeking a Databricks Data Engineer for a full-time position supporting our various clients We are committed to your growth and success in the IT software development ...

Databricks Data Engineer

Costa Mesa, CA

$122.80K - $147.50K/yr

As a Databricks Data Engineer, you will support the design, build, and optimization of cloud-based data engineering solutions that enable large-scale transformation. You will work with business and ...

Databricks Data Engineer

Cincinnati, OH

$109.90K - $131.90K/yr

As a Databricks Data Engineer, you will support the design, build, and optimization of cloud-based data engineering solutions that enable large-scale transformation. You will work with business and ...

Databricks Data Engineer

Philadelphia, PA

$115.50K - $138.70K/yr

As a Databricks Data Engineer, you will support the design, build, and optimization of cloud-based data engineering solutions that enable large-scale transformation. You will work with business and ...

Databricks Data Engineer

Tempe, AZ

$109.70K - $131.70K/yr

As a Databricks Data Engineer, you will support the design, build, and optimization of cloud-based data engineering solutions that enable large-scale transformation. You will work with business and ...

Databricks Data Engineer

Vienna, VA · Remote

$114.90K - $138K/yr

Databricks Data Engineer Engagement Type: FTE Grade: 6 Location: REMOTE Note: This Position Is Not Eligible For Immigration Sponsorship At This Time. Healthcare Industry is mandatory. Healthcare ...

Databricks Data Engineer

Denver, CO

$117.90K - $141.50K/yr

As a Databricks Data Engineer, you will support the design, build, and optimization of cloud-based data engineering solutions that enable large-scale transformation. You will work with business and ...

Databricks Data Engineer

Houston, TX

$109.30K - $131.30K/yr

As a Databricks Data Engineer, you will support the design, build, and optimization of cloud-based data engineering solutions that enable large-scale transformation. You will work with business and ...

Databricks Data Engineer Columbus, Ohio, United States About the Job Title: Databricks Data Engineer (Senior Manager / AVP) Location: Columbus, OH (1st choice), Remote (2nd choice) Experience ...

next page

Showing results 1-20

Trainee Databricks Data Engineer information

See salary details

$44.5K

$129.7K

$177.5K

How much do trainee databricks data engineer jobs pay per year?

As of Jun 1, 2026, the average yearly pay for trainee databricks data engineer in the United States is $129,716.00, according to ZipRecruiter salary data. Most workers in this role earn between $114,500.00 and $137,500.00 per year, depending on experience, location, and employer.

What is the difference between Trainee Databricks Data Engineer vs Junior Data Engineer?

AspectTrainee Databricks Data EngineerJunior Data Engineer
Required CredentialsBasic knowledge of Databricks, SQL, and data fundamentalsDegree in Computer Science or related field, some experience with data tools
Work EnvironmentTraining programs, mentorship, entry-level projects on Databricks platformEntry-level to mid-level data teams, real-world data projects
Employer & Industry UsageTech companies, data consulting firms, startups focusing on cloud data platformsVariety of industries including finance, healthcare, retail, with data teams

The Trainee Databricks Data Engineer is an entry-level role focused on learning Databricks and data engineering fundamentals, often within training programs. In contrast, a Junior Data Engineer typically has some hands-on experience and works on real data projects. Both roles are common in tech-driven industries, but the trainee position emphasizes skill development, while the junior role involves more independent work.

More about Trainee Databricks Data Engineer jobs
What cities are hiring for Trainee Databricks Data Engineer jobs? Cities with the most Trainee Databricks Data Engineer job openings:
What are the most commonly searched types of Databricks Data Engineer jobs? The most popular types of Databricks Data Engineer jobs are:
What states have the most Trainee Databricks Data Engineer jobs? States with the most job openings for Trainee Databricks Data Engineer jobs include:
Infographic showing various Trainee Databricks Data Engineer job openings in the United States as of May 2026, with employment types broken down into 99% Full Time, and 1% Temporary. Highlights an 16% Physical, 79% Hybrid, and 5% Remote job distribution, with an average salary of $129,716 per year, or $62.4 per hour.
Databricks Data Engineer

Databricks Data Engineer

W. R. Berkley

Manassas Park, VA

$113K - $135.70K/yr

Other

Medical, Dental, Vision, Life, Retirement, PTO

Posted 24 days ago


W.R. Berkley rating

8.2

Company rating: 8.2 out of 10

Based on 6 frontline employees who took The Breakroom Quiz

123rd of 259 rated insurance


Job description

Databricks Data Engineer

This position requires on-site work Monday–Thursday at either our Manassas, VA or Chesterfield, MO location.

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision advantage at scale. This role will work with cutting-edge tools like Databricks, Delta Lake, PySpark, and AI/BI genie to transform raw data into actionable insights. As a hands-on Databricks Data Engineer with deep expertise in Azure Databricks and MLOps, this role will have the opportunity to migrate and translate legacy SSIS ETL logic into scalable, cloud-native data pipelines in Databricks. This role will partner with data engineers, data scientists, and product manager to design features, train/evaluate models, and deploy them to production using MLflow, Databricks and Workflows—with rigorous observability, governance (Unity Catalog), and CI/CD automation.

Data Pipeline Engineering

  • Design, build, and maintain high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark.
  • Convert and modernize existing SSIS package logic into cloud-native Databricks pipelines using PySpark notebooks, Delta Live Tables (DLT), and Databricks Workflows.
  • Implement reliable batch and streaming pipelines with robust data quality and validation frameworks.
  • Optimize pipeline performance using Photon, efficient file formats, partitioning, Z-ordering, and caching strategies.

Lakehouse Platform Development

  • Develop and manage datasets within Delta Lake, ensuring ACID reliability, schema evolution, versioning, and time travel.
  • Architect feature-rich data layers including: Bronze (raw ingestion), Silver (validated, conformed), Gold (analytics-ready and ML-ready).
  • Implement data governance using Unity Catalog for fine-grained access control, lineage, auditability, and metadata management.

MLOps & ML-Enabled Data Pipelines

  • Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines.
  • Deploy and operationalize models using MLflow, Databricks Model Registry, and Databricks Workflows.
  • Use Databricks built-in AI SQL functions such as ai_query, ai_forecast, ai_analyze_sentiment to generate actionable insight from large amount of unstructured or structured raw data
  • Implement monitoring for: Pipeline failures, Data/feature drift, Model performance degradation, Operational SLAs/SLIs/SLOs
  • Build automated CI/CD workflows using GitHub Actions or Azure DevOps for notebook deployment, pipeline testing, and environment promotion.

Data Platform, Data Security & Data Governance

  • Collaborate with data engineers to design reliable data products on Delta Lake; leverage Delta Live Tables (DLT) for declarative pipelines when applicable.
  • Enforce Unity Catalog for lineage, permissions, and audit; manage secrets, tokens, and keys securely (e.g., Databricks secrets, Key Vault/Secrets Manager ).

Collaboration & Leadership

  • Work closely with cross-functional teams: data engineering, data scientist, product manager, and business stakeholders.
  • Serve as a Databricks SME—championing best practices, code standards, governance, and reusable frameworks.
  • Document architecture, workflows, data models, runbooks, and operational procedures.
Qualifications
  • Minimum of 3 years of experience in Databricks, PySpark notebooks, Python, DevOps, software development, and data engineering.
  • Certified Databricks Data Engineer Associate or Professional is a plus.

Skills & Competencies

  • Proficient in designing, building, deploying, and maintaining high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark Notebook.
  • Proficient in building, deploying, and operating production ML models such as supervised, unsupervised, and anomaly detection, including techniques for imbalanced datasets
  • Proficient with ML engineering and MLOps, including model versioning, CI/CD for ML, monitoring, drift detection, and automated retraining
  • Proficiency in Python including Pandas and PySpark Dataframes
  • Expert level of SQL skills including Stored Procedure, experience with SSIS, SSRS, Power BI is a plus.
  • Proficient with cloud data engineering platforms, such as Azure, Databricks, Spark, or SQL, and batch and streaming pipelines
  • Familiar with Databricks AI Built-In Functions such as AI_Query, AI_Gen, AI_Classify, AI_Forecast, AI_Analyze_Sentiment, able to use them to extract actionable insights from large amount of unstructured or structured raw data
  • Experience with Python and ML frameworks, such as PyTorch or TensorFlow
  • Experience improving data quality, lineage, and observability in enterprise data environments and operationalizing rules and model-driven scoring for prioritization, routing, or case selection
  • Experience with predictive analytics, machine learning and artificial intelligence desired.

Education

  • A Bachelor's degree in Computer Science, Management Information Systems, Engineering, Math, Physics, or a related quantitative field is required (4-year degree). Master's degree preferred
  • Experience in the commercial insurance industry is a plus.
Additional Company Details

The Company is an equal employment opportunity employer. We do not accept any unsolicited resumes from external recruiting firms. The company offers a competitive compensation plan and robust benefits package for full time regular employees. Base salary & Benefits include Health, dental, vision, life, disability, wellness, paid time off, 401(k) and profit-sharing plans. The actual salary for this position will be determined by a number of factors, including the scope, complexity and location of the role; the skills, education, training, credentials and experience of the candidate; and other conditions of employment.

Additional Requirements

• Ability to travel locally and nationally up to 5% of the time

Sponsorship Details

Sponsorship not Offered for this Role