2

Entry Level Databricks Data Engineer Jobs in Virginia

Databricks Data Engineer

Manassas, VA · On-site

$114K - $137K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Manassas, VA

$114K - $137K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Manassas, VA

$114K - $137K/yr

The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision ...

Databricks Data Engineer

Vienna, VA

$114K - $138K/yr

Who we are looking for We are seeking a Databricks Data Engineer for a full-time position supporting our various clients We are committed to your growth and success in the IT software development ...

Sr Databricks Data Engineer

Arlington, VA · On-site

$120K - $165K/yr

Share this job: Share: Share Sr Databricks Data Engineer with Facebook Share Sr Databricks Data Engineer with LinkedIn Share Sr Databricks Data Engineer with Twitter Caution against fraudulent job ...

Data Engineer

Richmond, VA · On-site

$113K - $136K/yr

Databricks * Data analysis experience * Additional ML tooling exposure * DevOps / cloud background * Familiarity with Jenkins and CI/CD / pipelining tools For more information please contact our ...

Data Engineer - AWS/Databricks

Reston, VA · Remote

$119K - $143K/yr

This role requires knowledge and/or experience with Spark, Delta Lake, and distributed data pipelines on Databricks. The ideal candidate brings both engineering and strategic insight into enterprise ...

Data Engineer - AWS/Databricks

Reston, VA · On-site +1

$119K - $143K/yr

This role requires knowledge and/or experience with Spark, Delta Lake, and distributed data pipelines on Databricks. The ideal candidate brings both engineering and strategic insight into enterprise ...

Data Engineer

Arlington, VA · On-site

$131K - $158K/yr

... Databricks, Spark, and related technologies. • Develop efficient data processing and ... open-source data engineering tools such as ElasticSearch and NiFi • Experience with Docker ...

Azure Data Engineer

Fairfax, VA · On-site

$113K - $136K/yr

Azure Data Factory, Synapse Analytics, Databricks, Data Lake, and Power BI * SQL and NoSQL database ... Data Governance, Data Engineering, and Data Science workflows * Machine Learning and MLOps best ...

Azure Data Engineer

Fairfax, VA

$113K - $136K/yr

Azure Data Factory, Synapse Analytics, Databricks, Data Lake, and Power BI * SQL and NoSQL database ... Data Governance, Data Engineering, and Data Science workflows * Machine Learning and MLOps best ...

Data Engineer

Mclean, VA · On-site

$115K - $139K/yr

Python Pyspark node.js vue.js AWS EMR/Glue Lambda S3 SNS DynamoDB Databricks Description: Sr. Data Engineer Leads meetings with internal customers to determine data needs Requirements mapping and ...

... Databricks Data Engineer Associate] is a plus - Designing and implementing thorough data ... PwC does not intend to hire experienced or entry level job seekers who will need, now or in the ...

As a Data Engineer on the Databricks platform at Booz Allen, you'll support mission-critical data initiatives for defense, intelligence, and civil clients. You'll help design, build, and maintain ...

next page

Showing results 1-20

Entry Level Databricks Data Engineer information

What is an Entry Level Databricks Data Engineer?

An Entry Level Databricks Data Engineer is a professional who uses Databricks, a cloud-based data analytics platform, to design, build, and maintain data pipelines. They are responsible for preparing and processing large datasets, ensuring data quality, and enabling analytics and machine learning workflows. Typically, they work with tools such as Apache Spark, SQL, and Python, and collaborate with data analysts and data scientists to deliver data-driven solutions. As entry-level engineers, they are expected to have foundational knowledge of data engineering concepts and be eager to learn more advanced techniques on the job.

What are the key skills and qualifications needed to thrive as an Entry Level Databricks Data Engineer, and why are they important?

To thrive as an Entry Level Databricks Data Engineer, you need a foundational understanding of data engineering concepts, SQL, and Python or Scala, typically supported by a relevant degree in computer science or a related field. Familiarity with Databricks, Apache Spark, cloud platforms (like AWS or Azure), and optional certifications such as Databricks Data Engineer Associate are highly valuable. Strong analytical thinking, attention to detail, and effective communication skills help you collaborate with teams and solve complex data challenges. These skills and qualities are essential for building reliable data pipelines, ensuring data quality, and delivering actionable insights in a fast-paced environment.

What are some common challenges faced by entry-level Databricks Data Engineers, and how can they effectively overcome them?

Entry-level Databricks Data Engineers often face challenges such as learning to optimize Apache Spark jobs, managing complex data pipelines, and understanding cloud-based workflows. To overcome these, it's important to dedicate time to hands-on practice with Databricks notebooks, collaborate closely with more experienced engineers, and actively participate in code reviews and team discussions. Leveraging Databricks' extensive documentation and community forums can also help troubleshoot issues and stay updated on best practices.
What are the most commonly searched types of Databricks Data Engineer jobs in Virginia? The most popular types of Databricks Data Engineer jobs in Virginia are:
What are popular job titles related to Entry Level Databricks Data Engineer jobs in Virginia? For Entry Level Databricks Data Engineer jobs in Virginia, the most frequently searched job titles are:
What job categories do people searching Entry Level Databricks Data Engineer jobs in Virginia look for? The top searched job categories for Entry Level Databricks Data Engineer jobs in Virginia are:
What cities in Virginia are hiring for Entry Level Databricks Data Engineer jobs? Cities in Virginia with the most Entry Level Databricks Data Engineer job openings:
Databricks Data Engineer

Databricks Data Engineer

Berkley

Manassas, VA • On-site

$114K - $137K/yr

Other

Medical, Dental, Vision, Life, Retirement, PTO

Posted 15 days ago


Job description

Company Details
We started in early 2019 as a small group of technologists with a passion for making insurance better. Today we are working with a team of industry experts who run five different insurance brands and collectively control $1 billion in annual premiums.
We believe in an idea and execution meritocracy. In other words, a place where the best ideas win and the people who deliver the most value get the most opportunities.
As we grow our team, we are looking for inquisitive, entrepreneurial people who are excited to reimagine the insurance industry.
Insurance is too complex. Help us make it better.
Responsibilities
This position requires on-site work Monday-Thursday at either our Manassas, VA or Chesterfield, MO location.
The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision advantage at scale. This role will work with cutting-edge tools like Databricks, Delta Lake, PySpark, and AI/BI genie to transform raw data into actionable insights. As a hands-on Databricks Data Engineer with deep expertise in Azure Databricks and MLOps, this role will have the opportunity to migrate and translate legacy SSIS ETL logic into scalable, cloud-native data pipelines in Databricks. This role will partner with data engineers, data scientists, and product manager to design features, train/evaluate models, and deploy them to production using MLflow, Databricks and Workflows-with rigorous observability, governance (Unity Catalog), and CI/CD automation.
Data Pipeline Engineering
  • Design, build, and maintain high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark.
  • Convert and modernize existing SSIS package logic into cloud-native Databricks pipelines using PySpark notebooks, Delta Live Tables (DLT), and Databricks Workflows.
  • Implement reliable batch and streaming pipelines with robust data quality and validation frameworks.
  • Optimize pipeline performance using Photon, efficient file formats, partitioning, Z-ordering, and caching strategies.

Lakehouse Platform Development
  • Develop and manage datasets within Delta Lake, ensuring ACID reliability, schema evolution, versioning, and time travel.
  • Architect feature-rich data layers including:
    • Bronze (raw ingestion)
    • Silver (validated, conformed)
    • Gold (analytics-ready and ML-ready)
  • Implement data governance using Unity Catalog for fine-grained access control, lineage, auditability, and metadata management.

MLOps & ML-Enabled Data Pipelines
  • Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines.
  • Deploy and operationalize models using MLflow, Databricks Model Registry, and Databricks Workflows.
  • Use Databricks built-in AI SQL functions such as ai_query, ai_forecast, ai_analyze_sentiment to generate actionable insight from large amount of unstructured or structured raw data
  • Implement monitoring for:
    • Pipeline failures
    • Data/feature drift
    • Model performance degradation
    • Operational SLAs/SLIs/SLOs
  • Build automated CI/CD workflows using GitHub Actions or Azure DevOps for notebook deployment, pipeline testing, and environment promotion.

Data Platform, Data Security & Data Governance
  • Collaborate with data engineers to design reliable data products on Delta Lake; leverage Delta Live Tables (DLT) for declarative pipelines when applicable.
  • Enforce Unity Catalog for lineage, permissions, and audit; manage secrets, tokens, and keys securely (e.g., Databricks secrets, Key Vault/Secrets Manager).

Collaboration & Leadership
  • Work closely with cross-functional teams: data engineering, data scientist, product manager, and business stakeholders.
  • Serve as a Databricks SME-championing best practices, code standards, governance, and reusable frameworks.
  • Document architecture, workflows, data models, runbooks, and operational procedures.

Qualifications
  • Minimum of 3 years of experience in Databricks, PySpark notebooks, Python, DevOps, software development, and data engineering.
  • Certified Databricks Data Engineer Associate or Professional is a plus.

Skills & Competencies
  • Proficient in designing, building, deploying, and maintaining high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark Notebook.
  • Proficient in building, deploying, and operating production ML models such as supervised, unsupervised, and anomaly detection, including techniques for imbalanced datasets
  • Proficient with ML engineering and MLOps, including model versioning, CI/CD for ML, monitoring, drift detection, and automated retraining
  • Proficiency in Python including Pandas and PySpark Dataframes
  • Expert level of SQL skills including Stored Procedure, experience with SSIS, SSRS, Power BI is a plus.
  • Proficient with cloud data engineering platforms, such as Azure, Databricks, Spark, or SQL, and batch and streaming pipelines
  • Familiar with Databricks AI Built-In Functions such as AI_Query, AI_Gen, AI_Classify, AI_Forecast, AI_Analyze_Sentiment, able to use them to extract actionable insights from large amount of unstructured or structured raw data
  • Experience with Python and ML frameworks, such as PyTorch or TensorFlow
  • Experience improving data quality, lineage, and observability in enterprise data environments and operationalizing rules and model-driven scoring for prioritization, routing, or case selection
  • Experience with predictive analytics, machine learning and artificial intelligence desired.

Education
  • A Bachelor's degree in Computer Science, Management Information Systems, Engineering, Math, Physics, or a related quantitative field is required (4-year degree). Master's degree preferred
  • Experience in the commercial insurance industry is a plus.

Additional Company Details
The Company is an equal employment opportunity employer.
We do not accept any unsolicited resumes from external recruiting firms.
The company offers a competitive compensation plan and robust benefits package for full time regular employees.
Base salary & Benefits include Health, dental, vision, life, disability, wellness, paid time off, 401(k) and profit-sharing plans.
The actual salary for this position will be determined by a number of factors, including the scope, complexity and location of the role; the skills, education, training, credentials and experience of the candidate; and other conditions of employment.
Additional Requirements
• Ability to travel locally and nationally up to 5% of the time
Sponsorship Details
Sponsorship not Offered for this Role