Data Engineer Ml Jobs in Reston, VA (NOW HIRING)

ML Engineer/Data Engineer Location: McLean, VA (Hybrid, 3 days onsite a week) Project overview: This position is for the ML decisioning team that owns a platform which does ML decisioning for mobile/web ...

Sr. Data Engineer (AI/ML)

Reston, VA · Remote

$100K - $160K/yr

Position: Sr Data Engineer (AI/ML) Location: Remote Security Clearance: DHS Suitability - contract requires U.S. Citizenship Must Have Qualifications: 5+ years of experience in Data/ML engineering ...

Collaborate with engineering and product development teams. Qualifications: Education/Experience: * 5+ years of experience as a Data Scientist, Data Engineer, ML Engineer, or Data Analyst and a ...

Databricks Data Engineer

Manassas Park, VA · On-site

$113K - $135.70K/yr

MLOps & ML-Enabled Data Pipelines * Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines. * Deploy and ...

Data Engineer

Arlington, VA · On-site

$131.90K - $158.40K/yr

Data Engineer Elder Research Inc., a wholly owned subsidiary of ManTech International Corporation ... Modernize and optimize data and ML workflows by implementing best practices for scalability ...

New

Data Engineer

Washington, DC · On-site

$90K - $110K/yr

Data Engineer Washington, DC (Day 1 onsite) The pay range for this role is $90k - $110k per annum ... Integrate Databricks workloads with downstream reporting, analytics, and AI/ML use cases

Databricks Data Engineer

Manassas, VA · On-site

$114.50K - $137.50K/yr

MLOps & ML-Enabled Data Pipelines * Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines. * Deploy and ...

Databricks Data Engineer

Manassas, VA

$114.50K - $137.50K/yr

MLOps & ML-Enabled Data Pipelines * Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines. * Deploy and ...

Data Engineer Ml information

Reston, VA salary details: $47.9K · $171.7K · $253.3K

How much do data engineer ml jobs pay per year?

As of May 28, 2026, the average yearly pay for data engineer ml in Reston, VA is $171,677.00, according to ZipRecruiter salary data. Most workers in this role earn between $138,900.00 and $176,900.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Data Engineer ML, and why are they important?

To thrive as a Data Engineer ML, you need strong programming skills (especially in Python or Scala), knowledge of data modeling, and a solid foundation in database technologies, typically supported by a degree in computer science or a related field. Familiarity with big data frameworks (like Spark or Hadoop), cloud platforms (AWS, GCP, or Azure), and ETL tools, as well as relevant certifications, is highly beneficial. Excellent problem-solving abilities, teamwork, and clear communication help you collaborate with data scientists and stakeholders effectively. These skills are essential for building robust data pipelines and infrastructure that enable scalable, high-quality machine learning solutions.

How do Data Engineer ML roles typically collaborate with data scientists and machine learning engineers on projects?

Data Engineer ML professionals work closely with data scientists and machine learning engineers by building and maintaining robust data pipelines, ensuring clean and reliable datasets are readily available for modeling and analysis. They often participate in meetings to understand model requirements, help optimize data storage for performance, and support the deployment of machine learning models into production environments. Effective collaboration involves continuous communication to troubleshoot data issues, implement data validation, and scale solutions as project needs evolve. This teamwork ensures that data-driven projects move efficiently from experimentation to deployment.

What does a Data Engineer ML do?

A Data Engineer ML (Machine Learning) is responsible for designing, building, and maintaining the data pipelines and infrastructure necessary for machine learning applications. They clean, process, and organize large datasets to ensure data quality and accessibility for data scientists and ML engineers. In addition, they may work on deploying machine learning models to production environments and optimizing data workflows for efficiency and scalability.

What is the difference between Data Engineer Ml vs Data Scientist?

Aspect | Data Engineer Ml | Data Scientist
Required Credentials | Bachelor's in CS, Data Engineering certifications | Bachelor's/Master's in CS, Data Science certifications
Work Environment | Building data pipelines, managing databases | Analyzing data, creating models
Employer & Industry Usage | Tech companies, finance, healthcare | Research institutions, tech firms, finance

A Data Engineer Ml focuses on developing and maintaining data infrastructure and pipelines, while a Data Scientist analyzes data and builds predictive models. The two roles often collaborate but serve different functions within data teams.

What cities near Reston, VA are hiring for Data Engineer Ml jobs? Cities near Reston, VA with the most Data Engineer Ml job openings:

Software Engineer-Data Engineering, Machine Learning (ML)

AAMVA (American Association of Motor Vehicle Administrators)

Arlington, VA • On-site

$131.90K - $158.40K/yr

Other

This job post has expired today. Applications are no longer accepted.


Job description

Machine Learning Data Engineer

The IT Division is responsible for the development and operations of information systems for the State and Federal agencies doing business related to or using information from the administration of motor vehicles and driver licenses.

The Machine Learning (ML) Data Engineer position has core responsibilities for the design, development, deployment, and operational support of machine learning solutions on cloud infrastructure. This includes the full model lifecycle — from data acquisition and dataset preparation through feature engineering, experimentation, model training, validation, production deployment, and ongoing monitoring. Current applications include anomaly detection across high-volume messaging networks, but the scope encompasses any ML capability that strengthens system reliability, operational intelligence, and data-driven decision-making across AAMVA systems.

Essential Duties and Responsibilities:

We are seeking a talented Data Engineer with machine learning experience to join our team. You will design, build, and operationalize ML solutions running on cloud infrastructure (Azure or AWS). You will work across the full model lifecycle: preparing datasets, engineering features, running experiments, deploying models to production, and operating them on cloud infrastructure.

As a detail-oriented professional, you have a strong track record of independently managing projects and driving them to successful completion. Your statistical foundation and engineering discipline enable you to move from exploratory analysis through to production-grade, monitored solutions. You communicate clearly with both technical and non-technical stakeholders — translating model behavior, data constraints, and engineering trade-offs into terms that drive decisions. You operate effectively across the broader IT organization, with sufficient general IT fluency to understand how ML systems interact with infrastructure, security, operations, and business workflows, and you proactively build those connections rather than working in a data silo.

Key responsibilities include:

  • Designing and building dataset preparation pipelines — acquiring, cleaning, transforming, and versioning data for ML training and evaluation
  • Engineering features that extract meaningful signals from structured and semi-structured data sources (time-series patterns, statistical profiles, categorical encodings)
  • Running structured experimentation — testing multiple algorithms against defined scenarios, measuring performance, and documenting findings
  • Training, evaluating, and tuning ML models including regression, classification, clustering, anomaly detection, and ensemble methods
  • Deploying models to production on cloud infrastructure and building the pipelines that keep them running (retraining, scoring, threshold management)
  • Monitoring model performance in production — tracking drift, false positive rates, and detection efficacy over time
  • Building and maintaining batch and streaming data pipelines using Synapse, Fabric, Spark, and Event Hubs that feed ML systems
  • Writing and optimizing analytical queries (SQL, KQL, PySpark) for data exploration, statistical profiling, and real-time analysis
  • Creating validation frameworks — synthetic test data generation, backtesting against historical logs, and shadow-mode evaluation
  • Building dashboards and visualizations that communicate model outputs to technical and non-technical stakeholders
  • Collaborating with cross-functional teams to identify ML opportunities and translate operational problems into data solutions; communicating findings, trade-offs, and model behavior clearly to technical and non-technical audiences across IT, operations, and leadership
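
For illustration only, several of the responsibilities above (synthetic test data generation, anomaly detection, threshold management, and backtesting) can be sketched in a few lines of Python. This is a minimal sketch and not AAMVA's method: the function names, the rolling z-score technique, the window size, the threshold, and the synthetic message counts are all assumptions chosen for the example.

```python
import random
import statistics


def make_synthetic_counts(n=300, spike_at=(100, 200), seed=7):
    """Generate a synthetic message-count series with spikes at known positions.

    All values are made up for the example; nothing here is real traffic.
    """
    rng = random.Random(seed)
    series = [rng.gauss(100.0, 5.0) for _ in range(n)]
    for i in spike_at:
        series[i] += 100.0  # injected anomaly whose location we know
    return series


def rolling_zscore_flags(series, window=30, threshold=4.0):
    """Flag points whose z-score against the preceding window exceeds threshold."""
    flagged = []
    for i in range(window, len(series)):
        history = series[i - window:i]
        mean = statistics.fmean(history)
        std = statistics.stdev(history)
        if std > 0 and abs(series[i] - mean) / std > threshold:
            flagged.append(i)
    return flagged


def backtest(flagged, injected, n_scored):
    """Return (recall, false_positive_rate) against the known injected anomalies."""
    flagged_set, injected_set = set(flagged), set(injected)
    recall = len(flagged_set & injected_set) / len(injected_set)
    false_positives = len(flagged_set - injected_set)
    false_positive_rate = false_positives / (n_scored - len(injected_set))
    return recall, false_positive_rate


if __name__ == "__main__":
    window = 30
    injected = (100, 200)
    series = make_synthetic_counts(spike_at=injected)
    flagged = rolling_zscore_flags(series, window=window)
    recall, fpr = backtest(flagged, injected, n_scored=len(series) - window)
    print(f"flagged={flagged} recall={recall:.2f} false_positive_rate={fpr:.4f}")
```

Because the anomaly positions are known in advance, the same harness can be rerun with different thresholds to compare detection rate against false positive rate before anything is deployed.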

Direct Reports: None

Qualifications:

Formal Education:

Bachelor's degree in computer science, data science, statistics, mathematics, or related quantitative field. Equivalent work experience may be substituted.

Knowledge, Skills, and Abilities:

  • 3–5 years of hands-on experience in data engineering, ML engineering, or applied analytics
  • Hands-on cloud platform experience (Azure or AWS) building and deploying data or ML solutions on managed cloud services; specific platform less important than depth of experience
  • Working knowledge of statistical foundations: distributions, variance, standard deviation, trend vs. seasonality, hypothesis testing, and how to apply them to real operational data
  • Experience with the ML experiment-to-production cycle: dataset preparation, feature engineering, model training, evaluation, and deployment
  • Proficiency in Python for data processing, statistical analysis, and ML model development
  • Strong SQL skills with understanding of relational database fundamentals: data modeling, query optimization, indexing strategies, and how SQL Server infrastructure supports production workloads (T-SQL, stored procedures, Availability Groups)
  • Experience building data pipelines that handle batch and streaming workloads
  • Experience with version control systems (Git) and CI/CD practices
  • Strong problem-solving skills, attention to detail, and ability to work independently on ambiguous problems
  • Strong written and verbal communication skills — able to explain technical findings to non-technical stakeholders and engage productively across IT, operations, and leadership; comfort operating outside the ML silo and contributing to broader technology discussions
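
As an illustration of the "trend vs. seasonality" item in the list above, the following Python sketch separates a series into a trend and a repeating seasonal pattern using a simplified classical additive decomposition. It is an example under stated assumptions, not part of the posting's requirements: the `decompose` function, the weekly period of 7, and the synthetic series are all hypothetical.

```python
from statistics import fmean


def decompose(series, period):
    """Simplified additive decomposition for an odd period.

    The trend is a centered moving average over one full period; the seasonal
    component is the mean detrended value at each position within the period.
    """
    half = period // 2
    trend = [None] * len(series)
    for t in range(half, len(series) - half):
        trend[t] = fmean(series[t - half:t + half + 1])

    buckets = [[] for _ in range(period)]
    for t, value in enumerate(series):
        if trend[t] is not None:
            buckets[t % period].append(value - trend[t])
    seasonal = [fmean(bucket) for bucket in buckets]
    return trend, seasonal


# Hypothetical weekly pattern that sums to zero, added to a linear trend.
PATTERN = [-3.0, -1.0, 0.0, 2.0, 4.0, 1.0, -3.0]
SERIES = [0.5 * t + PATTERN[t % 7] for t in range(56)]

if __name__ == "__main__":
    trend, seasonal = decompose(SERIES, period=7)
    print("seasonal estimate:", [round(s, 3) for s in seasonal])
```

On real operational data the trend is rarely exactly linear and the residual is not zero, so the recovered seasonal pattern is only an estimate; the residual is what an anomaly detector would typically inspect.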

Preferred Qualifications:

  • Experience with time-series analysis, anomaly detection, or statistical process control on operational data
  • Familiarity with unsupervised and semi-supervised techniques (isolation forest, clustering, ensemble methods)
  • Experience building and managing ML model lifecycle on Azure (MLflow, Fabric ML, Azure ML) or AWS (SageMaker, Glue, Step Functions)
  • Familiarity with KQL (Kusto Query Language) for time-series decomposition, log analytics, or real-time data exploration
  • Knowledge of data modeling and dimensional modeling concepts
  • Experience with synthetic test data generation and model validation frameworks
  • Familiarity with operations and monitoring of mission-critical data platforms

Technical Stack:

  • Core Technologies: Microsoft Fabric, Azure Synapse Analytics, Apache Spark, Delta Lake, Azure Event Hubs
  • ML & Analytics: scikit-learn, PySpark ML, statistical modeling, time-series analysis, feature engineering, model validation
  • Languages: Python, SQL, PySpark, KQL, C#
  • Data Infrastructure: T-SQL, Stored Procedures, SQL Server Availability Groups
  • Azure Services: Azure Functions, Azure Data Factory, Azure Key Vault
  • Optional: Databricks, Snowflake, Lakehouse Architecture, Azure OpenAI; AWS candidates: equivalent services (SageMaker, Glue, Kinesis, Redshift) are acceptable in place of Azure-specific stack items
  • Visualization: Power BI
  • Development: Azure DevOps, CI/CD

Disclaimer Statement: The preceding job description has been written to reflect management's assignment of essential functions. It does not prescribe or restrict the tasks that may be assigned.

AAMVA is an Equal Opportunity Employer/Veterans/Disabled