Job Summary:
Oak Ridge National Laboratory (ORNL) is a U.S. Department of Energy national laboratory focused on solving complex scientific challenges. The Geospatial Data Engineer will support research and operational workflows in geospatial data science, applied machine learning, and engineering practices to deliver geospatial products for national security and humanitarian efforts.
Responsibilities:
• Develop, maintain, and operationalize geospatial data science pipelines across ingestion, feature engineering, training, inference, evaluation, and delivery, using reproducible MLOps practices (version control, testing, experiment tracking, containerization, and CI/CD).
• Support implementation of agentic AI workflows to discover, gather, and prepare data from open-source repositories (e.g., catalogs, APIs, and bulk downloads), including provenance tracking, metadata extraction, and licensing/usage notes.
• Build scalable geospatial data preparation and validation routines for raster and vector data (projection harmonization, spatial joins, tiling/chunking, and QA/QC).
• Develop geospatial validation frameworks for model outputs (e.g., comparisons to reference datasets, spatial cross-validation, summary dashboards, and automated report generation).
• Support documentation, metadata development, and version tracking for data products and model releases; contribute to technical summaries, figures, and reports/publications as appropriate.
• Participate in code reviews, model reviews, and data readiness reviews to ensure analytical defensibility, transparency, and fitness-for-use in operational and decision-support contexts.
• Collaborate with research staff to integrate new data sources, indicators, and modeling approaches into existing workflows; communicate clearly across technical and domain teams.
Qualifications:
Required:
• Bachelor’s degree and 3+ year’s experience in Geography, GIScience, Computer Science, Data Science, Statistics, Engineering, or a related field with a strong quantitative and software development emphasis.
• Demonstrated experience with geospatial analysis using Python in a production or research to production environment leveraging common geospatial libraries (e.g., geopandas, rasterio, shapely, pyproj) and/or enterprise GIS tooling (e.g., PostGIS).
• Strong software engineering fundamentals: Git-based workflows, testing, code review, and writing maintainable, well-documented code.
• Experience preparing and validating raster and vector datasets (data cleaning, transformation, projection/CRS management, and quality control).
• Working knowledge of machine learning and statistical modeling concepts (e.g., regression, classification, clustering, model evaluation).
• Ability to work effectively in a team-based, production-oriented research environment and communicate technical results to diverse stakeholders.
Preferred:
• Master’s degree in a relevant discipline or equivalent applied experience in geospatial data science, MLOps, or applied machine learning.
• Experience with modern MLOps tooling and practices (e.g., MLflow or equivalent experiment tracking, model registries, containerization, reproducible environments).
• Experience building data pipelines and workflow orchestration (e.g., Airflow, Prefect, Dagster, Make/Snakemake) and working in Linux/HPC environments.
• Experience with large, multi-resolution geospatial datasets and performance-oriented processing (tiling, chunking, parallelization; Dask/Spark a plus).
• Experience using or building agentic/LLM-enabled workflows for data discovery, extraction, and normalization, with attention to provenance, reproducibility, and quality.
• Familiarity with uncertainty, data limitations, and bias in population and demographic modeling and in applied geospatial decision-support contexts.
• Active or eligible U.S. security clearance or ability to obtain one.
Company:
Oak Ridge National Laboratory holds a range of R&D assignments, from fundamental nuclear physics to applied R&D on advanced energy systems. Founded in 1943, the company is headquartered in Oak Ridge, USA, with a team of 5001-10000 employees. The company is currently Late Stage.