What Impact You'll Have
GRVTY's team provides tactical data engineering solutions. We embed skilled Dataย Engineers, Data Scientists, and ETL Developers directly into intelligence analyst groupsย to be their go-to data wranglers. We develop new tools, code, and services to executeย data engineering activities. Our engineers work to collect, process, and feed analyticย tools, turning data into intelligence in response to immediate mission needs, with directย impact on real world situations. You will see your work used here on a daily basis, andย you'll have the opportunity to support a variety of Sponsor mission organizations andย mission partner organizations.
This is a time of development and growth on the program, with an increasing number of missions being supported. The work is high impact and important, and the customer moves quickly. The environment is fast-paced, flexible, and open to innovation - you'll have more latitude here in choosing how to achieve results than on many other projects. The customer cares more about what you can do as opposed to your years of
experience, and work hours are typically quite flexible - roll up your sleeves, get thingsย done, and no one cares much about the specific hours that you work. The work spaceย itself is also quite nice, and there is an excellent cafeteria!ย The tech stack on this team is rather huge and includes Python (Pandas, numpy, scipy,ย scikit-learn, standard libraries, etc.), Python packages that wrap Machine Learningย (packages for NLP, Object Detection, etc.), Linux, AWS/C2S, Apache NiFi, Spark,ย pySpark, Hadoop, Kafka, ElasticSearch, Solr, Kibana, neo4J, MariaDB, Postgres,ย Docker, Puppet, and many others.ย Work on this program takes place in McLean, VA and in various field offices throughoutย Northern VA (we cannot support remote work) and requires a TS/SCI + Polygraphย clearance (acceptable to this customer).
What You'll Be Owning
GRVTY is seeking a Data & Software Engineer with a TS/SCI + Poly clearance (applicable to this customer) to join one of our top projects in McLean, VA.The Data & Software Engineer works with a small team to build complex data flows for a custom application. Successful candidate will have advanced Python programming skills, familiarity with Java, an understanding of data security, privacy, governance and compliance principles and a demonstrated history of building production data pipelines and ETL workflows at scale.
Candidate must have experience:
- Building end-to-end data pipelines leveraging Python
- Using orchestration tools to deploy data pipelines, including configuring and updating Spark Jobs
- Containerizing and deploying applications in cloud environments like AWS.
- Working with MySQL and PostgreSQL including performance tuning, schema design, and query optimization for complex, analytical workloads.ย
- Leveraging industry standard tools for code control (Git, IaaC control, etc.)
- Working with data catalogs, tracking data lineage ย and handling a variety of data formats, including Geospatial.
- Using Bash scripting for automation and data processing tasks
- Integrating Al/ML services and models
Responsibilities:
- Work with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight
- Leverage strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks
- Leverage a background in large-scale data migration or platform modernization efforts
- Contribute to data engineering documentation, best practices, and design patterns.
What You Must Have
- Active TS/SCI with Polygraph Clearance
- Minimum of 5 years' experience with:ย
- Apacheย Spark & PySpark
- Advancedย Python skills (including Pandas & NumPy)
- Docker, Podman
- AWSย S3, Lambda & Step functions
- Apacheย Iceberg, Airflow, etc.
- SQLย (with Trino)
- NoSQL, DynamoDB
- Unityย Catalog OSS, Apache Polaris
- Apacheย Superset
- Terraformย or CloudFormation
- OpenLineage
- H3, PostGIS
#LI-BPJ