What Impact You'll Have
GRVTY's team provides tactical data engineering solutions. We embed skilled Data Engineers, Data Scientists, and ETL Developers directly into intelligence analyst groups to be their go-to data wranglers. We develop new tools, code, and services to execute data engineering activities. Our engineers work to collect, process, and feed analytic tools, turning data into intelligence in response to immediate mission needs, with direct impact on real-world situations. You will see your work used here on a daily basis, and you'll have the opportunity to support a variety of Sponsor mission organizations and mission partner organizations.
This is a time of development and growth on the program, with an increasing number of missions being supported. The work is high impact and important, and the customer moves quickly. The environment is fast-paced, flexible, and open to innovation - you'll have more latitude here in choosing how to achieve results than on many other projects. The customer cares more about what you can do than about your years of experience, and work hours are typically quite flexible - roll up your sleeves, get things done, and no one cares much about the specific hours that you work. The work space itself is also quite nice, and there is an excellent cafeteria!
The tech stack on this team is extensive and includes Python (pandas, NumPy, SciPy, scikit-learn, standard libraries, etc.), Python packages that wrap machine learning (packages for NLP, object detection, etc.), Linux, AWS/C2S, Apache NiFi, Spark, PySpark, Hadoop, Kafka, Elasticsearch, Solr, Kibana, Neo4j, MariaDB, Postgres, Docker, Puppet, and many others.
Work on this program takes place in McLean, VA and in various field offices throughout Northern VA (we cannot support remote work) and requires a TS/SCI + Polygraph clearance (acceptable to this customer).
What You'll Be Owning
GRVTY is seeking a Data Engineer with a TS/SCI + Poly clearance (applicable to this customer) to join one of our top projects in Chantilly, VA.
Responsibilities
- Develop new tools, code & services to execute data engineering activities. Activities include the following tasks:
- Move structured & unstructured data using approved methods. Execute data ingestion activities for storing data in a local or enterprise-level location. View data in its source format.
- Develop code to format data that supports exploration. Analyze source data formats & work with Data Scientists & partners to determine the formats & transforms that best meet mission objectives.
- Develop code and tools to provide one-time & on-going data formatting & transformations into enterprise or boutique data models. Implement existing ETL code & best practices/standards.
- Develop an ETL Code Transition Plan. Develop & deliver documentation for each project including ETL mappings, code use guide, code location & access instructions.
- Facilitate Code Reviews. Provide consulting services to support data transport, ingestion, conditioning, access, & management.
What You Must Have
- Active TS/SCI with Polygraph Clearance
- Ability and willingness to quickly learn a new tool
- Strong communication skills with both your teammates and your leads
- SQL, Python, PySpark experience
- Willingness to do development-type work when needed (junior-level development at most)
- Ability to be a self-starter and ask questions when needed
- Comfortable working with and manipulating data in compliance with the office's workflow
- Extract, Transform, and Load (ETL) tools and processes
- AWS
- APIs
- Linux
- Geospatial tools/data
What Would Be Nice to Have
- Palantir Foundry Experience
- Kubeflow
- Experience with OCR and text extraction of PDFs
- Experience with data validation / data quality checks after ETL to be sure the data is ready for end users
- Docker
- Jenkins
- Hadoop/Spark
- Kibana
- Kafka
- NiFi
- Elasticsearch