1

Permanent Data Labeling Jobs (NOW HIRING)

Sr. Data Engineer-Databricks SME (Remote)

Raleigh, NC · On-site +1

$111K - $133K/yr

Develop and enforce data tagging frameworks to classify, label, and annotate datasets with ... Must be a US Citizen or have US Permanent Residence status (Green Card). * Must have resided in the ...

... and data labeling * Design robust software with comprehensive test suites and monitoring ... making a permanent impact on the future of the internet, and on humanity. Our Perks * Health ...

Sr. Software Engineer, Growth

Manhattan, NY · On-site +1

$135K - $178K/yr

... and data labeling * Design robust software with comprehensive test suites and monitoring ... making a permanent impact on the future of the internet, and on humanity. Our Perks * Health ...

Apprentice - Vehicle Operator (Data Collection)

Troy, MI · On-site

$20.50 - $25.75/hr

Verify successful data transfer, labeling accuracy, and metadata completeness. * Coordinate with ... Candidates must be legally authorized to work in the United States on a permanent basis. * Must be ...

Be Seen First

... to-Permanent, and Permanent Jobs throughout Illinois, Indiana, and Tennessee. We proudly offer ... Identify labeling errors, misclassifications, data omissions, or inconsistencies Qualifications:

next page

Showing results 1-20

Permanent Data Labeling information

What cities are hiring for Permanent Data Labeling jobs? Cities with the most Permanent Data Labeling job openings:
What are the most commonly searched types of Data Labeling jobs? The most popular types of Data Labeling jobs are:
What states have the most Permanent Data Labeling jobs? States with the most job openings for Permanent Data Labeling jobs include:
Sr. Data Engineer-Databricks SME (Remote)

Sr. Data Engineer-Databricks SME (Remote)

A.C. Coy

Raleigh, NC • Remote

$111K - $133K/yr

Contractor

Posted yesterday


Job description

  • Tier One Technologies is seeking a Data Engineer to support our US Government client with data ingestion, data deduplication and data tagging for migration of a large-scale data environment into Databricks.
  • This remote contract-to-hire position will be originated in Raleigh, NC.
  • SELECTED CANDIDATES WITHOUT REQUIRED CLEARANCE WILL BE SUBJECT TO A FEDERAL GOVERNMENT BACKGROUND INVESTIGATION TO RECEIVE IT.

  • Design, develop, and maintain scalable data ingestion pipelines to onboard structured, semi-structured, and unstructured data from batch and streaming sources (e.g., APIs, databases, flat files, message queues) into the Azure/Databricks environment.
  • Implement de-duplication strategies across large-scale datasets using deterministic and probabilistic matching techniques to ensure data integrity and reduce redundancy within the Data Lake.
  • Develop and enforce data tagging frameworks to classify, label, and annotate datasets with appropriate metadata (e.g., sensitivity, source, domain, lineage) to support data governance, discoverability, and compliance requirements.
  • Assist with Operationalizing deployments and support of Cloud services for ETL Operations. This will include standardizing and automating processes and workflows, creating documentation/knowledge articles, and overall assisting Operations staff who have limited experience in Cloud.
  • Written and oral presentations to high-level CIO management on status of current efforts.
  • Possesses skills and experience related to business management, systems engineering, operations research, and management engineering. Typically has specialization in a particular technology or business application. Keeps abreast of technological developments and industry trends.
  • Assist with deployment, configuration, and management of Azure Cloud environment.
  • Assist with migration efforts of existing ETL jobs into Azure/Databricks cloud environment.
  • Ability to share optimization and efficiencies with the larger team and management.
  • Ability to automate solutions to repetitive problems/tasks.

  • A degree from an accredited College/University in the applicable field of services is required. If the degree is not in the applicable field, then four additional years of related experience is required.
  • 13+ years of overall IT experience.
  • 5+ years demonstrated experience designing and implementing data ingestion pipelines using tools such as Azure Data Factory, Apache Kafka, Apache NiFi, Spark Structured Streaming, or equivalent technologies.
  • 5+ years of experience applying de-duplication techniques at scale, including record linkage, fuzzy matching, and entity resolution across structured and unstructured datasets.
  • 5+ years of hands-on experience with data tagging and metadata management, including the use of tagging schemas, data catalogs (e.g., Azure Purview, Apache Atlas), and automated classification tools to support data governance and lineage tracking.
  • 5+ years of demonstrated experience working with unstructured data.
  • 2+ years of experience in using Databricks or other Spark-based platforms.
  • Fluency in at least one scripting language (Python, Perl, Ruby, or equivalent).
  • Experience with one or more of the following products and technologies: SAS, C++, Hadoop, SQL Database/Coding, Teradata, Oracle, Amazon S3, Apache Spark, Machine Learning, Natural Language Processing, and visualization tools such as Tableau, Strategy and QLIK is a plus.
  • Integration of Git in continuous deployment and experience with DevOps monitoring tools is a plus.
  • Familiarity with Cloud Operations support in Azure is a plus.
  • Excellent communication skills.
  • Must be able to obtain a Position of Public Trust Clearance.
  • Must be a US Citizen or have US Permanent Residence status (Green Card).
  • Must have resided in the US for the last 5 years and not have traveled outside the US for a combined total of 6 months or more in last 5 years.

A.C. Coy Company- Staffing & Consulting Services logo

About A.C. Coy Company- Staffing & Consulting Services

Sourced by ZipRecruiter

Since 1986, our mission is to be the “Always on Target” staffing and consulting provider by helping people and companies achieve their hiring goals. The team is dedicated to achieving these goals by attracting and retaining top talent that meets our clients’ culture and environment. We are fully engaged with our people and companies through ongoing communication to ensure expectations are met.

Industry

Recruiting and staffing services

Company size

51 - 200 Employees

Headquarters location

Canonsburg, PA, US

Social media