Remote Databricks Developer Jobs in Raleigh, NC (NOW HIRING)

Sr. Data Engineer-Databricks SME (Remote)

Raleigh, NC · On-site +1

$111K - $133K/yr

... into Databricks. * This remote contract-to-hire position will be originated in Raleigh, NC ... Integration of Git in continuous deployment and experience with DevOps monitoring tools is a plus.

These solutions are powered by engineering for business advantage, transforming mission-critical ... remote client service delivery. Recruiting for this role ends on 06/30/2026. Work you'll do As a ...

We unify internal and external data on modern cloud platforms-including Snowflake and Databricks ... You will also guide engineers and reinforce strong delivery practices, while advancing the team ...

Data Engineer

Raleigh, NC · On-site +1

$111K - $133K/yr

... DevOps, and Dynamics 365 data integration initiatives. The Exponential Technology Group (XTG) is a ... This opportunity is remote with the ideal candidate being located in DFW, Phoenix, Atlanta or ...

Solutions Engineer, DoiT Cloud Intelligence

Raleigh, NC · Remote

$54.25 - $72.50/hr

... US. (Fully remote) Who We Are DoiT is a global technology company empowering cloud-driven ... Databricks, BigQuery, as well as FinOps experience. As a trusted and award-winning strategic ...

Remote Databricks Developer information

What are the key skills and qualifications needed to thrive as a Remote Databricks Developer, and why are they important?

To thrive as a Remote Databricks Developer, you need strong expertise in data engineering, programming languages like Python or Scala, and experience with big data frameworks, typically supported by a degree in computer science or a related field. Employers commonly require proficiency with the Databricks platform, Apache Spark, and cloud services (such as AWS or Azure), and often look for relevant certifications such as the Databricks Certified Associate Developer for Apache Spark. Strong problem-solving, communication, and self-motivation are crucial soft skills for remote collaboration and project delivery. Together, these skills support efficient development, scalable data solutions, and effective teamwork in distributed environments.

What is the difference between a Remote Databricks Developer and a Data Engineer?

Aspect | Remote Databricks Developer | Data Engineer
Required Skills | Proficiency in Databricks, Spark, Python, SQL | Proficiency in data pipelines, ETL, cloud platforms, SQL
Work Environment | Collaborates on data projects using Databricks platform | Builds and maintains data infrastructure across cloud environments
Certifications | Databricks certifications often preferred | Cloud certifications (AWS, Azure), data engineering certifications

While both roles involve working with data and cloud platforms, a Remote Databricks Developer specializes in developing solutions within the Databricks environment, focusing on Spark and data analytics. A Data Engineer has a broader scope, designing and maintaining data pipelines and infrastructure across various platforms. The roles overlap in skills like SQL and cloud knowledge, but their primary focus and tools differ.

How does a Remote Databricks Developer typically collaborate with cross-functional teams while working from different locations?

Remote Databricks Developers often work closely with data engineers, data scientists, and business analysts through virtual collaboration tools like Slack, Jira, and Zoom. Since team members may be distributed across various time zones, clear communication, regular stand-up meetings, and thorough documentation are essential for ensuring alignment on project goals and deadlines. Developers are also expected to participate in code reviews and shared knowledge sessions to maintain coding standards and support a collaborative environment. This structure helps ensure that complex data solutions are delivered efficiently and meet business requirements.

What is a Remote Databricks Developer?

A Remote Databricks Developer is a software professional who specializes in building, managing, and optimizing data pipelines and analytics workflows on the Databricks platform, while working from a remote location. They use Databricks, which is based on Apache Spark, to process large datasets, develop ETL processes, implement machine learning models, and collaborate with data teams. Their responsibilities often include writing code in languages like Python, Scala, or SQL, integrating with cloud services, and ensuring data quality and security. Working remotely, they communicate with teams online and use cloud-based tools to complete their tasks efficiently.

Sr. Data Engineer-Databricks SME (Remote)

A.C. Coy

Raleigh, NC • Remote

$111K - $133K/yr

Contractor

Posted 12 days ago


Job description

  • Tier One Technologies is seeking a Data Engineer to support our US Government client with data ingestion, data deduplication and data tagging for migration of a large-scale data environment into Databricks.
  • This remote contract-to-hire position will be originated in Raleigh, NC.
  • SELECTED CANDIDATES WITHOUT REQUIRED CLEARANCE WILL BE SUBJECT TO A FEDERAL GOVERNMENT BACKGROUND INVESTIGATION TO RECEIVE IT.

  • Design, develop, and maintain scalable data ingestion pipelines to onboard structured, semi-structured, and unstructured data from batch and streaming sources (e.g., APIs, databases, flat files, message queues) into the Azure/Databricks environment.
  • Implement de-duplication strategies across large-scale datasets using deterministic and probabilistic matching techniques to ensure data integrity and reduce redundancy within the Data Lake.
  • Develop and enforce data tagging frameworks to classify, label, and annotate datasets with appropriate metadata (e.g., sensitivity, source, domain, lineage) to support data governance, discoverability, and compliance requirements.
  • Assist with operationalizing deployments and supporting cloud services for ETL operations. This includes standardizing and automating processes and workflows, creating documentation and knowledge articles, and assisting operations staff who have limited cloud experience.
  • Deliver written and oral presentations to high-level CIO management on the status of current efforts.
  • Possesses skills and experience related to business management, systems engineering, operations research, and management engineering. Typically has specialization in a particular technology or business application. Keeps abreast of technological developments and industry trends.
  • Assist with deployment, configuration, and management of the Azure cloud environment.
  • Assist with migration of existing ETL jobs into the Azure/Databricks cloud environment.
  • Ability to share optimizations and efficiencies with the larger team and management.
  • Ability to automate solutions to repetitive problems/tasks.
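
As a rough illustration of the de-duplication duty listed above, the sketch below combines a deterministic pass (exact match on a normalized key) with a simple fuzzy pass. It is not the employer's method: the record fields (`name`, `email`), the 0.85 threshold, and the function names are hypothetical, and a plain string-similarity score from Python's standard `difflib` stands in for a true probabilistic record-linkage model.

```python
from difflib import SequenceMatcher


def normalize(record):
    """Deterministic match key: lowercased name with collapsed spaces, lowercased email."""
    name = " ".join(record["name"].lower().split())
    email = record["email"].strip().lower()
    return name, email


def similarity(a, b):
    """String similarity in [0, 1]; a simple stand-in for a probabilistic match score."""
    return SequenceMatcher(None, a, b).ratio()


def deduplicate(records, threshold=0.85):
    """Return the first record of each duplicate group.

    Pass 1 (deterministic): drop records whose normalized key was already seen.
    Pass 2 (fuzzy): drop records that share an email with a kept record and
    whose names are at least `threshold` similar. The threshold is an assumption.
    """
    kept = []
    kept_keys = []
    for record in records:
        name, email = normalize(record)
        if (name, email) in kept_keys:
            continue  # exact duplicate after normalization
        if any(e == email and similarity(n, name) >= threshold for n, e in kept_keys):
            continue  # near duplicate: same email, very similar name
        kept_keys.append((name, email))
        kept.append(record)
    return kept


# Hypothetical sample data for illustration only.
sample = [
    {"name": "Jane Doe", "email": "jane@example.com"},
    {"name": "jane  doe ", "email": "JANE@example.com"},
    {"name": "Jane Do", "email": "jane@example.com"},
    {"name": "John Smith", "email": "john@example.com"},
]
unique = deduplicate(sample)
print([r["name"] for r in unique])
```

The pairwise comparison here is quadratic in the number of kept records, so at the scale the posting describes a real pipeline would typically block or bucket candidates first (for example by email domain or a hashed key) before scoring pairs.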

  • A degree from an accredited college or university in the applicable field of services is required. If the degree is not in the applicable field, four additional years of related experience are required.
  • 13+ years of overall IT experience.
  • 5+ years demonstrated experience designing and implementing data ingestion pipelines using tools such as Azure Data Factory, Apache Kafka, Apache NiFi, Spark Structured Streaming, or equivalent technologies.
  • 5+ years of experience applying de-duplication techniques at scale, including record linkage, fuzzy matching, and entity resolution across structured and unstructured datasets.
  • 5+ years of hands-on experience with data tagging and metadata management, including the use of tagging schemas, data catalogs (e.g., Azure Purview, Apache Atlas), and automated classification tools to support data governance and lineage tracking.
  • 5+ years of demonstrated experience working with unstructured data.
  • 2+ years of experience in using Databricks or other Spark-based platforms.
  • Fluency in at least one scripting language (Python, Perl, Ruby, or equivalent).
  • Experience with one or more of the following products and technologies: SAS, C++, Hadoop, SQL Database/Coding, Teradata, Oracle, Amazon S3, Apache Spark, Machine Learning, Natural Language Processing, and visualization tools such as Tableau, Strategy and QLIK is a plus.
  • Experience integrating Git into continuous deployment and using DevOps monitoring tools is a plus.
  • Familiarity with Cloud Operations support in Azure is a plus.
  • Excellent communication skills.
  • Must be able to obtain a Position of Public Trust Clearance.
  • Must be a US Citizen or have US Permanent Residence status (Green Card).
  • Must have resided in the US for the last 5 years and not have traveled outside the US for a combined total of 6 months or more in the last 5 years.

About A.C. Coy Company- Staffing & Consulting Services

Sourced by ZipRecruiter

Since 1986, our mission has been to be the “Always on Target” staffing and consulting provider by helping people and companies achieve their hiring goals. The team is dedicated to achieving these goals by attracting and retaining top talent that fits our clients’ culture and environment. We stay fully engaged with our people and companies through ongoing communication to ensure expectations are met.

Industry

Recruiting and staffing services

Company size

51 - 200 Employees

Headquarters location

Canonsburg, PA, US
