Data Cleaning Jobs in Decatur, GA (NOW HIRING)

Perform data cleaning, transformation, and basic harmonization tasks. * Support merging of datasets and validation of outputs. * Assist in tracking and documenting changes in data structure ...

Work with the full data analysis pipeline including data cleaning, process documentation, and communicating results via report writing and visualization. * Participate actively in HR Business ...

Strong experience accessing, extracting, and cleaning primary data from large institutional databases. * Queries & Report Writing (3+ Years): Proficient in writing complex queries and translating ...

Performing ongoing data cleaning and database maintenance. * Supporting demo room setup and general facility logistics. * Following up with internal teams to track updates and ensuring timely ...

Experienced Data Analyst with expertise in SQL, Oracle databases, Tableau, and statistical data ... Skilled in extracting, cleaning, interpreting, and visualizing complex datasets to support ...

GA-802964 Hybrid/Local Data Analyst (15+) with BO, Tableau, Oracle/SQL Server, Data Management/cleaning, Criminal Justice experience Location: Atlanta, GA (DCS) Duration: 12 Months ON-SITE INTERVIEW ...

DATA ARCHITECT

Atlanta, GA

$60.50 - $78/hr

Clean, normalize, and standardize datasets to ensure accuracy and consistency. * Implement data validation and monitoring processes to maintain data integrity. * Troubleshoot and resolve data ...

... clean" data by reviewing computer reports, printouts, and performance indicators to locate and correct code problems Work with management to prioritize business and information needs Locate and ...

Data Cleaning information

What is a Data Cleaning job?

A Data Cleaning job involves identifying and correcting errors, inconsistencies, and inaccuracies in datasets to ensure high-quality data for analysis. This process includes removing duplicate records, filling in missing values, standardizing formats, and eliminating irrelevant or erroneous data. Data cleaning helps improve data accuracy, reliability, and usability for business intelligence, machine learning, and decision-making. Professionals in this role typically work with databases, spreadsheets, and data management tools to refine raw data into a structured and meaningful format.
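
As a rough illustration of the steps described above (standardizing formats, filling in missing values, dropping erroneous rows, and removing duplicates), here is a minimal sketch in plain Python. The field names `name` and `city`, the `"Unknown"` placeholder, and the `clean_records` helper are assumptions made up for this example; real projects would more likely use a tool such as pandas or SQL and rules specific to the dataset.

```python
def clean_records(rows):
    """Return a cleaned copy of `rows` (a list of dicts with 'name' and 'city').

    Hypothetical helper for illustration only; the field names and the
    'Unknown' placeholder are assumptions, not part of any real schema.
    """
    cleaned = []
    seen = set()
    for row in rows:
        # Standardize formats: trim whitespace and normalize capitalization.
        name = (row.get("name") or "").strip().title()
        city = (row.get("city") or "").strip().title()

        # Eliminate erroneous rows: a record without a name is unusable here.
        if not name:
            continue

        # Fill in missing values with an explicit placeholder.
        if not city:
            city = "Unknown"

        # Remove duplicates, keeping the first occurrence of each record.
        key = (name, city)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({"name": name, "city": city})
    return cleaned


sample = [
    {"name": "  ada lovelace ", "city": "decatur"},
    {"name": "Ada Lovelace", "city": "Decatur "},   # duplicate after standardizing
    {"name": "grace hopper", "city": None},          # missing city
    {"name": "", "city": "Atlanta"},                 # missing name, dropped
]
cleaned = clean_records(sample)
print(cleaned)
```

Standardizing before deduplicating matters in this sketch: the first two sample rows only compare as equal once whitespace and capitalization have been normalized.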

What are the key skills and qualifications needed to thrive in the Data Cleaning position, and why are they important?

To thrive in Data Cleaning, you need strong attention to detail, analytical skills, and a solid understanding of data management practices, often supported by training or coursework in data science, statistics, or information technology. Familiarity with tools such as Microsoft Excel, SQL, Python (with libraries such as pandas), or specialized data cleaning software is highly valuable. Problem-solving ability, persistence, and effective communication are important soft skills for identifying and resolving data inconsistencies while collaborating with other team members. Together, these skills ensure that datasets are accurate, reliable, and ready for analysis, which leads to trustworthy business insights.

What are the most common challenges faced by professionals in data cleaning roles?

One of the biggest challenges in data cleaning is dealing with incomplete, inconsistent, or duplicate data from multiple sources, which often requires creative problem-solving and close attention to detail. Communicating with team members to clarify data definitions and intended use is also a frequent part of the job, as misinterpretations can lead to errors. Additionally, deadlines and large datasets can make the role fast-paced, so strong organizational skills and efficiency are important. However, overcoming these challenges offers valuable experience and plays a crucial role in ensuring the success of projects that depend on high-quality data.

Infographic: Data Cleaning job openings in Decatur, GA as of May 2026. Employment types are 79% Full Time, 18% Part Time, and 3% Contract; 82% of openings are on-site ("Physical") and 18% are Remote.

Big Data Developer- Q123

R2 Technologies Corporation

Alpharetta, GA • On-site

$54.50 - $72/hr

Full-time

Medical, Dental, Vision, PTO

Posted 27 days ago


Job description

Overview:
R2 Technologies Corporation (R2) is a technology services provider headquartered in Alpharetta, GA, with expertise in a range of cutting-edge technologies. R2 specializes in Java, Dot Net, Big Data, Cloud Computing, artificial intelligence (AI), machine learning (ML), software development, project management, SAP, and enterprise resource planning (ERP) systems. Additionally, R2 offers highly skilled resources and productivity platforms that enable clients to rapidly deliver business value to their stakeholders.
R2's strength lies in providing platform-based solutions, architecting, and designing enterprise solutions, leveraging cloud technologies such as Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure to deliver scalable and cost-effective solutions.
R2's expertise in AI and ML enables clients to leverage the power of data to make data-driven decisions and improve their overall performance. R2 also provides solutions for internet of things (IoT) and blockchain technologies, which can help clients improve their supply chain management and streamline their operations.
Since its inception, R2 has rapidly grown to become one of the most respected and trusted technology companies in the United States, providing product development and staffing services to a diverse range of clients, including small and midsize businesses, as well as Fortune 1000 companies.
Job Title: Big Data Engineer
Location: Alpharetta, GA.
Type: Full-time / Contract
Overview:
We are seeking a skilled and experienced Developer to join our team. The ideal candidate will have expertise in programming and experience in building scalable and reliable applications.
Responsibilities:
• Understand the enterprise architecture within the context of existing platform services and strategic direction.
• Take a broad, enterprise-wide view across all technical disciplines to evaluate interoperability and incorporate it into the solution architecture.
• Understand end-to-end solutions with sound technical architecture in a Big Data analytics framework, along with customized solutions that are scalable, with a primary focus on performance, quality, maintainability, cost, and testability.
• Deliver innovative solutions within the platform to establish common components, while allowing customization of solutions for different products.
• Demonstrate knowledge of software development technology, principles, methods, tools, and practices; industry standards and trends; and current web and database technologies.
Required Skills:
• 5+ years of experience with Big Data pipelines including: Spark and Scala;
• 5+ years of experience working with internal stakeholders including: Networking, API Market Place, and Infosec and external third parties to build new services;
• 3+ years of AWS experience in building Enterprise scale applications and services;
• 3+ years of experience with AWS services including: ECS Fargate, EKS, Docker containers, Lambda, EMR, EC2, SNS/SQS, MSK, S3, and RDS;
• 3+ years of experience building scaled data platforms and enterprise products on AWS cloud;
• 3+ years of experience in building Enterprise Level AWS infrastructure using Terraform or Cloud Formation Templates;
• 2+ years of experience working with Orchestration services and hands-on experience with Airflow;
• 2+ years of scripting experience with Shell or Python;
• Experience using DevOps and CI/CD concepts to create deployment pipelines;
• Programming skills for data processing, availability, scalability, clustering, microservices, multi-threaded development, and performance patterns.
• 3+ years of experience in production support and data reprocessing at any stage in the data pipeline.
• Experience with Python or Java, and Spark
• Experience in Apache Kafka
• Experience in data cleaning, visualization and reporting using Redshift.
• Experience in AWS EMR or Hadoop, and MapReduce
• Experience in Data Mining
• Experience processing large amounts of structured and unstructured data, including integrating data from multiple sources.
• Experience in AWS EC2, AWS RDS, AWS EBS, AWS IAM, and AWS S3
• Experience using SQL with MySQL or Oracle databases.
• 2+ years of experience developing distributed processing applications with big data technologies such as NoSQL stores (e.g., MongoDB) or Spark, and building applications with immutable infrastructure in the AWS (Amazon Web Services) cloud using automation technologies such as Terraform, Ansible, or CloudFormation.
Optional Skills:
• Experience in designing and implementing highly performant data ingestion pipelines from multiple sources using Apache Spark and/or Azure Databricks
• Show efficiency in handling data - tracking data lineage, ensuring data quality, and improving discoverability of data.
• Integrating end-to-end data pipelines that take data from source systems to target data repositories, ensuring that the quality and consistency of the data are always maintained.
• Knowledge of Engineering and Operational Excellence using standard methodologies.
• Comfortable using PySpark APIs to perform advanced data transformations.
• Familiarity with implementing classes in Python.
Qualifications:
• Bachelor's degree in Computer Science, Engineering, or a related field.
• Relevant certification.
Attributes:
We are seeking a candidate who is passionate, intelligent, and a critical thinker. The ideal candidate should be a proactive communicator, documenting their work clearly and succinctly. They should be detail-oriented, thoughtful, and respectful, with a focus on teamwork. The candidate should possess strong problem-solving skills and have the ability to work independently and within a team. They should be able to adapt to changing requirements and maintain a positive attitude in a fast-paced environment.
What's In It for You?
We offer competitive benefits, pay, and bonus potential, including group health insurance, vision and dental insurance, and paid vacation.
Skills:
Big Data, Spark, Scala, PySpark