1

Hadoop Python Jobs in Georgia (NOW HIRING)

Big Data Engineer

Alpharetta, GA · On-site

$54.50 - $72/hr

Work hands-on with Hadoop, Apache Spark, and other big data tools to support data processing and analysis. Create and maintain UNIX shell scripts and Python scripts for automation and data workflows.

The ideal candidate will have strong expertise in data science, including proficiency in Python ... Utilize cloud technologies (AWS, Azure, Hadoop) for scalable data solutions. * Create interactive ...

Sr. Data Engineer

Alpharetta, GA · On-site

$111.80K - $134.20K/yr

Python, Databricks * Handson experience in building and optimizing data processing applications ... Comprehensive understanding of Hadoop HDFS and cloud Big Data technologies with handson experience ...

Senior DevOps Engineer

Alpharetta, GA · On-site

$120K - $140K/yr

Build and manage big data clusters, including Azure Databricks and On-Premise Hadoop, for advanced ... Scripting and Automation: Proficiency in Python, Bash, or PowerShell for automation tasks.

Senior DevOps Engineer

Alpharetta, GA · On-site

$126.90K - $163K/yr

Build and manage big data clusters, including Azure Databricks and On-Premise Hadoop, for advanced ... Scripting and Automation: Proficiency in Python, Bash, or PowerShell for automation tasks.

Data Engineer

Atlanta, GA · On-site

$110.10K - $132.20K/yr

Hadoop, Map/Reduce, Spark, HBase, HDInsight, Data Bricks, Hive) and with programming languages like UNIX shell scripting, Python etc. Has used SQL, PL/SQL or T-SQL with RDBMSs like Teradata, MS SQL ...

Data Engineer

Atlanta, GA · On-site

$110.10K - $132.20K/yr

Hadoop, Map/Reduce, Spark, HBase, HDInsight, Data Bricks, Hive) and with programming languages like UNIX shell scripting, Python etc. • Has used SQL, PL/SQL or T-SQL with RDBMSs like Teradata, MS ...

Python or R strongly preferred Hands on experience with BI tools; Tableau, power BI etc. Excellent ... Hadoop, MapReduce, Spark or other Big Data Platforms Practical knowledge in NLP and text mining ...

next page

Showing results 1-20

Hadoop Python information

What are the key skills and qualifications needed to thrive as a Hadoop Python Developer, and why are they important?

To thrive as a Hadoop Python Developer, you need a strong understanding of distributed computing, Hadoop ecosystem components (like HDFS, MapReduce, Hive, or Pig), and advanced Python programming skills, often supported by a degree in computer science or related field. Familiarity with tools such as Apache Spark, Sqoop, and workflow schedulers (like Oozie or Airflow), along with experience in handling big data platforms, is typically required. Problem-solving abilities, attention to detail, and effective communication help developers collaborate with teams and translate business requirements into scalable data solutions. These skills and qualifications are essential for efficiently processing and analyzing large datasets, ensuring data reliability, and driving business insights.

How do Hadoop Python developers typically collaborate with data engineers and analysts on large-scale data projects?

Hadoop Python developers frequently work alongside data engineers and analysts to design, implement, and optimize data pipelines for handling vast datasets. They are responsible for writing Python scripts that interface with Hadoop components, ensuring data is processed efficiently and meets project requirements. Regular communication with data engineers helps align on infrastructure and architectural decisions, while close collaboration with analysts ensures data outputs are accurate and actionable. Agile methodologies and daily stand-ups are common, fostering teamwork and quick problem-solving.

What is a Hadoop Python developer?

A Hadoop Python developer is a software professional who specializes in using Python programming language to develop, implement, and maintain applications that process and analyze large datasets within the Hadoop ecosystem. They leverage Python libraries like PySpark to write scalable data processing scripts, interact with Hadoop components such as HDFS, and optimize big data workflows. These developers play a critical role in building data pipelines, performing data transformation, and supporting analytics projects in organizations that handle vast amounts of data.

What is the difference between Hadoop Python vs Hadoop Java Developer?

AspectHadoop PythonHadoop Java Developer
Required CredentialsPython programming skills, Hadoop certificationsJava programming skills, Hadoop certifications
Work EnvironmentData analysis, scripting, data pipeline developmentCore development, system integration, big data application coding
Industry UsageData science, analytics, machine learning projectsData infrastructure, platform development, system optimization

Hadoop Python and Hadoop Java Developer roles both involve working with Hadoop ecosystems, but Python focuses more on data analysis and scripting, while Java is geared towards core development and system integration. The choice depends on your programming expertise and career goals within big data environments.

What are popular job titles related to Hadoop Python jobs in Georgia? For Hadoop Python jobs in Georgia, the most frequently searched job titles are:
What job categories do people searching Hadoop Python jobs in Georgia look for? The top searched job categories for Hadoop Python jobs in Georgia are:
What cities in Georgia are hiring for Hadoop Python jobs? Cities in Georgia with the most Hadoop Python job openings:
Big Data Engineer

Big Data Engineer

TriOptus LLC

Alpharetta, GA • On-site

$54.50 - $72/hr

Contractor

Posted 29 days ago


Job description

Job Title: Big Data Engineer (Hybrid)
Location: Alpharetta, Georgia (Hybrid Role)
Employment Type: Contract-to-Hire (CTH)
MOR2JP00017198
Job Description:
We are seeking a highly skilled and experienced Data Engineer to join our team in Alpharetta, Georgia. This hybrid role offers an exciting opportunity for candidates with hands-on experience in Hadoop, Apache Spark, and other key technologies. As a Data Engineer, you will play a critical role in the migration process from Hadoop to Snowflake, contributing to the development and optimization of our data pipelines.
Responsibilities:
Lead and contribute to the migration of data pipelines and workloads from Hadoop to Snowflake (approximately 50% of the role).
Develop and maintain ETL processes and data transformations in a hybrid cloud environment.
Work hands-on with Hadoop, Apache Spark, and other big data tools to support data processing and analysis.
Create and maintain UNIX shell scripts and Python scripts for automation and data workflows.
Utilize SQL for querying, manipulating, and managing large datasets.
Participate in building and managing CI/CD pipelines for efficient data deployment and integration.
Collaborate with cross-functional teams to optimize data architecture and ensure smooth data flow within the organization.
Ensure high-quality production environments and handle production support as needed.
Requirements:
Education: Bachelor's degree in Computer Science, Information Technology, or a related field.
Experience: At least 7 years of professional experience as a Data Engineer or similar role.
Technical Expertise:
Extensive hands-on experience with Hadoop.
Strong proficiency in UNIX Shell scripting.
Solid experience with Python scripting for automation and data processing.
Proficiency in Apache Spark for distributed data processing.
Experience with CI/CD pipelines for continuous integration and delivery.
Strong knowledge of SQL for querying and managing large datasets.
Demonstrated production environment experience.
Desired Skills:
Familiarity with IDMC or any other ETL tools.
Prior finance industry experience is a plus.
Experience with Snowflake is highly preferred.
Interview Process:
First Round: 1-hour Zoom interview.
Second Round: Final interview