Hadoop Python Jobs in Missouri (NOW HIRING)

Sr Developer

O'Fallon, MO · On-site

$51 - $67.25/hr

... Hadoop, Spark, SPARK-SQL, Hive, • NIFI, KAFKA - knowledge advantage • Knowledge in either scala or python • Should have good working experience/knowledge in SQL • Should have good ...

Big Data Engineer

Creve Coeur, MO · On-site

$52.25 - $69/hr

... as HBASE, Hadoop/Spark, AWS EMR, Cassandra * Extensive ETL experience * General Purpose Programming languages (Java, C, Scala, Python, Erlang, etc.) * Database Technology - (Postgres, Mongo ...

Big Data Engineer

Creve Coeur, MO

$52.25 - $69/hr

... as HBASE, Hadoop/Spark, AWS EMR, Cassandra * Extensive ETL experience * General Purpose Programming languages (Java, C, Scala, Python, Erlang, etc.) * Database Technology - (Postgres, Mongo ...

$88.40K - $106.10K/yr

Strong programming skills in Java and Scala, with additional knowledge of Python considered a plus. * Solid experience with big data technologies such as Hadoop, Spark, Ab Initio, and Informatica.

SQL and Python * GenAI tools and AI-assisted development (prompt engineering, rapid prototyping ... Familiarity with big data concepts (Spark, Hadoop) and visualization tools (Tableau, Power BI ...

Senior Software Engineer (Big Data)

O'Fallon, MO · On-site +1

$52.25 - $69/hr

This is a Software Engineer backend engineering role, focused on Scala/Python/Java, distributed ... Kafka Hadoop ecosystem Hive/Impala Exposure to model monitoring, or AI platform enablement ...

Hadoop Python information

What are the key skills and qualifications needed to thrive as a Hadoop Python Developer, and why are they important?

To thrive as a Hadoop Python Developer, you need a strong understanding of distributed computing, Hadoop ecosystem components (such as HDFS, MapReduce, Hive, or Pig), and advanced Python programming skills, often supported by a degree in computer science or a related field. Familiarity with tools such as Apache Spark, Sqoop, and workflow schedulers (such as Oozie or Airflow), along with experience with big data platforms, is typically required. Problem-solving abilities, attention to detail, and effective communication help developers collaborate with teams and translate business requirements into scalable data solutions. These skills and qualifications are essential for efficiently processing and analyzing large datasets, ensuring data reliability, and driving business insights.

How do Hadoop Python developers typically collaborate with data engineers and analysts on large-scale data projects?

Hadoop Python developers frequently work alongside data engineers and analysts to design, implement, and optimize data pipelines for handling vast datasets. They are responsible for writing Python scripts that interface with Hadoop components, ensuring data is processed efficiently and meets project requirements. Regular communication with data engineers helps align on infrastructure and architectural decisions, while close collaboration with analysts ensures data outputs are accurate and actionable. Agile methodologies and daily stand-ups are common, fostering teamwork and quick problem-solving.

What is a Hadoop Python developer?

A Hadoop Python developer is a software professional who specializes in using the Python programming language to develop, implement, and maintain applications that process and analyze large datasets within the Hadoop ecosystem. They use Python libraries such as PySpark to write scalable data processing scripts, interact with Hadoop components such as HDFS, and optimize big data workflows. These developers play a critical role in building data pipelines, performing data transformation, and supporting analytics projects in organizations that handle vast amounts of data.
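As a rough illustration of the kind of data processing script described above, here is a minimal word-count sketch. It is an assumption-laden example, not a production job: it uses only the Python standard library and follows the Hadoop Streaming convention of tab-separated key/value lines rather than PySpark, and the function names (`map_words`, `reduce_counts`, `run_locally`) are hypothetical.

```python
from itertools import groupby
from typing import Iterable, Iterator, List


def map_words(lines: Iterable[str]) -> Iterator[str]:
    """Mapper: emit one 'word<TAB>1' record for every word in the input."""
    for line in lines:
        for word in line.lower().split():
            yield f"{word}\t1"


def reduce_counts(sorted_records: Iterable[str]) -> Iterator[str]:
    """Reducer: sum the counts per key. Input must already be sorted by key."""
    pairs = (record.rstrip("\n").split("\t", 1) for record in sorted_records)
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def run_locally(lines: Iterable[str]) -> List[str]:
    """Simulate map -> sort (shuffle) -> reduce in a single process."""
    return list(reduce_counts(sorted(map_words(lines))))


if __name__ == "__main__":
    for record in run_locally(["big data big", "Data pipeline"]):
        print(record)
```

On a real cluster, the mapper and reducer would typically live in separate scripts that read `sys.stdin` and write to standard output, and Hadoop itself would perform the sort between the two phases. `run_locally` only imitates that step for small test inputs.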

What is the difference between a Hadoop Python developer and a Hadoop Java developer?

Aspect | Hadoop Python | Hadoop Java Developer
Required Credentials | Python programming skills, Hadoop certifications | Java programming skills, Hadoop certifications
Work Environment | Data analysis, scripting, data pipeline development | Core development, system integration, big data application coding
Industry Usage | Data science, analytics, machine learning projects | Data infrastructure, platform development, system optimization

Hadoop Python and Hadoop Java Developer roles both involve working with Hadoop ecosystems, but Python focuses more on data analysis and scripting, while Java is geared towards core development and system integration. The choice depends on your programming expertise and career goals within big data environments.

Infographic showing various Hadoop Python job openings in Missouri as of May 2026, with employment types broken down into 1% Internship, 86% Full Time, 12% Part Time, and 1% Contract. Highlights a job distribution of 77% Physical and 23% Remote.

Senior Hadoop Developer

The Timberline Group

Saint Louis, MO • On-site

Other

Posted 16 days ago


Job description

Senior Hadoop Developer to develop, create, and modify general computer applications software or specialized utility programs.
Job responsibilities and duties include:
  • Implement data analytics processing algorithms on Big Data batch and stream processing frameworks (e.g., Hadoop MapReduce, Python, Spark, Scala, Kafka).
  • Perform data acquisition, preparation, and analysis using a variety of data programming techniques in Spark using Scala.
  • Work on complex issues where analysis of situations and data requires an in-depth evaluation of variable factors.
  • Load data from different datasets and decide which file format is most efficient for a task. Hadoop Developers source large volumes of data from diverse data platforms into the Hadoop platform.
  • Install, configure, and maintain enterprise Hadoop environment.
  • Build distributed, reliable, and scalable data pipelines to ingest and process data in real time. Hadoop Developers deal with fetching impression streams, transaction behaviors, clickstream data, and other unstructured data.
  • Define Hadoop Job Flows and manage Hadoop jobs using Scheduler.
  • Review and manage Hadoop log files.
  • Design and implement column family schemas of Hive and HBase within HDFS.
  • Assign schemas and create Hive tables with suitable formats and compression techniques.
  • Mentor Big Data Developers on best practices and strategic development.
  • Develop efficient Pig and Hive scripts with joins on datasets using various techniques.
  • Apply different HDFS formats and structures, such as Parquet and Avro, to speed up analytics.
  • Fine tune Hadoop applications for high performance and throughput.
  • Troubleshoot and debug any Hadoop ecosystem run time issues.
  • Develop and document technical design specifications.
  • Design and develop data integration solutions (batch and real-time) to support enterprise data platforms including Hadoop, RDBMS, and NoSQL.
  • Lead technical meetings, as required, and convey ideas clearly and tailor communication based on selected audience (technical and non-technical).
  • Implement Spark Streaming architecture and integration with JMS queue with custom receivers.
  • Develop and deploy API services in Java Spring.
  • Create Hive and HBase data source connection to Spring.
  • Implement multi-threading in Java/Scala.

This position has no direct reports and does not supervise any other personnel.
Minimum requirements:
Bachelor's degree in Computer Science, Applied Computer Science, Engineering, or any related field of study, plus at least two (2) years of experience in the job offered or in any related position(s).
Qualified applicants must also have demonstrable proficiency, skill, experience, and knowledge with the following:
1. Hadoop/Big Data Ecosystem and Architecture
2. Hive, Spark, HBase, Sqoop, Impala, Kafka, Flume, Oozie, and MapReduce
3. Programming experience in Java, Scala, Python, and Shell Scripting
4. SQL and Data modelling
Work from home benefit offered.