Hadoop Python Jobs in New York (NOW HIRING)

... and python scripting Thank you, Nanda Kishore /Phone: 646-300-7063 / Qualifications Hadoop and Cloudera Additional Information All your information will be kept confidential according to EEO ...

Hadoop AWS Spark PySpark Java Python Scala Job Description: Key Skills: * Big Data Technologies: Hadoop, Spark, HDFS, Hive, Cloudera, Hortonworks * Cloud Platforms: AWS (Glue, Lambda, Redshift, S3 ...

Hadoop Architect/Developer 12+ months Manhasset, NY 07/06/2015 I need a strong Hadoop Architect ... Skills with shell scripting, Python, Java, and/or C/C++ programming are also required.

Hadoop ecosystem (HDFS, Hive, Spark), PySpark, Python, Apache Kafka. Secondary: UI - Angular. Experience: Minimum 9 years. Roles & Responsibilities: Architectural Leadership: • Define end-to-end ...

Description: Minimum 1 year of building and coding applications using Hadoop components - HDFS, HBase, Hive, Sqoop, Flume, etc. Minimum 1 year of coding Java MapReduce, Python, Pig programming, Hadoop ...

Development experience in one of the following - Java, Python OR Scala * Capable of working with ... The Hadoop Developer will be responsible to Architect, design and develop code that consistently ...

SQL Developer

New York, NY · Remote

$46.25 - $63.50/hr

Proficiency in SQL, PL/SQL, T-SQL, Hadoop, Python, and Java. Performance Tuning: Experience in SQL optimization using SQL Profiler and Query Execution Plans. Processes: In-depth understanding of ETL ...

Python Developer

Jersey City, NJ · On-site

$52.50 - $72.25/hr

Python (Key), Big data, Hadoop (Preferred), Java Skills: Sound technologist, taking ownership, good problem solving and analytical skills. These skills are a must have Skills APPS NICHE SKILLS ...

Healthcare Data Engineer

New York, NY · Remote

$117K - $140K/yr

Technical Stack: Proficiency in SQL, PL/SQL, T-SQL, Hadoop, Python, and Java. Optimization: Hands-on experience in SQL optimization and tuning using SQL Profiler and Query Execution Plans.

Python Developer

Jersey City, NJ

$52.50 - $72.25/hr

Python (Key), Big data, Hadoop (Preferred), Java Skills: Sound technologist, taking ownership, good problem solving and analytical skills. These skills are a must have Skills APPS NICHE SKILLS ...

Hadoop Python information

Is Hadoop a good career?

Hadoop Python roles involve working with big data processing using Hadoop frameworks and Python programming. These jobs are in demand in data-driven industries, often requiring knowledge of distributed systems, data analysis, and related tools like Spark or Hive. Careers in this field can offer growth opportunities with relevant certifications and experience in data engineering or analytics.

Does Hadoop work with Python?

Hadoop can work with Python through tools like Hadoop Streaming, which allows developers to write MapReduce jobs in Python. Additionally, frameworks such as PySpark enable Python integration with Apache Spark, often used alongside Hadoop for big data processing. Knowledge of these tools is valuable for Hadoop Python roles.
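As a rough illustration of the Hadoop Streaming model mentioned above, here is a minimal word-count sketch. It assumes the usual Streaming convention of tab-separated key/value lines, with reducer input already sorted by key. The function names `map_lines`, `reduce_records`, and `run_locally` are hypothetical and chosen for this example; `run_locally` only imitates the shuffle-and-sort step with `sorted()` so the logic can be tried without a cluster.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one tab-separated (word, 1) record per word."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reduce_records(records):
    """Reducer: sum the counts for each word.

    Assumes records arrive sorted by key, as Hadoop Streaming
    delivers them to a reducer.
    """
    pairs = (record.rstrip("\n").split("\t", 1) for record in records)
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def run_locally(lines):
    """Imitate map -> sort -> reduce in one process (no cluster needed)."""
    return list(reduce_records(sorted(map_lines(lines))))
```

In an actual Streaming job, the mapper and reducer would typically live in separate scripts that read `sys.stdin` and print each record, and would be passed to the Hadoop Streaming jar through its `-mapper` and `-reducer` options. The jar location depends on the distribution, so it is not shown here.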

What are the key skills and qualifications needed to thrive as a Hadoop Python Developer, and why are they important?

To thrive as a Hadoop Python Developer, you need a strong understanding of distributed computing, Hadoop ecosystem components (such as HDFS, MapReduce, Hive, or Pig), and advanced Python programming skills, often supported by a degree in computer science or a related field. Familiarity with tools such as Apache Spark, Sqoop, and workflow schedulers (such as Oozie or Airflow), along with experience on big data platforms, is typically required. Problem-solving abilities, attention to detail, and effective communication help developers collaborate with teams and translate business requirements into scalable data solutions. These skills and qualifications are essential for efficiently processing and analyzing large datasets, ensuring data reliability, and driving business insights.

What is the highest paying job in Python?

The highest paying Python-related jobs include roles such as Machine Learning Engineer, Data Scientist, and Quantitative Analyst, often requiring advanced skills in algorithms, statistics, and frameworks like TensorFlow or scikit-learn. These positions typically offer salaries exceeding $120,000 annually, especially with experience and relevant certifications.

What is the difference between a Hadoop Python Developer and a Hadoop Java Developer?

Aspect | Hadoop Python | Hadoop Java Developer
Required Credentials | Python programming skills, Hadoop certifications | Java programming skills, Hadoop certifications
Work Environment | Data analysis, scripting, data pipeline development | Core development, system integration, big data application coding
Industry Usage | Data science, analytics, machine learning projects | Data infrastructure, platform development, system optimization

Hadoop Python and Hadoop Java Developer roles both involve working with Hadoop ecosystems, but Python focuses more on data analysis and scripting, while Java is geared towards core development and system integration. The choice depends on your programming expertise and career goals within big data environments.

What is a Hadoop Python developer?

A Hadoop Python developer is a software professional who specializes in using the Python programming language to develop, implement, and maintain applications that process and analyze large datasets within the Hadoop ecosystem. They use Python libraries such as PySpark to write scalable data processing scripts, interact with Hadoop components such as HDFS, and optimize big data workflows. These developers play a critical role in building data pipelines, performing data transformation, and supporting analytics projects in organizations that handle vast amounts of data.

What is the salary of a Hadoop engineer?

The salary of a Hadoop engineer typically ranges from $80,000 to $150,000 annually, depending on experience, location, and certifications. Skilled professionals with expertise in big data tools and programming languages like Python can command higher salaries in this field.

How do Hadoop Python developers typically collaborate with data engineers and analysts on large-scale data projects?

Hadoop Python developers frequently work alongside data engineers and analysts to design, implement, and optimize data pipelines for handling vast datasets. They are responsible for writing Python scripts that interface with Hadoop components, ensuring data is processed efficiently and meets project requirements. Regular communication with data engineers helps align on infrastructure and architectural decisions, while close collaboration with analysts ensures data outputs are accurate and actionable. Agile methodologies and daily stand-ups are common, fostering teamwork and quick problem-solving.

Cloud Data Platform Engineer

Global Channel Management

Manhattan, NY • On-site

$126K - $151K/yr

Other

Posted 19 days ago


Job description

Cloud Data Platform Engineer

New York, New York, United States

$ 87.00 - 88.00 (US Dollar)

Cloud Data Platform Engineer needs 5+ years implementing data applications or data platforms with BigData/Hadoop, Python/Java/Spark full stack, etc.

Cloud Data Platform Engineer requires:

  • 3 days a week, hybrid
  • Locations: Iselin, NJ; Charlotte, NC; New York, NY
  • Data Engineering 60-70%.
  • Infrastructure 30-40%. (Moving data in and out of the cloud)
  • Snowflake experience is preferred
  • Master's or Bachelor's degree in computer science or a related discipline
  • Manage and ensure delivery of project tasks as required
  • Excellent documentation skills to create and manage design, implementation and automation related documentation
  • Implementation experience for Hadoop distribution platforms like Cloudera or AWS EMR.
  • Extensive experience in designing, engineering and managing data lake ingestion, validation, transformation and consumption services leveraging cloud data tools like Hive, Spark, EMR, Glue ETL and Catalog, Snowflake etc.
  • Experience implementing solutions with workflow orchestration/scheduling tools such as Airflow, Autosys, etc.
  • Experience building CI/CD pipelines for infrastructure implementations with Terraform, GitLab, etc.
  • Must be able to multitask and prioritize own tasks
  • Must have experience working in an agile environment with multiple priorities

Preferred Skills:

  • 5+ years implementing data applications or data platforms with BigData/Hadoop, Python/Java/Spark full stack, etc.
  • 5+ years of support experience with Big Data technologies in the Hadoop ecosystem: Hive, HDFS, MapReduce, Spark, YARN, Kafka, Pig, HBase, Sqoop, Elasticsearch, Kerberos.
  • 5+ years of experience with ETL/ELT tools such as AWS Glue ETL, Talend, DataStage, etc.
  • 5+ years of experience with orchestration tools such as Airflow, Autosys, etc.
  • Experience with Jira and Agile methodology is a plus
  • Understanding of DevOps or CI/CD processes with respect to infrastructure management activities

Cloud Data Platform Engineer duties:

  • Design, develop, and deliver cloud datastore solutions and develop automation pipelines to migrate data sets from on-prem to cloud platforms. Practice infrastructure as code to develop automation routines and integration flows to manage the state of the datastore platform systems.
  • Provision datastores that are secure from the start and enable them with required security controls, including encryption, masking, and certificate/key rotation.
  • Collaborate with developers, analysts, and various system administrators to identify business requirements in designing efficient datastore solutions and interfaces.
  • Identify and document all system constraints, implications, and consequences of various proposed system changes.
  • Review technical documentation to guide system users and to assist with the ongoing operation, maintenance, and development of the system. Evaluate the efficiency and effectiveness of application operations and troubleshoot problems.
  • Provide expert level IT technical lead services, including the direction, evaluation, selection, configuration, implementation, and integration of new and existing technologies and tools in a cloud platform.

About Global Channel Management

Sourced by ZipRecruiter

Global Channel Management is a technology company that specializes in various types of recruiting and staff augmentation. Global Channel Management understands the challenges companies face in finding the skills and experience needed to fill gaps in day-to-day functions. Organizations need to reduce training and labor costs while still requiring the best talent for the job. GCM's ownership and management teams have extensive staffing, recruiting, HR, and executive leadership knowledge, experience, and expertise. Our understanding of and commitment to client satisfaction are key reasons GCM has been successful in establishing long-term relationships.

Industry

Recruiting and staffing services

Company size

11 - 50 Employees

Headquarters location

Austell, GA, US

Year founded

2009
