Job Title: Senior PySpark Developer
Location: Iselin, NJ (onsite)
Duration: 12 months
Job Type: W2 contract
We are seeking an experienced Senior PySpark Developer to join our Data Engineering team in Iselin, NJ. In this role, you will design, develop, and optimize large-scale data processing pipelines using PySpark, Databricks, and AWS/Azure cloud services. You will collaborate with cross-functional teams to deliver high-performance data solutions that drive business intelligence, analytics, and reporting for our financial services platforms.
Key Responsibilities
Design & Development:
Develop and optimize PySpark applications for batch and real-time data processing.
Build scalable ETL/ELT pipelines using Databricks, Spark SQL, and Delta Lake.
Integrate data from multiple sources (databases, APIs, streaming platforms) into cloud-based data lakes/warehouses.
Implement data transformations, aggregations, and joins efficiently in distributed environments.
Performance Tuning & Optimization:
Optimize Spark jobs for performance (partitioning, caching, broadcast joins).
Troubleshoot and resolve data skew, memory issues, and job failures.
Monitor and fine-tune Databricks clusters for cost efficiency.
Cloud & Big Data Technologies:
Work with AWS (EMR, Glue, S3) or Azure (Synapse, Data Lake, ADF) for data storage and processing.
Implement data governance, security, and compliance best practices.
Collaboration & Leadership:
Partner with data scientists, analysts, and business teams to deliver actionable insights.
Mentor junior developers and enforce coding standards, testing, and CI/CD practices.
Required Skills & Experience
5+ years of hands-on PySpark development experience.
Strong expertise in Apache Spark (SQL, DataFrames, RDDs) and Databricks.
Proficiency in Python, SQL, and shell scripting.
Experience with cloud platforms (AWS/Azure) and big data tools (Hive, Kafka, Snowflake).
Knowledge of data modeling, partitioning, and performance optimization.
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
Preferred Qualifications
Financial services or banking industry experience.
Certifications in AWS/Azure, Databricks, or Spark.
Familiarity with CI/CD (Jenkins, GitLab) and infrastructure-as-code (Terraform).