Databricks Data Engineer Jobs in New York (NOW HIRING)

Data Engineer

New York, NY · On-site

$125K - $150K/yr

Databricks Certified Data Engineer Associate * Microsoft Certified: Azure Fundamentals * Microsoft Certified: Azure Data Engineer Associate * Microsoft Exam: Designing and Implementing Microsoft ...

New

Databricks & Data Platform Development * Build and optimize data pipelines using Azure Databricks ... Partner with cloud, DevOps, and architecture teams for platform integration * Automation & AI ...

Azure Databricks Engineer

Iselin, NJ · On-site

$61 - $79.25/hr

Data Pipeline Development: * Build and maintain scalable ETL/ELT pipelines using Databricks ... Azure Data Engineer Associate or Databricks certified Data Engineer Associate certification ...

Data Engineer

Jericho, NY

$115K - $140K/yr

The Data Engineer will play a critical role in designing, building, and scaling enterprise data ... Databricks is also valued. The role partners closely with business, analytics, and engineering ...

Data Engineer

Jericho, NY · On-site

$115K - $140K/yr

The Data Engineer will play a critical role in designing, building, and scaling enterprise data ... Databricks is also valued. The role partners closely with business, analytics, and engineering ...

Data Engineer - Remote

Manhattan, NY · On-site +1

$126K - $151K/yr

Data Engineer Duration: 6-12 months Location: Remote Seeking a highly skilled and motivated Data ... SQL * Databricks, Collibra, and/or Alteryx. * Familiarity with cloud-based data platforms ...

Data Engineer

Jericho, NY · On-site

$115K - $140K/yr

The Data Engineer will play a critical role in designing, building, and scaling Kimco's enterprise ... Databricks is also valued. The role partners closely with business, analytics, and engineering ...

Data Engineer

New York, NY · On-site

$125K - $150K/yr

Create and manage workflows in Databricks and migrate existing Azure Data Factory pipelines. * Load ... Work with analytics, engineering, and business teams to deliver clean, ready-to-use datasets.

... Architect, Databricks Data Engineer Associate] is a plus - Designing and implementing thorough data architecture strategies - Developing and documenting data models and data flow diagrams ...

Data Engineer

Jersey City, NJ

$119K - $143K/yr

Senior Data Engineer with PySpark and Databricks Client: SMBC Location: Hybrid 3X a week in Jersey City (Locals only, final round is in person) Note: Candidates MUST be very hands on with good ...

New

... Databricks Data Engineer Associate] is a plus - Designing and implementing thorough data architecture strategies - Developing and documenting data models, data flow diagrams, and data architecture ...

Databricks Data Engineer information

New York salary details: $48.7K, $141.9K (average), $194.2K

How much do Databricks Data Engineer jobs pay per year?

As of Jun 12, 2026, the average yearly pay for a Databricks Data Engineer in New York is $141,914, according to ZipRecruiter salary data. Most workers in this role earn between $125,300 and $150,400 per year, depending on experience, location, and employer.

What is a Databricks Data Engineer job?

A Databricks Data Engineer is responsible for designing, building, and maintaining scalable data pipelines on the Databricks platform. They work with Apache Spark, Delta Lake, and cloud services to process large datasets efficiently. Their role involves data ingestion, transformation, optimization, and ensuring data quality for analytics and machine learning. Additionally, they collaborate with data scientists, analysts, and business teams to deliver reliable data solutions.

What does a typical day look like for a Databricks Data Engineer?

A typical day for a Databricks Data Engineer involves developing and maintaining scalable data pipelines, optimizing big data workflows using Spark, and collaborating with data scientists, analysts, and other engineers. You will regularly work within cloud environments to manage and process large datasets, conduct troubleshooting, and ensure data reliability and performance. Daily tasks may also include writing code, participating in team meetings, and implementing best practices for data security and governance. This role is highly collaborative, requiring frequent communication to align on project goals and address any technical challenges. The dynamic, project-based structure helps expand your skills and offers growth opportunities into senior engineering or data architecture roles.

What are the key skills and qualifications needed to thrive in the Databricks Data Engineer position, and why are they important?

To thrive as a Databricks Data Engineer, you need strong expertise in data engineering concepts, big data processing, and programming languages such as Python, Scala, or SQL, often supported by a degree in computer science or a related field. Proficiency in Databricks, Apache Spark, cloud platforms (like AWS, Azure, or GCP), and relevant certifications such as Databricks Certified Data Engineer are highly valued. Effective problem-solving, collaboration, and clear communication skills help engineers work efficiently within cross-functional teams. These skills are essential for designing scalable data pipelines, ensuring data quality, and delivering actionable analytics in dynamic business environments.

Data Engineer

Fusemachines

New York, NY • On-site

$125K - $150K/yr

Other

Posted yesterday


Job description

About Fusemachines

Founded in 2013, Fusemachines is a global provider of enterprise AI products and services, on a mission to democratize AI. Leveraging proprietary AI Studio and AI Engines, the company helps drive the clients’ AI Enterprise Transformation, regardless of where they are in their Digital AI journeys. With offices in North America, Asia, and Latin America, Fusemachines provides a suite of enterprise AI offerings and specialty services that allow organizations of any size to implement and scale AI. Fusemachines serves companies in industries such as retail, manufacturing, and government.

Fusemachines continues to actively pursue the mission of democratizing AI for the masses by providing high-quality AI education in underserved communities and helping organizations achieve their full potential with AI.

Important: Immigration Sponsorship Policy
This position is not eligible for employment visa sponsorship or transfer sponsorship now or in the future.

  • Direct Company Sponsorship: Such as H-1B, J-1, or TN visas.
  • Employer of Record: Listing Fusemachines as the immigration employer on any government documentation.
  • Written Documentation: Providing letters or other support for any work authorization (e.g., OPT, STEM OPT, CPT).

About the role

This is a remote full-time consulting position responsible for designing, building, and maintaining the infrastructure required for data integration, storage, processing, and analytics (BI, visualization and Advanced Analytics).

We are looking for a skilled Senior Data Engineer with a strong background in Python, SQL, PySpark, Azure, Databricks, Synapse, Azure Data Lake, DevOps and cloud-based large scale data applications with a passion for data quality, performance and cost optimization. The ideal candidate will develop in an Agile environment, contributing to the architecture, design, and implementation of Data products, including migration from Synapse to Azure Data Lake. This role involves hands-on coding, mentoring junior staff and collaboration with multi-disciplined teams to achieve project objectives.

Qualification & Experience

  • Must have a full-time Bachelor's degree in Computer Science or similar
  • At least 3 years of experience as a data engineer with strong expertise in Databricks, Azure, DevOps, or other hyperscalers.
  • 3+ years of experience with Azure DevOps, GitHub.
  • Proven experience delivering large scale projects and products for Data and Analytics, as a data engineer, including migrations.
  • Following certifications:
    • Databricks Certified Associate Developer for Apache Spark
    • Databricks Certified Data Engineer Associate
    • Microsoft Certified: Azure Fundamentals
    • Microsoft Certified: Azure Data Engineer Associate
    • Microsoft Exam: Designing and Implementing Microsoft DevOps Solutions (nice to have)

Required skills/Competencies

  • Strong programming skills in one or more languages such as Python (must have) or Scala, and proficiency in writing efficient and optimized code for data integration, migration, storage, processing and manipulation.
  • Strong understanding and experience with SQL and writing advanced SQL queries.
  • Thorough understanding of big data principles, techniques, and best practices.
  • Strong experience with scalable and distributed Data Processing Technologies such as Spark/PySpark (must have: experience with Azure Databricks), DBT and Kafka, to be able to handle large volumes of data.
  • Solid Databricks development experience with significant Python, PySpark, Spark SQL, Pandas, NumPy in Azure environment.
  • Strong experience in designing and implementing efficient ELT/ETL processes in Azure and Databricks and using open source solutions being able to develop custom integration solutions as needed.
  • Skilled in Data Integration from different sources such as APIs, databases, flat files, event streaming.
  • Expertise in data cleansing, transformation, and validation.
  • Proficiency with Relational Databases (Oracle, SQL Server, MySQL, Postgres, or similar) and NoSQL Databases (MongoDB or Table).
  • Good understanding of Data Modeling and Database Design Principles. Being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions.
  • Strong experience in designing and implementing data warehousing, data lake, and data lakehouse solutions in Azure and Databricks.
  • Good experience with Delta Lake, Unity Catalog, Delta Sharing, Delta Live Tables (DLT).
  • Strong understanding of the software development lifecycle (SDLC), especially Agile methodologies.
  • Strong knowledge of SDLC tools and technologies Azure DevOps and GitHub, including project management software (Jira, Azure Boards or similar), source code management (GitHub, Azure Repos or similar), CI/CD system (GitHub Actions, Azure Pipelines, Jenkins or similar) and binary repository manager (Azure Artifacts or similar).
  • Strong understanding of DevOps principles, including continuous integration, continuous delivery (CI/CD), infrastructure as code (IaC – Terraform, ARM including hands-on experience), configuration management, automated testing, performance tuning and cost management and optimization.
  • Strong knowledge in cloud computing specifically in Microsoft Azure services related to data and analytics, such as Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Azure Data Lake, Azure Stream Analytics, SQL Server, Azure Blob Storage, Azure Data Lake Storage, Azure SQL Database, etc.
  • Experience in Orchestration using technologies like Databricks workflows and Apache Airflow.
  • Strong knowledge of data structures and algorithms and good software engineering practices.
  • Proven experience migrating from Azure Synapse to Azure Data Lake, or other technologies.
  • Strong analytical skills to identify and address technical issues, performance bottlenecks, and system failures.
  • Proficiency in debugging and troubleshooting issues in complex data and analytics environments and pipelines.
  • Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent.
  • Experience with BI solutions including Power BI is a plus.
  • Strong written and verbal communication skills to collaborate and articulate complex situations concisely with cross-functional teams, including business users, data architects, DevOps engineers, data analysts, data scientists, developers, and operations teams.
  • Ability to document processes, procedures, and deployment configurations.
  • Understanding of security practices, including network security groups, Azure Active Directory, encryption, and compliance standards.
  • Ability to implement security controls and best practices within data and analytics solutions, including proficient knowledge and working experience on various cloud security vulnerabilities and ways to mitigate them.
  • Self-motivated with the ability to work well in a team, and experienced in mentoring and coaching different members of the team.
  • A willingness to stay updated with the latest services, Data Engineering trends, and best practices in the field.
  • Comfortable with picking up new technologies independently and working in a rapidly changing environment with ambiguous requirements.
  • Care about architecture, observability, testing, and building reliable infrastructure and data pipelines.

Responsibilities

  • Architect, design, develop, test and maintain high-performance, large-scale, complex data architectures, which support data integration (batch and real-time, ETL and ELT patterns from heterogeneous data systems: APIs and platforms), storage (data lakes, warehouses, data lake houses, etc), processing, orchestration and infrastructure. Ensure the scalability, reliability, and performance of data systems, focusing on Databricks and Azure.
  • Contribute to detailed design, architectural discussions, and customer requirements sessions.
  • Actively participate in the design, development, and testing of big data products.
  • Construct and fine-tune Apache Spark jobs and clusters within the Databricks platform.
  • Migrate out of Azure Synapse to Azure Data Lake or other technologies.
  • Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive).
  • Design and implement data models and schemas that support efficient data processing and analytics.
  • Design and develop clear, maintainable code with automated testing using Pytest, unittest, integration tests, performance tests, regression tests, etc.
  • Collaborate with cross-functional teams and Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components meeting product deliverables.
  • Evaluate and implement new technologies and tools to improve data integration, data processing, storage and analysis.
  • Evaluate, design, implement and maintain data governance solutions: cataloging, lineage, data quality and data governance frameworks that are suitable for a modern analytics solution, considering industry-standard best practices and patterns.
  • Continuously monitor and fine-tune workloads and clusters to achieve optimal performance.
  • Provide guidance and mentorship to junior team members, sharing knowledge and best practices.
  • Maintain clear and comprehensive documentation of the solutions, configurations, and best practices implemented.
  • Promote and enforce best practices in data engineering, data governance, and data quality.
  • Ensure data quality and accuracy.
  • Design, implement, and maintain data security and privacy measures.
  • Be an active member of an Agile team, participating in all ceremonies and continuous improvement activities, being able to work independently as well as collaboratively.

Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local law.
