Data Engineer - Python/PySpark

Diverse Lynx

Dallas, TX • On-site

$125K - $135K/yr

Other

This job posting has expired and is no longer accepting applications.


Job description

Data Engineer - Python/PySpark

Location: Dallas/Irving, TX (3 days onsite per week)

Full-time

Salary Range: $125,000-$135,000/Year

Job Description:

  • Strong hands-on development experience in Python, PySpark, and SQL.
  • Experience building large-scale ETL/ELT pipelines for structured and unstructured data.
  • Deep understanding of Spark and distributed computing fundamentals (transformations, shuffles, optimization).
  • Experience with big data frameworks such as Hadoop and Spark.
  • Proficiency with Git-based repositories (Bitbucket / GitHub).
  • Experience working with AWS, Azure, or GCP environments.
  • Strong understanding of database design, data modeling, and warehouse schemas (star/snowflake).
  • Experience with CI/CD automation and pipeline development.
  • Strong analytical and troubleshooting skills for resolving complex data issues.
  • Ability to collaborate with cross-functional teams and convert business requirements into technical solutions.
  • Design, develop, and maintain robust, scalable ETL/ELT pipelines.
  • Write efficient, reusable, and scalable code in Python and PySpark for distributed data processing.
  • Review existing data engineering code and identify opportunities for refactoring or performance improvement.
  • Implement data validation, cleansing, reconciliation, and quality checks across the data lifecycle.
  • Collaborate with IT and business stakeholders to understand data requirements and translate them into solutions.
  • Monitor pipeline performance, troubleshoot failures, and optimize for latency, throughput, and cost.
  • Participate in code reviews, enforce coding standards, and contribute to engineering best practices.
  • Build and maintain CI/CD pipelines for testing, packaging, and deployment of data pipelines.
  • Ensure data reliability, security, and consistency across environments.
  • Work with cloud services and big data platforms to support modern data architecture.

Please advise your interest in exploring this opportunity.

Sincerely Yours,
Shashi Bhushan
Diverse Lynx, LLC
300 Alexander Park Suite # 200
Princeton, NJ 08540
Tel: (732) 452-1006 Ext.344
shashi.bhushan@diverselynx.com
www.diverselynx.com

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.


About Diverse Lynx

Sourced by ZipRecruiter

Diverse Lynx, based in Princeton, NJ, US, is a reputable company in the Information Technology sector. The firm, as reflected through its website diverselynx.com, specializes in delivering comprehensive IT solutions. These solutions range from IT consulting to robust digital transformation strategies, IT staffing, and full-time placement services. The company was established in 2008, and it prides itself on providing simplified, efficient technology solutions designed to meet the unique needs of each client.

Industry

IT services

Company size

51 - 200 Employees

Headquarters location

Princeton, NJ, US

Year founded

2002

Frequently asked questions

Q: What skills or qualities help someone succeed as a Data Software Engineer?

A: To succeed as a Data Software Engineer, key technical skills include proficiency in programming languages such as Python, Java, or C++, as well as expertise in data structures, algorithms, and software development methodologies like Agile. Additionally, strong soft skills like effective communication, problem-solving, and collaboration are crucial, as Data Software Engineers often work with cross-functional teams and stakeholders to design, develop, and deploy data-driven solutions. By combining technical expertise with strong soft skills, Data Software Engineers can effectively drive business outcomes, innovate, and adapt to the rapidly evolving landscape of data technology.

Q: What is the career path for a Data Software Engineer?

A: A Data Software Engineer's typical career progression involves starting as a Junior Software Engineer, where they focus on developing and maintaining data-driven software applications, and gradually advancing to roles such as Senior Software Engineer, Technical Lead, or Data Architect, where they oversee large-scale data systems and lead cross-functional teams. Key opportunities for skill development include learning programming languages like Python, SQL, and Java, big data tools like Hadoop and Spark, and machine learning frameworks like TensorFlow and PyTorch. Long-term, Data Software Engineers may pursue leadership roles, such as Director of Engineering or Chief Technology Officer, or transition into related fields like data science, product management, or entrepreneurship.