Sr. Data Engineer
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
Madison, WI · On-site
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
Madison, WI · On-site
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
Madison, WI · On-site
$115K - $138K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
Madison, WI · On-site
$115K - $138K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
Madison, WI · On-site
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
Madison, WI · On-site
$114K - $137K/yr
Optimize data flow performance and minimize data latency across scientific and business use cases ... Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.
Madison, WI · On-site
$35K - $56K/yr
Current student enrolled toward a degree in Computer Science, Artificial Intelligence, Data Science, or related field (Graduate level degree program preferred). * Demonstrated proficiency in Python ...
Madison, WI · On-site
$35K - $56K/yr
Current student enrolled toward a degree in Computer Science, Artificial Intelligence, Data Science, or related field (Graduate level degree program preferred). * Demonstrated proficiency in Python ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
Reporting to the Chief Science Officer of Acceligen, who leads the team in the Genetic Advancement ... Collect, organize, and analyze large datasets using programming languages like R, Python, or ...
Madison, WI · On-site
$36K - $57K/yr
Current student enrolled toward a Bachelor's degree in Computer Science, Engineering, Data Science, Mathematics, or a related field. * Demonstrated proficiency in Python and familiarity with AI ...
Madison, WI · On-site
$36K - $57K/yr
Current student enrolled toward a Bachelor's degree in Computer Science, Engineering, Data Science, Mathematics, or a related field. * Demonstrated proficiency in Python and familiarity with AI ...
Python, Scala, SQL development. * ETL data pipelines. * Designing and implementing data modeling ... Experience partnering with scientific research teams, such as biomarker discovery, computational ...
Python, Scala, SQL development. * ETL data pipelines. * Designing and implementing data modeling ... Experience partnering with scientific research teams, such as biomarker discovery, computational ...
Madison, WI · On-site
... Python and Java frameworks Dedicated leadership role in at least one full lifecycle of project ... Computer Science, Engineering or equivalent work experience Additional Information All your ...
Madison, WI · On-site
... Python and Java frameworks Dedicated leadership role in at least one full lifecycle of project ... Computer Science, Engineering or equivalent work experience Additional Information All your ...
The complex nature of the research requires a scientist with a blend of strong biology fundamentals and advanced tech skills (Python/R, ML, stats, databases, high-performance computing) to analyze ...
The complex nature of the research requires a scientist with a blend of strong biology fundamentals and advanced tech skills (Python/R, ML, stats, databases, high-performance computing) to analyze ...
The complex nature of the research requires a scientist with a blend of strong biology fundamentals and advanced tech skills (Python/R, ML, stats, databases, high-performance computing) to analyze ...
The complex nature of the research requires a scientist with a blend of strong biology fundamentals and advanced tech skills (Python/R, ML, stats, databases, high-performance computing) to analyze ...
Madison, WI · On-site
The complex nature of the research requires a scientist with a blend of strong biology fundamentals and advanced tech skills (Python/R, ML, stats, databases, high-performance computing) to analyze ...
Madison, WI · On-site
The complex nature of the research requires a scientist with a blend of strong biology fundamentals and advanced tech skills (Python/R, ML, stats, databases, high-performance computing) to analyze ...
$37.8K - $52.4K
2% of jobs
$52.4K - $66.9K
3% of jobs
$66.9K - $81.5K
6% of jobs
$81.5K - $96K
9% of jobs
$100.7K is the 25th percentile. Wages below this are outliers.
$96K - $110.6K
15% of jobs
The median wage is $120.3K / yr.
$110.6K - $125.2K
22% of jobs
$133.2K is the 75th percentile. Wages above this are outliers.
$125.2K - $139.7K
32% of jobs
$139.7K - $154.3K
3% of jobs
$154.3K - $168.9K
4% of jobs
$168.9K - $183.4K
1% of jobs
$183.4K - $198K
2% of jobs
$37.8K
$123.7K
$198K
| Aspect | Scientist Python | Data Analyst Python |
|---|---|---|
| Required Credentials | Bachelor's or Master's in Science, Data Science, or related fields; Python proficiency | Bachelor's in Statistics, Data Analysis, or related fields; Python skills |
| Work Environment | Research labs, R&D departments, tech companies | Business intelligence teams, marketing, finance departments |
| Employer & Industry Usage | Research institutions, tech firms, healthcare | Corporate, finance, retail, marketing |
| Common Search & Comparison | Yes | Yes |
Scientist Python and Data Analyst Python roles share similar skills like Python programming and data handling. However, Scientists typically focus on research, experimentation, and developing new models, often working in research-heavy environments. Data Analysts concentrate on interpreting existing data to inform business decisions, working mainly in corporate settings. Both roles require strong analytical skills and Python expertise, but their focus and work environments differ significantly.

$114K - $137K/yr
Full-time
Posted 20 days ago
This role is responsible for the design, development, and maintenance of data integration, analytics, and reporting solutions that support our animal genetics and bioinformatics workloads. The ideal candidate will possess expertise in Databricks and modern data engineering tools such as Azure Data Factory, combined with hands on experience working with biological, genomic, or other omics datasets. This position requires a proactive, self-motivated, and results-oriented individual with a passion for data, a strong understanding of data architecture and warehousing principles, and an appreciation for bioinformatics workflows in a commercial genetics environment.Â
ResponsibilitiesÂ
Data IntegrationÂ
Design, develop, and maintain robust and efficient ETL/ELT pipelines and processes on Databricks for both operational and bioinformatics datasets (e.g., genomic markers, phenotypic data, laboratory outputs).Â
Ingest, transform, and harmonize structured and semi-structured biological data from lab systems, LIMS, sequencing platforms, and external partners into the enterprise data platform.Â
Troubleshoot and resolve Databricks pipeline errors and performance issues.Â
Optimize data flow performance and minimize data latency across scientific and business use cases.Â
Implement data quality checks, validations, and reconciliation processes within ETL workflows, including domain-specific checks for genomic and phenotypic data.Â
Databricks DevelopmentÂ
Develop and maintain Databricks pipelines, notebooks, and datasets using Python, Spark, and SQL.Â
Optimize Databricks jobs for performance and cost-effectiveness, including largescale bioinformatics and analytics workloads.Â
Integrate Databricks with other data sources and systems, including lab instruments, genomic databases, and on-prem or cloud data stores.Â
Participate in the design and implementation of data lake architectures that support both traditional analytics and bioinformatics pipelines.Â
Data WarehousingÂ
Participate in the design and implementation of data warehousing solutions to support reporting, analytics, and scientific modeling.Â
Model and curate subject areas for genetics, reproduction, and bioinformatics (e.g., animals, pedigrees, genotypes, breeding values, trials).Â
Support data quality initiatives and implement data cleansing procedures across business and scientific domains.Â
Reporting and AnalyticsÂ
Collaborate with business users, scientists, geneticists, and bioinformaticians to understand data requirements for department-driven reporting and analytics needs.Â
Maintain and extend the existing library of complex dashboards and visualizations to surface genetic, reproductive, and operational insights.Â
Enable self-service analytics for R&D and product teams by exposing well- governed, documented data products.Â
Troubleshoot and resolve report issues, including performance bottlenecks and data inconsistencies.Â
Cloud Platform ExperienceÂ
Apply strong programming skills in Python, SQL, and Spark to build scalable data and bioinformatics workflows.Â
Use CI/CD and IaC tools (Terraform, ARM, CloudFormation) to automate deployment of data platform components and analytics environments.Â
Design and implement Databricks platform architecture on Azure and AWS infrastructure, including environments that support largescale scientific computation.Â
Contribute to cloud security, governance, and cost optimization practices for data and bioinformatics workloads.Â
Bioinformatics and Scientific CollaborationÂ
Partner with geneticists, biostatisticians, and bioinformaticians to translate scientific requirements into scalable data and platform architectures.Â
Support or orchestrate bioinformatics pipelines (e.g., variant processing, quality control, annotation, genotype imputation, genomic evaluation) using cloud and Databricks capabilities.Â
Ensure that data models, pipelines, and storage structures meet the needs of downstream analytics, predictive models, and genetic evaluations.Â
Advocate for best practices in managing sensitive biological and genetic data, including data governance, access control, and compliance with relevant standards and regulations.Â
Collaboration and CommunicationÂ
Thrive in an entrepreneurial, self-starting, and fast-paced environment, working both independently and with our highly skilled teams.Â
Collaborate effectively with business users, data analysts, scientists, and other IT teams.Â
Communicate technical information clearly and concisely, both verbally and in writing, to technical and nontechnical stakeholders.Â
Document all development work, data models, and procedures thoroughly, including bioinformatics and scientific data flows.Â
Continuous GrowthÂ
Keep abreast of the latest advancements in data integration, cloud platforms, bioinformatics tooling, and data engineering technologies.Â
Continuously improve skills and knowledge through training and self-learning in both data engineering and bioinformatics domains.Â
RequirementsÂ
Bachelor's degree in Computer Science, Information Systems, Bioinformatics, Computational Biology, or a related field; a Master's degree is an asset.Â
7+ years of experience in data integration and reporting, with experience designing and operating cloud-based data platforms.Â
Extensive experience with Databricks, including Python, Spark, and Delta Lake.Â
Strong proficiency with relational databases (e.g., SQL Server, RDS), including TSQL, stored procedures, and functions.Â
Experience with data warehousing concepts and best practices.Â
Experience with Microsoft Azure cloud platform; exposure to Microsoft Fabric is desirable.Â
Hands on experience working with biological, genomic, or other omics datasets in a bioinformatics or life sciences setting (e.g., sequence data, SNP arrays, GWAS outputs, phenotypic traits).Â
Familiarity with common bioinformatics tools, data formats (e.g., FASTQ, VCF, PLINK), and workflows is highly desirable.Â
Strong analytical and problem-solving skills, with the ability to reason about complex data and scientific requirements.Â
Excellent communication and interpersonal skills.Â
Ability to work independently and as part of a cross-functional team across IT, science, and business.Â
Experience with Agile methodologies.Â
Demonstrated background in bioinformatics or computational biology, preferably supporting genetics, breeding, or life science research in an applied or commercial context.Â
Must be legally authorized to work in the United States.
As a holding company with cooperative and private ownership, URUS is a family of businesses at the heart of the dairy and beef industry - Alta Genetics, GENEX, Genetics Australia, Leachman Cattle, Jetstream, PEAK, SCCL, Trans Ova Genetics and VAS. Each organization has its unique identity, products, and services. These companies work globally to provide cutting-edge dairy and beef genetics, customized reproductive services to maximize conceptions, dairy management information to take producers to the frontline of progressive dairy farming, and an array of products and services to help bovines reach their full genetic potential. URUS has 9 brands in 17 retail countries and employs nearly 2,800 people globally.