Senior Data Scientist
$110K - $125K/yr
Design/implement scalable data pipelines/ETL processes & CI/CD workflows for ingestion/preprocessing/aggregating blockchain & social media data. * Create dashboards/visualizations to deliver ...
Manhattan, NY · On-site
$126K - $151K/yr
Machine learning pipeline experience: feature stores, data preprocessing, model serving, MLOps tooling (e.g., MLflow, Feast, Airflow/Prefect) * Strong SQL and NoSQL skills; experience with Redis or ...
New York, NY · On-site
$64.50 - $84.50/hr
Experience with data preprocessing, feature engineering, and model evaluation techniques. * Familiarity with cloud platforms like AWS, Azure, or GCP for deploying AI models. * Knowledge of MLOps ...
New York, NY · On-site
$55 - $75.75/hr
Knowledge of data preprocessing, model validation, and evaluation techniques. * Strong problem-solving skills and ability to work in a fast-paced environment. Thanks, Sanjay Kumar sanjay.kumar@zodiac ...
$170K - $220K/yr
Conduct extensive EDA, feature engineering, and data preprocessing to ensure high-quality input for ML models. * Evaluate and optimize model performance using statistical and ML techniques. * Design ...
Develop and maintain AI pipelines, including data preprocessing, feature extraction, model training, and evaluation. Strong technical skills in machine learning and AI, especially natural language ...
$110K - $125K/yr
... preprocessing (cleaning/normalization/validation) for client accounts using rule-based logic/statistical checks to ensure data quality & prepare analysis-ready datasets for modeling/reporting.
Strong programming skills in R and/or Python for data preprocessing, data analysis, statistical modeling, and machine learning, as well as solid SQL skills for data querying and manipulation. Strong ...
New York, NY · On-site
Deep understanding of Data preprocessing, Prompt management, Caching, Validation, Advanced RAG, RLHF, and success measurement. * Thorough understanding of LLMOps, data pipelines and other common ...
Design and implement end-to-end ML pipelines, including data preprocessing, model training, evaluation, and deployment. * Collaborate with product and engineering teams to integrate generative ...
New York, NY · On-site
$222.27/hr
By the end of the course, students will be able to comfortably program in R for effective data preprocessing, analysis and presentation. This course does not require prior experience in programming ...
Create and optimize end-to-end machine learning pipelines, from data preprocessing to model deployment, ensuring scalability and performance. * Optimizing Model Performance: Continuously fine-tune ...
| Salary range | Share of jobs |
|---|---|
| $47K - $65.4K | 1% |
| $65.4K - $83.8K | 2% |
| $83.8K - $102.1K | 4% |
| $102.1K - $120.5K | 9% |
| $120.5K - $138.8K | 11% |
| $138.8K - $157.2K | 7% |
| $157.2K - $175.6K | 50% |
| $175.6K - $193.9K | 2% |
| $193.9K - $212.3K | 1% |
| $212.3K - $230.6K | 0% |
| $230.6K - $249K | 13% |

$136.1K is the 25th percentile. Wages below this are outliers. The median wage is $163.1K / yr. The chart axis runs from $47K to $249K, with a marker at $168.8K.
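The salary buckets above can be cross-checked against the stated 25th percentile and median by accumulating the bucket shares. The following is a minimal sketch, not part of the listing; the helper name `bucket_for_percentile` is hypothetical, and the bucket figures are copied from the distribution shown here.

```python
# (lower bound in $K, upper bound in $K, share of jobs in percent),
# copied from the salary distribution in this listing.
BUCKETS = [
    (47.0, 65.4, 1),
    (65.4, 83.8, 2),
    (83.8, 102.1, 4),
    (102.1, 120.5, 9),
    (120.5, 138.8, 11),
    (138.8, 157.2, 7),
    (157.2, 175.6, 50),
    (175.6, 193.9, 2),
    (193.9, 212.3, 1),
    (212.3, 230.6, 0),
    (230.6, 249.0, 13),
]


def bucket_for_percentile(buckets, pct):
    """Return the (lower, upper) bucket where the cumulative share first reaches pct."""
    cumulative = 0
    for lower, upper, share in buckets:
        cumulative += share
        if cumulative >= pct:
            return (lower, upper)
    raise ValueError("bucket shares never reach the requested percentile")


p25_bucket = bucket_for_percentile(BUCKETS, 25)     # bucket holding the 25th percentile
median_bucket = bucket_for_percentile(BUCKETS, 50)  # bucket holding the median
total_share = sum(share for _, _, share in BUCKETS)
```

Under these figures, the stated 25th percentile ($136.1K) and median ($163.1K) each fall inside the bucket the cumulative shares point to.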
| Aspect | Data Preprocessing | Data Analysis |
|---|---|---|
| Primary Focus | Cleaning, transforming, and preparing raw data for analysis | Interpreting data to extract insights and support decision-making |
| Skills Required | Data cleaning, scripting, understanding of data formats | Statistical analysis, data visualization, critical thinking |
| Work Environment | Data engineering teams, data science projects | Business intelligence, research, data science teams |
| Tools Used | Python, R, SQL, ETL tools | Excel, Tableau, R, Python, statistical software |
While data preprocessing involves preparing raw data for analysis by cleaning and transforming it, data analysis focuses on interpreting the prepared data to uncover trends and insights. Both roles are essential in the data pipeline but serve different purposes in the data lifecycle.
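The split between the two stages can be illustrated with a small sketch. It uses only the Python standard library; the records, field names, and both function names are made up for illustration and do not come from any of the listings.

```python
from statistics import mean

# Hypothetical raw records with inconsistent casing, stray whitespace,
# and one unparseable amount.
RAW = [
    {"region": " East ", "amount": "120.5"},
    {"region": "east", "amount": "n/a"},
    {"region": "WEST", "amount": "80"},
    {"region": "west", "amount": "100"},
]


def preprocess(rows):
    """Preprocessing: normalize text, parse numbers, drop rows that cannot be parsed."""
    cleaned = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount is not numeric
        cleaned.append({"region": row["region"].strip().lower(), "amount": amount})
    return cleaned


def analyze(rows):
    """Analysis: summarize the prepared data as the mean amount per region."""
    by_region = {}
    for row in rows:
        by_region.setdefault(row["region"], []).append(row["amount"])
    return {region: mean(values) for region, values in by_region.items()}


clean_rows = preprocess(RAW)
summary = analyze(clean_rows)
```

Here `preprocess` only makes the data usable, while `analyze` is where an interpretable result is produced.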

Analyze large-scale blockchain, transactional, and social media datasets to identify patterns, trends, anomalies, and risk indicators.
Develop and apply machine learning models, including graph-based algorithms and NLP techniques, for threat detection, behavioral analysis, and monitoring.
Design and implement scalable data pipelines, ETL processes, and CI/CD workflows for ingesting, preprocessing, and aggregating blockchain and social media data.
Today, CertiK supports thousands of enterprise clients and Web3 projects globally, with a distributed international team spanning North America, Asia, and Europe. The company is backed by leading investors including Coatue, Goldman Sachs, Insight Partners, and Sequoia Capital, and has been recognized by organizations such as the World Economic Forum and CB Insights for its contributions to blockchain security innovation.
The primary responsibility of this role is to build and maintain ETL pipelines and process large datasets from APIs, databases, and third-party platforms to enable real-time team analytics. The role also automates data preprocessing (cleaning, normalization, and validation) for client accounts, using rule-based logic and statistical checks to ensure data quality and prepare analysis-ready datasets for modeling and reporting.
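As an illustration of what rule-based logic combined with a statistical check might look like, here is a minimal standard-library sketch. The record layout, the validation rules, the function names, and the outlier threshold are all assumptions made for the example; the posting does not describe the employer's actual checks. The statistical check shown is a modified z-score based on the median absolute deviation.

```python
from statistics import median

# Hypothetical client-account records (illustrative values only).
RECORDS = [
    {"account_id": " a1 ", "balance": 100.0},
    {"account_id": "A2", "balance": 110.0},
    {"account_id": "", "balance": 90.0},     # missing id: fails a rule
    {"account_id": "A3", "balance": -20.0},  # negative balance: fails a rule
    {"account_id": "a4", "balance": 95.0},
    {"account_id": "A5", "balance": 105.0},
    {"account_id": "A6", "balance": 5000.0},  # extreme value: statistical outlier
]


def normalize(record):
    """Cleaning/normalization: trim and upper-case the account id."""
    account_id = str(record.get("account_id") or "").strip().upper()
    return {"account_id": account_id, "balance": record.get("balance")}


def passes_rules(record):
    """Rule-based validation: id present, balance numeric and non-negative."""
    balance = record["balance"]
    return (
        record["account_id"] != ""
        and isinstance(balance, (int, float))
        and balance >= 0
    )


def flag_outliers(values, threshold=3.5):
    """Statistical check: modified z-score using the median absolute deviation."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        return [False] * len(values)
    return [abs(0.6745 * (v - med) / mad) > threshold for v in values]


valid = [r for r in map(normalize, RECORDS) if passes_rules(r)]
flags = flag_outliers([r["balance"] for r in valid])
analysis_ready = [r for r, is_outlier in zip(valid, flags) if not is_outlier]
```

In this sketch the rules reject structurally invalid rows first, so the statistical check runs only on values that are already well-formed.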
The target annual salary for this role is $110,000 to $125,000. The exact compensation at which this job is filled will be determined by the skills and experience of qualified candidates.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Sourced by ZipRecruiter
Industry: Network security
Company size: 51 - 200 employees
Location: New York, NY, US
Founded: 2018