Machine Learning Data Associate Jobs in Philadelphia, PA

Principal Machine Learning Engineer

We are looking for a Principal Machine Learning Engineer to serve as a hands-on technical leader ... Perform exploratory data analysis, data quality assessment, feature engineering, model training ...

Medical Guardian

Principal Machine Learning Engineer

Philadelphia, PA · Remote

Medical Guardian

Principal Machine Learning Engineer

Philadelphia, PA · On-site

Quick apply

Medical Guardian

Principal Machine Learning Engineer

Philadelphia, PA · On-site

Penn Medicine

Senior Machine Learning Software Engineer

Philadelphia, PA · On-site

$123K - $163K/yr

PennDNA Data Science Location : 3600 Civic Center Blvd, Philadelphia, PA Hours: M-F, Daylight Summary : Working with a team of data scientists and ML engineers, the Senior Machine Learning Software ...

Penn Medicine

Senior Machine Learning Software Engineer

Philadelphia, PA · On-site

$123K - $163K/yr

Medical Guardian

Principal Machine Learning Engineer

Philadelphia, PA · On-site

Medical Guardian

Principal Machine Learning Engineer

Philadelphia, PA · On-site

Comcast

Principal Machine Learning Engineer

Philadelphia, PA · On-site +1

Partner closely with Product, Engineering, and Data teams to align ML capabilities with business ... Build end-to-end machine learning solutions-from data pipelines to model deployment, monitoring ...

Comcast

Principal Machine Learning Engineer

Philadelphia, PA · On-site +1

Susquehanna International Group, LLP

Machine Learning Researcher - PhD: 2026

Philadelphia, PA · On-site

As a Machine Learning Researcher, you will apply advanced ML techniques to a wide range of ... Hands-on experience applying deep learning on time series data * Strong foundation in mathematics ...

Susquehanna International Group, LLP

Machine Learning Researcher - PhD: 2026

Philadelphia, PA · On-site

Varsity Tutors

Data Science Tutor

Trenton, NJ · Remote

$18 - $40/hr

Deep knowledge of statistical analysis, data wrangling, exploratory data analysis, machine learning, data visualization, SQL, Python or R programming, hypothesis testing, and communication of data ...

Varsity Tutors

Data Science Tutor

Trenton, NJ · Remote

$18 - $40/hr

Instacart

Machine Learning Engineer (PhD Intern)

Wilmington, DE · On-site

We use machine learning and Internet-scale data to elevate customer experience, improve efficiency, and reduce cost. As an example, we manage catalog data imported from hundreds of retailers, and we ...

Instacart

Machine Learning Engineer (PhD Intern)

Wilmington, DE · On-site

Instacart

Machine Learning Engineer (PhD Intern)

West Chester, PA · On-site

Instacart

Machine Learning Engineer (PhD Intern)

West Chester, PA · On-site

Instacart

Machine Learning Engineer (PhD Intern)

Philadelphia, PA · On-site

Instacart

Machine Learning Engineer (PhD Intern)

Philadelphia, PA · On-site

Varsity Tutors

Data Science Tutor

Chester, PA · Remote

$18 - $40/hr

Varsity Tutors

Data Science Tutor

Chester, PA · Remote

$18 - $40/hr

Instacart

Machine Learning Engineer (PhD Intern)

Trenton, NJ · On-site

Instacart

Machine Learning Engineer (PhD Intern)

Trenton, NJ · On-site

Pwc

AML and Sanctions- Data Scientist- Senior Associate

Philadelphia, PA

$77K - $202K/yr

... Associate & Summary At PwC, our people in data and analytics focus on leveraging data to drive ... Responsibilities - Apply machine learning and natural language processing to financial crime ...

Pwc

AML and Sanctions- Data Scientist- Senior Associate

Philadelphia, PA

$77K - $202K/yr

Postdoctoral Scholar, AI/ML for drug metabolite prediction and LC-MS analytical chemistry

Spring House, PA

Python, Machine Learning, Deep Learning, Data Science, General Chemistry Preferred Area of Study: AI/ML for Chemistry Preferred Knowledge, Skills and Abilities : * Strong understanding of analytical ...

Postdoctoral Scholar, AI/ML for drug metabolite prediction and LC-MS analytical chemistry

Spring House, PA

Data Cloud Merge

Entry Level Business Analyst

Philadelphia, PA

Solid understanding of statistical analysis, predictive analytics, machine learning, data mining, quantitative analytics, and optimization algorithms. * Advanced working SQL knowledge and experience ...

Data Cloud Merge

Entry Level Business Analyst

Philadelphia, PA

Data Cloud Merge

Entry Level Business Analyst

Philadelphia, PA · On-site

Data Cloud Merge

Entry Level Business Analyst

Philadelphia, PA · On-site

Comcast

Principal Machine Learning Engineer

Philadelphia, PA · On-site +1

Partner closely with Product, Engineering, and Data teams to align ML capabilities with business ... Build end-to-end machine learning solutions--from data pipelines to model deployment, monitoring ...

Comcast

Principal Machine Learning Engineer

Philadelphia, PA · On-site +1

SynergisticIT

Java Programmer with Cloud (Junior Level)

Philadelphia, PA · On-site

... For data Science/Data Analyst/AI/Machine learning Positions Preferred SKILLS Associate or Bachelors degree or Masters degree in Computer Science, Computer Engineering, Electrical Engineering ...

SynergisticIT

Java Programmer with Cloud (Junior Level)

Philadelphia, PA · On-site

Vangard, Inc.

Data Engineer, Specialist

Wayne, PA

$103K - $124K/yr

Machine Learning * Data Governance * Minimum of five years data analytics, programming, database administration, or data management experience. * Undergraduate degree or equivalent combination of ...

Vangard, Inc.

Data Engineer, Specialist

Wayne, PA

$103K - $124K/yr

Vanguard Group, Inc.

Data Engineer, Specialist

Wayne, PA · On-site

$103K - $124K/yr

Vanguard Group, Inc.

Data Engineer, Specialist

Wayne, PA · On-site

$103K - $124K/yr

Showing results 1-20

Machine Learning Data Associate Jobs in Philadelphia, PA

Machine Learning Data Associate information

See Philadelphia, PA salary details

$18

$31

How much do machine learning data associate jobs pay per hour?

As of Jun 28, 2026, the average hourly pay for machine learning data associate in Philadelphia, PA is $18.91, according to ZipRecruiter salary data. Most workers in this role earn between $15.53 and $20.14 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Machine Learning Data Associate, and why are they important?

To thrive as a Machine Learning Data Associate, you need strong analytical skills, attention to detail, and a basic understanding of data annotation and labeling processes, often supported by a degree in computer science or a related field. Familiarity with data management tools, annotation platforms, and sometimes scripting languages like Python is typically required. Strong communication, collaboration, and problem-solving abilities help you work efficiently with data science teams and ensure high-quality outcomes. These skills and qualities are crucial for producing accurate datasets that directly impact the effectiveness of machine learning models.

What is the salary of ML data associate?

The salary of a Machine Learning Data Associate typically ranges from $40,000 to $70,000 annually, depending on experience, location, and company size. Entry-level positions may start lower, while experienced professionals with specialized skills in data annotation and tools like Python or SQL can earn higher salaries.

What are Machine Learning Data Associates?

Machine Learning Data Associates are professionals who support the development of machine learning models by preparing, labeling, and validating data sets. Their work ensures that data used for training algorithms is accurate, consistent, and properly annotated. They may also assist with data cleaning, quality checks, and sometimes basic data analysis tasks. This role is crucial in industries where high-quality labeled data is essential for building effective AI systems.

What is the difference between Machine Learning Data Associate vs Data Analyst?

Aspect	Machine Learning Data Associate	Data Analyst
Required Skills	Data cleaning, labeling, basic programming, understanding of ML workflows	Data interpretation, visualization, statistical analysis
Work Environment	Tech companies, AI startups, research labs	Business, finance, marketing, healthcare sectors
Common Certifications	Data Science certifications, Python, SQL	Excel, Tableau, SQL certifications

The main difference is that Machine Learning Data Associates focus on preparing and labeling data specifically for machine learning models, while Data Analysts interpret data to generate insights for business decisions. Both roles require strong data skills and often overlap, but their primary objectives and work environments differ.

Is ML data associate a good job?

A Machine Learning Data Associate role involves preparing and managing data for machine learning models, often requiring skills in data cleaning, annotation, and familiarity with tools like Python or SQL. It can be a good entry-level position for those interested in AI and data science, offering opportunities to develop technical skills and gain industry experience. Job satisfaction depends on individual interests and career goals in technology and data fields.

How much do ML data associates make in the US?

Machine Learning Data Associates in the US typically earn between $35,000 and $60,000 annually, depending on experience, location, and employer. Entry-level positions may start lower, while those with specialized skills in data annotation, labeling, or familiarity with tools like Labelbox or CVAT can command higher salaries.

How does a Machine Learning Data Associate typically collaborate with data scientists and engineers within a project team?

As a Machine Learning Data Associate, you play a vital role in supporting data scientists and engineers by annotating, cleaning, and organizing large datasets to ensure high data quality. You'll frequently communicate with team members to clarify labeling guidelines, provide feedback on data inconsistencies, and report any edge cases encountered during annotation. This collaboration ensures that the datasets used for training machine learning models are accurate and comprehensive, directly impacting the success of the project. Expect regular team meetings and ongoing feedback loops to maintain alignment with evolving project requirements.

What does a machine learning data associate do?

A machine learning data associate is responsible for collecting, cleaning, and organizing data used to train machine learning models. They ensure data quality and consistency, often using tools like SQL, Python, or data annotation platforms, to support accurate model development and deployment.

Machine Learning Data Associate jobs near you

Infographic showing various Machine Learning Data Associate job openings in Philadelphia, PA as of June 2026, with employment types broken down into 1% As Needed, 84% Full Time, 11% Part Time, 1% Temporary, 2% Contract, and 1% Nights. Highlights an 88% Physical, 3% Hybrid, and 9% Remote job distribution, with an average salary of $39,328 per year, or $18.9 per hour.

Principal Machine Learning Engineer

Medical Guardian

Philadelphia, PA • Remote

Apply

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 8 days ago

Job description

About Medical Guardian:

Founded in 2005, Medical Guardian is a fast-growing digital health and safety company on a mission to help people live a life without limits. With 13 consecutive years on the Inc. 5000 list of Fastest Growing Companies, we're redefining what it means to age confidently and independently.

We support over 625,000 members nationwide with life-saving emergency response systems and remote patient monitoring solutions. Trusted by families, healthcare providers, and care managers, our work is powered by a culture of innovation, compassion, and purpose.

Medical Guardian boasts a 95% customer satisfaction rate, a #1 ranking on 16 medical alert consumer choice sites and achieves a 4.7+ star rating on Google Reviews.

Position Overview:

We are looking for a Principal Machine Learning Engineer to serve as a hands-on technical leader for machine learning, predictive modeling, scoring, decisioning, and applied AI initiatives. This role will primarily focus on building, validating, deploying, and improving machine learning models, while also bringing principal-level judgment to problem definition, model design, stakeholder engagement, and production readiness.

This is a hands-on model-building role first. The ideal candidate should be comfortable spending most of their time working directly with data, features, models, scoring logic, validation methods, production workflows, and model improvement. They should also be able to operate with the maturity of a principal-level engineer: shaping unclear problems, making pragmatic technical decisions, mentoring others, and driving work forward without waiting for perfect requirements.

Key Responsibilities:

Hands-On Model Development

Build, test, validate, and improve machine learning models for scoring, prediction, prioritization, risk detection, engagement, intervention targeting, and decision support.
Perform exploratory data analysis, data quality assessment, feature engineering, model training, model selection, and performance evaluation.
Develop practical ML models that balance predictive performance, explainability, stability, maintainability, and business usefulness.
Work with structured, semi-structured, and operational data to create model-ready datasets and reusable features.
Use tools such as Python, SQL, Spark, Databricks, MLflow, scikit-learn, XGBoost, or similar platforms and libraries.
Move quickly from data exploration to prototype to validated model to production-ready capability.

Scoring, Scorecards, and Transparent Models

Design and implement predictive scores, risk tiers, score bands, thresholds, cut points, and intervention logic.
Build transparent and interpretable models where explainability is important, including logistic regression, generalized linear models, decision trees, monotonic models, calibrated models, scorecard-style models, or explainable boosting approaches.
Evaluate models for accuracy, calibration, stability, drift, fairness, interpretability, and operational usefulness.
Help stakeholders understand what a score represents, how it should be used, how it should not be used, and how changes in the score should be interpreted.
Document model logic, features, assumptions, limitations, validation results, and recommended usage in a way that business and technical stakeholders can understand.
Define the evidence needed to show that a model or score is valid, stable, explainable, actionable, and useful.

Production ML and MLOps

Partner with data engineering, analytics engineering, platform engineering, and application engineering teams to move models from experimentation into reliable production workflows.
Support model deployment, batch scoring, real-time or near-real-time inference, model versioning, monitoring, retraining, and performance tracking.
Help define data pipelines, feature pipelines, inference flows, model outputs, feedback loops, and monitoring requirements.
Ensure models are observable, supportable, secure, scalable, and aligned with enterprise architecture and governance expectations.
Establish practical monitoring and feedback loops to determine whether models continue to perform and create value over time.

Product and Rapid-Build Execution

Operate effectively in a rapid-build, startup-like environment where speed, ownership, and pragmatic decision-making are important.
Turn early-stage ideas, ambiguous business needs, and rough concepts into working ML products, scores, prototypes, and production capabilities.
Bring a product-engineering mindset to ML development, including user needs, workflow integration, adoption, usability, feedback loops, and measurable outcomes.
Drive work forward without waiting for perfect requirements, while still identifying critical assumptions, risks, dependencies, and evidence needed before scaling.
Partner with business and product stakeholders to define MVPs, iterate quickly, learn from usage, and improve models over time.
Make smart tradeoffs between quick prototypes, durable platforms, transparent models, GenAI-enabled workflows, and longer-term ML architecture.

Generative AI and AI Automation

Support the design and development of GenAI-enabled solutions, including LLM-powered workflows, RAG, summarization, conversational agents, document intelligence, and decision-support tools.
Help evaluate when GenAI is appropriate versus when traditional ML, rules, analytics, or transparent scoring models are a better fit.
Partner with product, engineering, and business stakeholders to integrate predictive models, scores, and GenAI outputs into practical workflows.
Apply appropriate evaluation, guardrails, monitoring, privacy controls, and human-in-the-loop processes for GenAI use cases.
Help the organization balance innovation with explainability, safety, reliability, privacy, and operational usefulness.

Requirement Shaping and Stakeholder Partnership

Work directly with business, product, analytics, operations, and engineering stakeholders to clarify what a model is intended to predict, explain, recommend, or trigger.
Translate business questions into measurable ML objectives, target variables, features, validation approaches, and success metrics.
Ask practical questions early: who will use the score, what action will it inform, what does a false positive or false negative mean, and how will we know the model is creating value?
Communicate model behavior, tradeoffs, limitations, and recommended usage clearly to both technical and non-technical audiences.
Help the team avoid becoming an AI ticket factory by shaping solutions, not just executing requests.

Principal-Level Technical Leadership

Provide technical leadership through hands-on example, strong engineering judgment, and clear recommendations.
Proactively identify model risks, data gaps, unclear requirements, design issues, and opportunities for improvement.
Help establish practical standards for model development, validation, documentation, monitoring, and production readiness.
Mentor other engineers and data scientists through code reviews, design reviews, modeling guidance, and shared best practices.
Demonstrate high ownership by driving clarity, execution, and continuous improvement.

Required Qualifications:

8+ years of professional experience in machine learning, data science, software engineering, analytics engineering, applied AI, or related technical fields.
5+ years of hands-on machine learning model development experience, including feature engineering, model training, validation, evaluation, and iteration.
3+ years of experience deploying, operationalizing, or supporting models in production or business-critical environments.
Strong hands-on experience with Python and SQL.
Experience with modern ML and data platforms such as Databricks, Spark, MLflow, Snowflake, Azure, AWS, or similar technologies.
Strong understanding of model evaluation, calibration, thresholding, score interpretation, monitoring, drift, retraining, and production ML lifecycle management.
Experience translating ambiguous business problems into concrete ML designs, model requirements, validation plans, and measurable outcomes.
Ability to explain model behavior, model performance, assumptions, limitations, and tradeoffs to both technical and non-technical stakeholders.
Strong engineering discipline, including clean code, reproducibility, versioning, testing, documentation, and maintainability.
Ability to work independently as a senior hands-on contributor while also providing technical leadership and modeling judgment.

Preferred Qualifications:

10+ years of relevant professional experience in ML, data science, applied AI, software engineering, decisioning systems, commercial software, or production analytics.
Experience building scorecards, risk scores, health scores, engagement scores, churn scores, fraud scores, credit-style models, prioritization models, or operational decision-support models.
Experience with transparent or interpretable models such as logistic regression, GLMs, GAMs, decision trees, monotonic models, calibrated models, scorecard-based models, or Explainable Boosting Machines.
Experience designing score bands, thresholds, risk tiers, intervention rules, recommended actions, or decision logic based on model outputs.
Experience working in commercial software, SaaS, digital products, gaming, fintech, healthtech, consumer technology, marketplace, or other product-driven environments.
Experience building ML, AI, analytics, or decisioning capabilities embedded into customer-facing products, operational workflows, commercial platforms, or revenue-impacting systems.
Experience in startup, scale-up, innovation lab, new product development, or rapid-build environments where the candidate had to operate with ambiguity and drive work forward independently.
Experience partnering with product managers, designers, software engineers, business leaders, and operational teams to turn ML models into usable product capabilities.
Experience building MVPs, validating assumptions, iterating based on feedback, and maturing prototypes into production systems.
Experience with GenAI, LLMs, RAG, AI agents, prompt engineering, model evaluation, conversational AI, summarization, document intelligence, or AI-enabled workflow automation.
Experience combining traditional ML models with GenAI-enabled workflows, such as using predictive scores to trigger outreach, summarize customer/member context, recommend next actions, or support human decision-making.
Experience in healthcare, population health, remote patient monitoring, insurance, financial services, safety, operations, or other domains where model trust and explainability are important.
Experience with MLOps practices including model registries, deployment pipelines, monitoring, drift detection, retraining strategies, and model governance.

Success in This Role Looks Like:

High-quality models and scores are built, validated, deployed, monitored, and improved over time.
Model outputs are explainable and trusted by business and operational stakeholders.
Scores are connected to real decisions, workflows, interventions, or measurable outcomes.
The organization moves faster because this person can turn ambiguity into working ML capabilities.
The ML team has stronger standards for model development, validation, documentation, monitoring, and production readiness.
Business partners understand what the models do, how to use them, where their limitations are, and how to interpret changes in outputs.
The team avoids building models in isolation and instead builds ML capabilities that are connected to products, workflows, users, and business value.
GenAI is applied thoughtfully where it improves workflow, decision support, summarization, automation, or user experience, without replacing appropriate model governance or human judgment.

Benefits

Health Care Plan (Medical, Dental & Vision)
Paid Time Off (Vacation, Sick Time Off & Holidays)
Company Paid Short Term Disability and Life Insurance
Retirement Plan (401k) with Company Match

About Medical Guardian

Sourced by ZipRecruiter

Industry

Manufacturing

Company size

201 - 500 Employees

Headquarters location

Philadelphia, PA, US

Year founded

2005

Website

medicalguardian.com

Social media

View All Medical Guardian Jobs

Apply

Machine Learning Data Associate Jobs in Philadelphia, PA

Principal Machine Learning Engineer

Principal Machine Learning Engineer

Principal Machine Learning Engineer

Principal Machine Learning Engineer

Senior Machine Learning Software Engineer

Senior Machine Learning Software Engineer

Principal Machine Learning Engineer

Principal Machine Learning Engineer

Principal Machine Learning Engineer

Principal Machine Learning Engineer

Machine Learning Researcher - PhD: 2026

Machine Learning Researcher - PhD: 2026

Data Science Tutor

Data Science Tutor

Machine Learning Engineer (PhD Intern)

Machine Learning Engineer (PhD Intern)

Machine Learning Engineer (PhD Intern)

Machine Learning Engineer (PhD Intern)

Machine Learning Engineer (PhD Intern)

Machine Learning Engineer (PhD Intern)

Data Science Tutor

Data Science Tutor

Machine Learning Engineer (PhD Intern)

Machine Learning Engineer (PhD Intern)

AML and Sanctions- Data Scientist- Senior Associate

AML and Sanctions- Data Scientist- Senior Associate

Postdoctoral Scholar, AI/ML for drug metabolite prediction and LC-MS analytical chemistry

Postdoctoral Scholar, AI/ML for drug metabolite prediction and LC-MS analytical chemistry

Entry Level Business Analyst

Entry Level Business Analyst

Entry Level Business Analyst

Entry Level Business Analyst

Principal Machine Learning Engineer

Principal Machine Learning Engineer

Java Programmer with Cloud (Junior Level)

Java Programmer with Cloud (Junior Level)

Data Engineer, Specialist

Data Engineer, Specialist

Data Engineer, Specialist

Data Engineer, Specialist

Machine Learning Data Associate information

See Philadelphia, PA salary details

How much do machine learning data associate jobs pay per hour?

Principal Machine Learning Engineer

Share this job

Job description

About Medical Guardian

Industry

Company size

Headquarters location

Year founded

Website

Social media

Share this job