ย AI/ML Data Scientist
ย Location: New York, NY
Role Summary:
Develop and evaluate machine learning and GenAI models to drive clinical, operational, or financial insights from healthcare data.
ย
Responsibilities:
- Train supervised and unsupervised models using Python (XGBoost, LightGBM, sklearn, PyTorch)
- Conduct data profiling, feature engineering, model evaluation using stratified validation
- Implement model explainability (SHAP, LIME) and bias audits
- Monitor model drift, fairness, and clinical relevance over time
- Fine-tune or prompt LLMs for use cases such as summarization or information extraction
- Build and evaluate multi-agent workflows using GenAI and frameworks like LangChain
- Design prompt libraries and benchmark hallucination rates, sensitivity to input phrasing
- Track experiments using MLflow, Weights & Biases
ย
Required Qualifications:
- 3โ5 years in applied data science or machine learning
- Experience with Snowflake SQL, Python ML ecosystem, healthcare data
ย
Preferred Qualifications:
- Exposure to Epic Clarity data or unstructured clinical notes
- Hands-on with transformer models, LLM APIs, and embedding-based retrieval