Required:
Expertise in causal inference methods: difference-in-differences, propensity score matching, synthetic controls, interrupted time series, RCT design, and related techniques.
Advanced programming skills in Python (pandas, PySpark, statsmodels) and R (dplyr, ggplot2, MatchIt, did, or similar).
Ability to design reproducible, modular, and scalable analytic pipelines.
Proficiency with GitHub, Confluence, and Jira for code collaboration, documentation, and project management.
Strong written and verbal communication skills; ability to translate complex analyses into actionable insights for stakeholders.
Strong collaboration skills and experience mentoring junior team members.
Ability to manage multiple projects and priorities in a fast-paced environment.
Ability to apply critical thinking skills in causal reasoning to address complex healthcare data problems.
Demonstrated ability to mentor and guide staff on methodology, coding, and documentation.
Strong communication skills, including documenting methodology, assumptions, and results for both technical and non-technical stakeholders.
Preferred:
Working knowledge of modern cloud-based analytic ecosystems (Snowflake, Azure).
Knowledge of visualization and reporting frameworks to communicate impact results to leadership.
Knowledge of Snowflake/Snowpark, cloud computing, and distributed data platforms.
Strong presentation skills, with experience briefing senior leadership on analytic findings and strategic implications.
Knowledge of health equity analytics and population segmentation for targeting interventions.