Projects
This page features publicly available projects and publications. Additional proprietary work from industry roles is not listed here.
Project Highlights
Situation
High-stakes domains like medicine need decision models that practitioners can inspect, trust, and explain — not black-box algorithms.
Task
Build an accessible R tool for constructing fast-and-frugal decision trees grounded in academic decision science.
Action
Designed and published the FFTrees R package with a companion algorithm, visualization tools, and an interactive web builder.
Result
Published in Judgment and Decision Making and cited over 100 times — with applications in radiation dose triage for lung cancer, lung cancer screening criteria, and clinical course prediction in respiratory disease.
Situation
A team of data scientists at a large pharmaceutical company relied on thousands of lines of fragile, undocumented SQL and SAS — slow to run, hard to audit, and inaccessible to non-expert users.
Task
Overhaul the team's end-to-end EHR data pipeline into a reproducible, human-readable system.
Action
Designed and led development of FlatironKitchen — an internal R package covering database connections, cohort derivation, variable creation, and visualization in a unified, human-readable API.
Result
Became the team's analytical backbone; presented at rstudio::conf 2020 to an audience of data scientists across industry. A senior epidemiologist reported: "It's like heaven."
Situation
A provider of longitudinal real-world claims data needed a way for clients to get fast, rigorous answers to common health economics and outcomes research (HEOR) questions — without waiting weeks for a bespoke study.
Task
Design and build an MVP interface for running custom cohort analyses that follow rigorous scientific standards.
Action
Led MVP design and development in collaboration with engineers, epidemiologists, biostatisticians, and senior leadership, building an R- and SQL-powered dashboard with ICD/CPT-based cohort builders and configurable comparison arms.
Result
Delivered a fully functional prototype that produces clinical, demographic, cost, and outcome summaries in minutes, giving clients a no-code path to decide whether a full study was worth pursuing.
Open-Source Scientific Projects
Publications, open-source packages, and books — distinct from applied industry work.
See me speak
A selection of technical talks given at data science conferences, with recorded video available online.
Using R to Derive Robust Insights from Real-World Health Care Data
Virtual
FlatironKitchen: How we overhauled a Frankensteinian SQL workflow with the Tidyverse
San Francisco, CA
FFTrees: An R package to create, visualise and use fast and frugal decision trees
Brussels, Belgium