Understanding Health Policy Through Data
Health policy decisions affect millions of lives and billions of dollars in public spending. Yet the data underlying these decisions is often more complex than it appears. Our health policy research applies applied economics methods to questions about healthcare spending, program integrity, and the gap between what data shows and what commentators claim.
Medicaid and Program Integrity
Medicaid serves over 90 million Americans and accounts for roughly one-sixth of national health spending. When HHS released 227 million rows of billing data in February 2026, amateur analysts claimed to find billions in fraud within hours. Our four-part series examines what that data can and cannot actually support.
The findings challenge quick conclusions: the dataset contains only 7 columns with no diagnosis codes, 40% of provider exclusions in the federal LEIE database are not fraud-related, and billing volume more closely proxies patient need than fraudulent behavior. A simple logistic regression matched the performance of complex machine learning models when validated honestly.
Why Health Policy Needs Better Methods
Health policy analysis sits at the intersection of several challenges: large administrative datasets with limited variables, strong political incentives to find dramatic results, and real consequences when analysis goes wrong. Providers flagged as suspicious may lose their ability to serve vulnerable populations. Genuine fraud may go undetected when screening methods optimize for volume rather than behavioral anomalies.
Our approach treats health policy questions the same way we treat food security or education policy: start with the data limitations, document every assumption, and let the analysis speak for itself.
Research Methodology
Our health policy research draws on:
- Administrative data: CMS Medicaid billing records, LEIE exclusion database, NPI provider registry
- Statistical methods: Logistic regression, classification validation, honest train-test splitting
- Policy analysis: Cost-effectiveness frameworks, program evaluation design