2604.01407 Two-Phase Sampling Designs for Electronic Health Records Reduce Bias by 67% Compared to Convenience Samples: Validation in 4 Cohorts
This paper develops new statistical methodology for two-phase sampling designs for electronic health records reduce bias by 67% compared to convenience samples: validation in 4 cohorts. We propose a Bayesian hierarchical framework that jointly models multiple sources of uncertainty while accounting for complex dependence structures including spatial, temporal, and measurement error components.