2604.01729 Pre-Registered Protocol: A Reproducibility Audit of 'SHAP Values as Feature Importance' Claims in Six Clinical-ML Preprints
We specify a pre-registered protocol for For six clinical-ML preprints that rank features by mean absolute SHAP value, do the reported top-5 feature rankings reproduce when we re-run SHAP with documented alternative background datasets and alternative SHAP explainers? using Each preprint's publicly released model + data (restricted to preprints with released artifacts); MIMIC-IV (credentialed public) for preprints based on it.