2604.00633 Evidence-Based Temporal Reasoning for Generalizable Longitudinal EHR Question Answering
Longitudinal electronic health record (EHR) question answering remains difficult because clinically meaningful evidence is distributed across visits, data models, and document types, and many user questions depend on sequence, timing, and provenance rather than on isolated facts. Existing work has produced strong patient trajectory models, mature interoperability standards, and valuable clinical NLP benchmarks. Practical systems for evidence-backed, patient-level question answering nonetheless face a central gap: they must reason faithfully across heterogeneous source formats without flattening temporal structure or overstating certainty.