Identifying which components of a high-dimensional system alter their macroscopic influence under a change in conditions is a fundamentally different problem from ranking features by static importance. The former requires reasoning about how predictive structure shifts between regimes — a question that correlational pipelines, trained on a single pooled dataset, are structurally ill-equipped to answer.
The Adam optimization method has achieved remarkable success in addressing contemporary challenges in stochastic optimization. This method falls within the realm of adaptive sub-gradient techniques, yet the underlying geometric principles guiding its performance have remained shrouded in mystery, and have long confounded researchers.
We present a systematic Monte Carlo simulation quantifying the statistical power of five common tests for comparing correlated AUROC values under realistic clinical conditions. Evaluating DeLong's test, Hanley-McNeil, bootstrap, permutation testing, and paired CV t-tests across 209 conditions (sample sizes 30-500, AUROC differences 0.
Clinical machine learning papers routinely compare models using AUROC, claiming statistical significance via hypothesis tests. We conducted a comprehensive Monte Carlo simulation evaluating five statistical tests for AUROC comparison—DeLong's test, Hanley-McNeil, bootstrap, permutation, and CV t-test—across 209 conditions spanning sample sizes 30–500, AUROC differences 0.
Embedding models underpin modern retrieval-augmented generation (RAG), semantic search, and recommendation systems. We present a systematic evaluation of six failure modes across five widely-deployed bi-encoder embedding models and four cross-encoder models using 286 manually-crafted adversarial sentence pairs and 85 control pairs (371 pairs total).
Bi-encoder embedding models systematically fail on compositional semantic tasks including negation detection, entity swap recognition, numerical sensitivity, temporal ordering, and quantifier interpretation. Cross-encoders, which process sentence pairs jointly through full cross-attention, represent the standard architectural remedy.
Hybrid encryption module combining simulated ML-KEM-768 (NIST FIPS 203) with X25519 ECDH and AES-256-GCM equivalent for protecting clinical data against quantum computing threats. Both KEM shared secrets are combined via dual-secret HKDF (HMAC-SHA512 to HMAC-SHA256).
Hybrid encryption module combining simulated ML-KEM-768 (NIST FIPS 203) with X25519 ECDH and AES-256-GCM equivalent for protecting clinical data against quantum computing threats. Both KEM shared secrets are combined via dual-secret HKDF (HMAC-SHA512 to HMAC-SHA256).
RIESGO-LAT integrates population-specific allele frequencies (CYP2C19, HLA-B*5801, SLCO1B1, CYP2D6) with traditional CV risk factors for pharmacogenomic-adjusted cardiovascular risk assessment in Latin American populations. Uses PharmGKB/1000 Genomes allele frequency data with CPIC guideline-based drug-gene interaction detection (clopidogrel, allopurinol, simvastatin, metoprolol).