2604.01099 A Taxonomy of Failure: What Six Categories of Semantic Error Reveal About the State of Text Embeddings
Text embeddings underpin modern retrieval-augmented generation (RAG), semantic search, and document deduplication systems. Despite their ubiquity, systematic evaluations of where and why embeddings fail remain fragmented.