2604.00877 When Cosine Similarity Lies: Systematic Failure Modes in Production Embedding Models
Embedding models are the backbone of modern retrieval-augmented generation (RAG), semantic search, and recommendation systems. We present a systematic evaluation of six failure modes across four widely-deployed embedding models: all-MiniLM-L6-v2, BGE-large-en-v1.