Intermediate
~17 min
rag
retrieval
groundedness
Evaluating RAG: retrieval and generation are different problems
If you grade only end-to-end, you'll never know what's broken.
Step 1 of 14
RAG has two failure surfaces: the retriever returns the wrong documents, or the generator ignores or misuses the correct ones. End-to-end metrics conflate the two, and applying a fix to the wrong surface wastes weeks. Score retrieval and generation separately, then jointly.
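One way to make the split concrete is to label each evaluation example by which surface failed. The sketch below is illustrative only: `Example`, `recall_at_k`, `diagnose`, and `summarize` are hypothetical names, the gold document ids and the `answer_correct` label are assumed to come from human annotation or a judge you trust, and treating "at least one gold document in the top k" as a retrieval success is a simplifying assumption, not a standard.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass


@dataclass
class Example:
    """One evaluation row (hypothetical schema for illustration)."""
    retrieved_ids: list[str]  # doc ids returned by the retriever, best first
    gold_ids: set[str]        # doc ids annotated as relevant to the question
    answer_correct: bool      # end-to-end judgment of the final answer


def recall_at_k(ex: Example, k: int) -> float:
    """Fraction of gold documents that appear in the top-k retrieved docs."""
    if not ex.gold_ids:
        return 0.0
    hits = ex.gold_ids & set(ex.retrieved_ids[:k])
    return len(hits) / len(ex.gold_ids)


def diagnose(ex: Example, k: int = 5) -> str:
    """Attribute an example to a failure surface.

    Assumption: retrieval counts as successful if any gold doc is in the top k.
    """
    retrieved_ok = recall_at_k(ex, k) > 0
    if ex.answer_correct:
        # A correct answer without supporting docs is worth flagging:
        # the generator may be answering from memory, not from context.
        return "ok" if retrieved_ok else "correct_without_evidence"
    return "generation_failure" if retrieved_ok else "retrieval_failure"


def summarize(examples: list[Example], k: int = 5) -> Counter:
    """Count examples per diagnosis label."""
    return Counter(diagnose(ex, k) for ex in examples)


# Made-up rows, purely to show the shape of the report.
examples = [
    Example(["d1", "d2"], {"d1"}, True),
    Example(["d3", "d4"], {"d9"}, False),
    Example(["d5", "d6"], {"d5"}, False),
    Example(["d7"], {"d8"}, True),
]
report = summarize(examples)
```

A report dominated by `retrieval_failure` points at the retriever (indexing, chunking, query formulation), while one dominated by `generation_failure` points at the prompt or the model. The end-to-end accuracy alone would show the same number in both cases.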