RAG

Retrieval Augmented Generation (RAG) is a technique to ground Large Language Model (LLM) responses in external, up-to-date, or private data. By retrieving relevant information (often using vector search) and feeding it into the model's context, RAG enables more accurate and fact-based answers.