What is Retrieval-Augment Generation (RAG) and Why is it Needed?Retriever-Augmented Generation (RAG) architecture was conceived to overcome fundamental limitations LLM-only generation approach. Retraining of LLMs for domain and business specific cases is still extremely expensive undertaking. Also there is fundamental competition between the need for LLM to be generally usable (have common knowledge) and specifically usable (have domain or business case knowledge). So RAG is here to help overcome LLM limitations and enable more engineering based solutions. Retriever-Augmented Generation core components are:
|
Subscribe to my AI engineering newsletter to get articles how to build real world AI applications