From 0fa9ed959a2e714e872684e5a92f4fb425703071 Mon Sep 17 00:00:00 2001 From: hahuyhoang411 Date: Fri, 1 Mar 2024 14:02:28 +0700 Subject: [PATCH] add : rag section --- docs/blog/rag-is-not-enough.md | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/docs/blog/rag-is-not-enough.md b/docs/blog/rag-is-not-enough.md index 88558c0aa..9483608af 100644 --- a/docs/blog/rag-is-not-enough.md +++ b/docs/blog/rag-is-not-enough.md @@ -97,4 +97,10 @@ This final model can be found [here on Huggingface](https://huggingface.co/jan-h ![Using LLM locally](img/nitro-on-jan.png) -*Figure 3. Using the new finetuned model in [Jan](https://jan.ai/)* \ No newline at end of file +*Figure 3. Using the new finetuned model in [Jan](https://jan.ai/)* + +# Improving results with RAG + +As an additional step, we also added [Retrieval Augmented Generation (RAG)](https://blogs.nvidia.com/blog/what-is-retrieval-augmented-generation/) as an experiment parameter. + +A simple RAG setup was done using **[Llamaindex](https://www.llamaindex.ai/)** and the **[bge-en-base-v1.5 embedding](https://huggingface.co/BAAI/bge-base-en-v1.5)** model for efficient documentation retrieval and question-answering. You can find the RAG implementation [here](https://github.com/janhq/open-foundry). \ No newline at end of file