From 7f811fc4090d19ce70a5f5cec9c988a9044f04f7 Mon Sep 17 00:00:00 2001
From: hahuyhoang411 <hahuyhoanghhh41@gmail.com>
Date: Fri, 1 Mar 2024 14:58:37 +0700
Subject: [PATCH] add: link to mistral + typo

---
 docs/blog/rag-is-not-enough.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/docs/blog/rag-is-not-enough.md b/docs/blog/rag-is-not-enough.md
index 0fc58367c..a351d3531 100644
--- a/docs/blog/rag-is-not-enough.md
+++ b/docs/blog/rag-is-not-enough.md
@@ -31,7 +31,7 @@ Problems still arise with catastrophic forgetting in general tasks, commonly obs
 
 ## Selecting a strong foundation model
 
-Mistral 7B continues to outshine [Meta's Llama-2 7B](https://huggingface.co/meta-llama/Llama-2-7b) and [Google's Gemma 7B](https://huggingface.co/google/gemma-7b) on meaningful benchmarks, so we selected this as a starting point. 
+[Mistral 7B](https://huggingface.co/mistralai/Mistral-7B-v0.1) continues to outshine [Meta's Llama-2 7B](https://huggingface.co/meta-llama/Llama-2-7b) and [Google's Gemma 7B](https://huggingface.co/google/gemma-7b) on meaningful benchmarks, so we selected this as a starting point. 
 
 Having a robust base model is critical. In our experiments, using Mistral as a starting point ensured the highest accuracy for subsequent specialized adaptations.
 
@@ -51,7 +51,7 @@ Mistral alone has known, poor math capabilities, which we needed for our highly
 
 We found model merging to be a viable approach where each iteration is cost-effective + fast to deploy.
 
-We ended up with [Stealth v1.1](https://huggingface.co/jan-hq/stealth-v1.1), a [SLERP](https://github.com/Digitous/LLM-SLERP-Merge) merge of Mistral with the following:
+We ended up with [Stealth 7B v1.1](https://huggingface.co/jan-hq/stealth-v1.1), a [SLERP](https://github.com/Digitous/LLM-SLERP-Merge) merge of Mistral with the following:
 
 - [WizardMath](https://huggingface.co/WizardLM/WizardMath-7B-V1.1) for its math capabilities
 - [WizardCoder](https://huggingface.co/WizardLM/WizardCoder-Python-7B-V1.0) for its coding capabilities
@@ -65,7 +65,7 @@ Merging different LLMs can lead to the mixed answering style because each model
 
 Thus, we applied Direct Preference Optimization ([DPO](https://arxiv.org/abs/2305.18290)) using the [Intel's Orca DPO pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) dataset, chosen for its helpful answering style in general, math and coding concentration.
 
-This approach result in a final model - [Stealth v1.2](https://huggingface.co/jan-hq/stealth-v1.2), with minimal loss, and realign to our technical preferences.
+This approach result in a final model - [Stealth 7B v1.2](https://huggingface.co/jan-hq/stealth-v1.2), with minimal loss, and realign to our technical preferences.
 
 ## **Using our own technical documentation**