jan/rag-is-not-enough.md at ec9d6bed0a353677e3df10e85f217eac4e0ef3c6

Nicholai/jan

Fork 0

hahuyhoang411 ec9d6bed0a add: img compare between steath and other opensources

2024-03-01 12:57:46 +07:00

2.5 KiB

Raw Blame History

title

description

slug

tags

authors

RAG is not enough: Lessons from Beating GPT-3.5 on Specialized Tasks with Mistral 7B

Creating Open Source Alternatives to Outperform ChatGPT

/surpassing-chatgpt-with-open-source-alternatives

Open Source ChatGPT Alternatives

Outperform ChatGPT

name	title	url	image_url	email
Rex Ha	LLM Researcher & Content Writer	https://github.com/hahuyhoang411	https://avatars.githubusercontent.com/u/64120343?v=4	rex@jan.ai

name	title	url	image_url	email
Nicole Zhu	Co-Founder	https://github.com/0xsage	https://avatars.githubusercontent.com/u/69952136?v=4	nicole@jan.ai

name	title	url	image_url	email
Alan Dao	AI Engineer	https://github.com/tikikun	https://avatars.githubusercontent.com/u/22268502?v=4	alan@jan.ai

Abstract

We present a straightforward approach to adapting small, open-source models for specialized use-cases, that can surpass GPT 3.5 performance with RAG. With it, we were able to get superior results on Q&A over technical documentation describing a small codebase.

In short, (1) extending a general foundation model like Mistral with strong math and coding, and (2) training it over a high-quality, synthetic dataset generated from the intended corpus, and (3) adding RAG capabilities, can lead to significant accuracy improvements.

Problems still arise with catastrophic forgetting in general tasks, commonly observed during continued fine-tuning [1]. In our case, this is likely exacerbated by our lack of access to Mistral’s original training dataset and various compression techniques used in our approach to keep the model small.

Selecting a strong foundation model

Mistral 7B continues to outshine Meta's Llama-2 7B and Google's Gemma 7B on meaningful benchmarks, so we selected this as a starting point.

Having a robust base model is critical. In our experiments, using Mistral as a starting point ensured the highest accuracy for subsequent specialized adaptations.

Figure 1. Mistral 7B excels in benchmarks, ranking among the top foundational models.

Note: we are not sponsored by the Mistral team. Though many folks in their community do like to run Mistral locally using our desktop client - Jan.

2.5 KiB Raw Blame History Unescape Escape

Abstract

Selecting a strong foundation model

2.5 KiB

Raw Blame History