Louis
|
489e8aab24
|
Sync release 0.4.9 to dev (#2407)
* fix: move tensorrt executable to engine (#2400)
* fix: move tensorrt executable to engine
Signed-off-by: James <james@jan.ai>
* some update
Signed-off-by: hiro <hiro@jan.ai>
* chore: bump tensorrt version
* fix: wrong destroy path
* fix: install extensions in parallel
* chore: update path for tensorrt engine (#2404)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
* Release/v0.4.9 (#2421)
* fix: turn off experimental settings should also turn off quick ask (#2411)
* fix: app glitches 1s generating response before starting model (#2412)
* fix: disable experimental feature should also disable vulkan (#2414)
* fix: model load stuck on windows when can't get CPU core count (#2413)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
* feat: TensorRT-LLM engine update support (#2415)
* fix: engine update
* chore: add remove prepopulated models
Signed-off-by: James <james@jan.ai>
* update tinyjensen url
Signed-off-by: James <james@jan.ai>
* update llamacorn
Signed-off-by: James <james@jan.ai>
* update Mistral 7B Instruct v0.1 int4
Signed-off-by: James <james@jan.ai>
* update tensorrt
Signed-off-by: James <james@jan.ai>
* update
Signed-off-by: hiro <hiro@jan.ai>
* update
Signed-off-by: James <james@jan.ai>
* prettier
Signed-off-by: James <james@jan.ai>
* update mistral config
Signed-off-by: James <james@jan.ai>
* fix some lint
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
* Tensorrt LLM disable turing support (#2418)
Co-authored-by: Hien To <tominhhien97@gmail.com>
* chore: add prompt template tensorrtllm (#2375)
* chore: add prompt template tensorrtllm
* Add Prompt template for mistral and correct model metadata
---------
Co-authored-by: Hien To <tominhhien97@gmail.com>
* fix: correct tensorrt mistral model.json (#2419)
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
|
2024-03-19 12:20:09 +07:00 |
|
Meta Spartan
|
0348aa3321
|
feat: Groq Inference Extension (#2263)
* feat: Groq Inference Extension
* Add Groq supported models
* Fix folder typo
* Add Groq options to interface and new API Key saving, tested working
* Fix linting
|
2024-03-18 06:40:20 +07:00 |
|
Louis
|
af5bcea773
|
fix: gate quick ask with feature toggle (#2331)
|
2024-03-12 20:10:59 +07:00 |
|
Louis
|
a3aceb8f60
|
fix: settings page state loop and dark theme (#2065)
* fix: settings page state loop and dark theme
* fix: crash on visiting settings page
|
2024-02-18 14:24:54 +07:00 |
|
Louis
|
5f95841fab
|
refactor: reduce IPC & API handlers - shared node logics (#2011)
|
2024-02-14 08:50:28 +07:00 |
|