11 Commits

Author SHA1 Message Date
Louis
358d8f83f9
chore: add llama3.2 and qwen models 2024-10-02 11:36:55 +07:00
Hoang Ha
852ff18d74
bump version (#3082) 2024-06-21 16:21:20 +07:00
Hoang Ha
71a707aa77
adjust correct ngl number (#3081) 2024-06-21 14:34:38 +07:00
Hoang Ha
1e0d4f3753
Feat: Adjust model hub v0.4.13 (#2879)
* fix: correct phi3

* redundant phi2 dolphin

* add: hermes llama3

* add: ngl settings

* correct ctx len

* correct ngl

* correct maxlen + ngl

* disable phi3

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* remove redundant  hermes pro

* add ngl

* add ngl

* add ngl

* remove miqu

* add ngl

* add ngl

* add ngl

* add ngl

* remove redundant

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* version package bump

* feat: resolve issue of cannot found model in the extensions due to the removal

* feat: completely remove hermes-pro-7b

* feat: completely remove openhermes-neural-7b and miqu-70b, and add llama3-hermes-8b via renaming from Rex

* fix: correct description

---------

Co-authored-by: Van-QA <van@jan.ai>
2024-05-13 11:48:03 +07:00
Hoang Ha
2008aae100
Feat: Correct context length for models (#2867)
* fix: correct ctx

* version bump

* fix: correct ctxlen

* fix: correct ctxlen

* version bump

* fix: correct ctx + q4

* fix: correct ctxlen

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx len

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* version bump
2024-05-06 18:04:51 +07:00
Hoang Ha
984838a7bc
fix: bump version 2024-04-24 22:19:11 +07:00
Hoang Ha
d14c3af99b
add: featured 2024-04-24 16:35:05 +07:00
Van Pham
67db45ff3c
chore: add model.json for Llama3 and other outdated model version (#2773)
* chore: add model.json for Llama3 and other outdated model version

* fix: consistency format

* fix: correct folder id

* update: bump version

* add: stop words

* fix: model.json

* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json

* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json

Based on suggested change

Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>

---------

Co-authored-by: Van-QA <van@jan.ai>
Co-authored-by: Hoang Ha <64120343+hahuyhoang411@users.noreply.github.com>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>
2024-04-22 21:40:22 +07:00
Van Pham
e43ee8ec2c
Bump nitro to 0.3.22 (#2740)
* Bump nitro to 0.3.22

* Update model.json for Command-r-34b

Remove Coming Soon and Unavailable
2024-04-17 01:00:16 +07:00
Hoang Ha
b908ae2933
Chore: Change CommandR to unavailable (#2722)
* fix: move to comming soon

* fix: Q4 for consistancy

* version pump extension

* pump version model

* fix: highlight unsupported tag

---------

Co-authored-by: Louis <louis@jan.ai>
2024-04-15 12:57:52 +07:00
NamH
e0d6049d66
chore: extension should register its own models (#2601)
* chore: extension should register its own models

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-05 14:18:58 +07:00