173 Commits

Author SHA1 Message Date
NamH
31dedb5774
chore: cortex version update (#3098) 2024-07-12 14:37:14 +07:00
jan-service-account
241d98f99c
Update cortex cpp nightly to version 0.4.18 (#3072)
* Update cortex cpp nightly to version 0.4.17

* update linux downloadnitro

* cortex 0.4.18

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Van Pham <64197333+Van-QA@users.noreply.github.com>
2024-07-12 14:37:14 +07:00
Hoang Ha
852ff18d74
bump version (#3082) 2024-06-21 16:21:20 +07:00
Hoang Ha
71a707aa77
adjust correct ngl number (#3081) 2024-06-21 14:34:38 +07:00
hiento09
e9e1435aff
Revert "Update cortex cpp nightly to version 0.4.12-16.06.24" (#3052) 2024-06-17 10:34:06 +07:00
github-actions[bot]
52e965a75c Update cortex cpp nightly to version 0.4.12-16.06.24 2024-06-16 11:42:23 +00:00
Hoang Ha
f702506e58
Chore: model hub v0.5.1 update (#3036)
* init model

* init qwen2

* version bump

* refactor: correct icon

* chore: Refactor/issue template feature request (#3037)

* refactor: add issue template form for bug

* refactor: config blank_issues_enabled: false

* refactor: config feature request

* refactor: config feature request

---------

Co-authored-by: Van-QA <van@jan.ai>

* refactor: correct icon

* refactor: allow blank issue

---------

Co-authored-by: Van-QA <van@jan.ai>
Co-authored-by: Van Pham <64197333+Van-QA@users.noreply.github.com>
2024-06-13 15:06:07 +07:00
Van Pham
1f5cce887a
chore: Bump-cortex-0.4.13 (#3027) 2024-06-12 16:09:17 +07:00
NamH
6ee5d16e5c
fix: duplicate role inside messages cause some model to refuse to answer (#3006)
* fix: duplicate role inside messages cause some model to refuse to answer

Signed-off-by: James <namnh0122@gmail.com>

* update

* Bump cortex to 0.4.12

* some model require not empty message

update

---------

Signed-off-by: James <namnh0122@gmail.com>
Co-authored-by: Van Pham <64197333+Van-QA@users.noreply.github.com>
2024-06-10 16:19:31 +07:00
Hoang Ha
bd5a0ea8ab
Chore: Model Hub update (#2966)
* fix: correct size

* version bump

* add: codestral 22b

* add: codestral 22b

* version bump

* upgrade to v3

* Update stop token default-model.json 

confirmed with Rex

* fix: whitespace

---------

Co-authored-by: Van Pham <64197333+Van-QA@users.noreply.github.com>
2024-05-30 12:33:47 +07:00
Van Pham
9ac5696e35
chore/Bump-cortex-0.4.11 (#2962) 2024-05-29 17:57:41 +07:00
Hoang Ha
25daba9696
Chore: aya update (#2941)
* init

* init

* fix: correct format

* version bump

* add: aya 8b, aya 35b, phi3

* fix: stop token

* fix: stop token
2024-05-24 18:10:23 +07:00
Van Pham
9cf9fa0dd3
Bump cortex to 0.4.9 (#2940) 2024-05-24 13:01:25 +07:00
Van Pham
f7c089c765
Bump cortex to 0.4.8 (#2938) 2024-05-22 21:21:02 +07:00
Hoang Ha
385ebb7750
Chore: phi3 long-context update (#2936)
* init

* init

* fix: correct version

* version bump

* correct url

* remove small

* correct size
2024-05-22 21:20:42 +07:00
Hoang Ha
65b8d8e66b
Fix: Phi-3 doesn't display (#2928)
* fix: params correction

* add phi

* version bump
2024-05-20 23:45:06 +07:00
Louis
e78d057f0f
fix: cortex process is not terminated properly (#2921)
* chore: bump cortex-cpp to 0.4.6

* Bump cortex 0.4.7

---------

Co-authored-by: Van Pham <64197333+Van-QA@users.noreply.github.com>
2024-05-18 14:14:56 +07:00
Louis
537ef20a54
chore: replace nitro by cortex-cpp (#2912) 2024-05-16 17:46:49 +07:00
Hoang Ha
218259945f
Chore: Add phi3 (#2914)
* init

* version bump

* fix: correct template
2024-05-16 14:58:21 +07:00
Hoang Ha
1e0d4f3753
Feat: Adjust model hub v0.4.13 (#2879)
* fix: correct phi3

* redundant phi2 dolphin

* add: hermes llama3

* add: ngl settings

* correct ctx len

* correct ngl

* correct maxlen + ngl

* disable phi3

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* remove redundant hermes pro

* add ngl

* add ngl

* add ngl

* remove miqu

* add ngl

* add ngl

* add ngl

* add ngl

* remove redundant

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* add ngl

* version package bump

* feat: resolve issue of model not found in the extensions due to the removal

* feat: completely remove hermes-pro-7b

* feat: completely remove openhermes-neural-7b and miqu-70b, and add llama3-hermes-8b via renaming from Rex

* fix: correct description

---------

Co-authored-by: Van-QA <van@jan.ai>
2024-05-13 11:48:03 +07:00
Hoang Ha
2008aae100
Feat: Correct context length for models (#2867)
* fix: correct ctx

* version bump

* fix: correct ctxlen

* fix: correct ctxlen

* version bump

* fix: correct ctx + q4

* fix: correct ctxlen

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx len

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* fix: correct ctx

* version bump
2024-05-06 18:04:51 +07:00
Louis
63a2f22414
Merge branch 'dev' into main 2024-04-25 14:14:54 +07:00
Hoang Ha
355ed9ff4f
Merge pull request #2812 from janhq/fix/model-version
Feat: Bump version
2024-04-24 22:48:29 +07:00
Hoang Ha
f9a8e06a4f
fix: version bump 2024-04-24 22:46:56 +07:00
Hoang Ha
eb3593e96a
fix: bump version 2024-04-24 22:24:22 +07:00
Hoang Ha
785b84d9ec
fix: bump version 2024-04-24 22:24:00 +07:00
Hoang Ha
ec589b1f22
fix: bump version 2024-04-24 22:23:40 +07:00
Hoang Ha
4d80f5c3c1
fix: bump version 2024-04-24 22:22:14 +07:00
Hoang Ha
984838a7bc
fix: bump version 2024-04-24 22:19:11 +07:00
Hoang Ha
68b0018d55
fix: version bump 2024-04-24 16:40:27 +07:00
Hoang Ha
e076c5ba4e
fix: remove featured 2024-04-24 16:38:47 +07:00
Hoang Ha
f5c4324f79
fix: remove featured 2024-04-24 16:38:30 +07:00
Hoang Ha
6bf12e42a8
fix: remove featured 2024-04-24 16:38:09 +07:00
Hoang Ha
3810b1a009
fix: remove featured 2024-04-24 16:37:48 +07:00
Hoang Ha
d14c3af99b
add: featured 2024-04-24 16:35:05 +07:00
Hoang Ha
3c294d6a48
Chore: Add phi-3 (#2794)
* add: phi-3

* chore: bump version

* fix: correct model id
2024-04-24 14:17:42 +07:00
Louis
da161cd159
fix: override cpu_threads setting from model.json (#2789) 2024-04-23 15:09:48 +07:00
Van Pham
67db45ff3c
chore: add model.json for Llama3 and other outdated model version (#2773)
* chore: add model.json for Llama3 and other outdated model version

* fix: consistency format

* fix: correct folder id

* update: bump version

* add: stop words

* fix: model.json

* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json

* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json

Based on suggested change

Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>

---------

Co-authored-by: Van-QA <van@jan.ai>
Co-authored-by: Hoang Ha <64120343+hahuyhoang411@users.noreply.github.com>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>
2024-04-22 21:40:22 +07:00
NamH
95632788e4
chore: default context length to 2048 (#2746) 2024-04-17 19:14:51 +07:00
NamH
a2cb1353cd
fix: cannot download phin34 model (#2745)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-17 18:36:02 +07:00
Van Pham
e43ee8ec2c
Bump nitro to 0.3.22 (#2740)
* Bump nitro to 0.3.22

* Update model.json for Command-r-34b

Remove Coming Soon and Unavailable
2024-04-17 01:00:16 +07:00
NamH
31397de2d1
Refactor/deprecate hugging face ext (#2620)
* refactor: deprecate huggingface extension

Signed-off-by: James <james@jan.ai>
2024-04-16 17:23:45 +07:00
Louis
9369ac3e8b
Merge branch 'dev' into main 2024-04-15 14:57:31 +07:00
Andreas Deininger
81e8889568
Fix typos (#2714) 2024-04-15 13:27:28 +07:00
Hoang Ha
b908ae2933
Chore: Change CommandR to unavailable (#2722)
* fix: move to coming soon

* fix: Q4 for consistency

* version bump extension

* bump version model

* fix: highlight unsupported tag

---------

Co-authored-by: Louis <louis@jan.ai>
2024-04-15 12:57:52 +07:00
hiento09
aff6a7d11a
Bump nitro to 0.3.16-hotfix (#2702)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-12 15:24:52 +07:00
Van Pham
8dbd2524b8
Revert to 0.3.16 due to Nitro issue (#2700) 2024-04-12 13:00:47 +07:00
Van Pham
4a9a9f27df
Revert to 0.3.14 due to Nitro issue (#2699) 2024-04-12 12:35:53 +07:00
Louis
fa9d8ab9a5
fix: switch between models get stuck at generating (#2698) 2024-04-12 12:34:22 +07:00
NamH
7d67087919
fix: add markdown support for extension description (#2691)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-11 17:43:59 +07:00