120 Commits

Author SHA1 Message Date
hiro
b94e1437c6 chore: Bump to 0.2.11 for queue 2024-01-18 08:42:13 +07:00
hiro
80903bb640 chore: Pump nitro version to 0.2.10 2024-01-18 08:28:41 +07:00
Louis
f4f861d0e9
refactor: introduce node module in nitro extension (#1630) 2024-01-17 11:28:54 +07:00
Louis
f025f111cc
chore: typo model.json (#1599) 2024-01-15 15:50:20 +07:00
Louis
ca28fe51d1
fix: #1569 - Does not apply thread settings when loading model (#1576) 2024-01-14 22:02:36 +07:00
NamH
4a2f5bce8d
fix(Thread): #1212 thread.json not created when user change thread settings (#1570)
Signed-off-by: nam <namnh0122@gmail.com>
2024-01-14 17:46:25 +07:00
NamH
59564b710e
fix(Thread): #1336 not allow creating too many unfinished thread (#1538)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-01-14 16:22:29 +07:00
Louis
dbd874055d
fix: gpu check module export does not work in extension (#1536) 2024-01-11 20:54:53 +07:00
hiento09
a0e55cde8f
Feature integrate antivirus scanner to ci (#1529)
* Revert nitro to 0.2.6

* Update nitro to 0.2.7 and add ci antivirus scanner

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-11 18:26:36 +07:00
Louis
19153c4137
fix: stop word update (#1457)
* fix: stop word update

* fix: enable stop settings

---------

Co-authored-by: hahuyhoang411 <hahuyhoanghhh41@gmail.com>
2024-01-11 10:03:59 +07:00
Louis
74ed081e4f
refactor: add app and nitro log - resolve dependencies issue (#1447)
* refactor: add app and nitro log - resolve dependencies issue

* fix: update guidance message on inference error

* chore: add timestamp to log files

* chore: add clear logs action
2024-01-10 16:43:28 +07:00
NamH
9183330480
fix(InferenceExtension): #1067 sync the nitro process state (#1493)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-01-10 14:15:17 +07:00
hiento09
31fdd89f0e
Revert nitro to 0.2.6 (#1491)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-10 13:47:39 +07:00
hiento09
1350413e4f
Bump nitro to 0.2.8 and change Jan App to support cuda >= 11.7 (#1476) 2024-01-10 00:09:18 +07:00
hiento09
6c9cc9bb86
Bump nitro to 0.2.7 (#1474) 2024-01-09 16:29:50 +07:00
Louis
f82cf0d014
chore: error message update (#1473) 2024-01-09 16:21:14 +07:00
hiento09
d19e8baf22
Bump nitro version to 0.2.6 (#1458) 2024-01-08 23:37:20 +07:00
Hoang Ha
0dd12950df
fix(magicoder): correct the id and update parameters (#1448)
fix(capybara): change description + parameters

fix(codeninja): update description + add parameters

delete(magicoder): this model is failed to load due to gguf

fix(deepseek-1.3b): update description + add parameters

fix(deepseek-34b): update description + add parameters

fix(dolphin8x7b): update description + parameters

fix(llama2-chat-7b): update description + add parameters

fix(llama2-chat-70b): update description + add parameters

fix(mistral-7b): update description + add parameters

fix(mistral): degrade to v0.1 due to giberish output from v0.2

fix(mixtral): update desciption + parameters

fix(noromaid): update description + parameters

fix(mistral7b): correct the prompt template

update(openchat): add parameters

fix(openhermesneural): update parameter + description

fix(openhermesneural): update max_tokens

delete(pandora): remove pandora

fix(phi2): update parameter

update(phind34b): update description + change to q4 + add parameter

update(solar-slerp): update parameter

fix(starling): update description + parameter

fix(openchat): add to recommended

fix(stealth): add parameters

fix(tinyllama): update description + parameters

fix(trinity): update description + parameters

fix(tulu2): update description + parameters

fix(stealth): change max_tokens to 4096

update(wizardcoder): update parameter + description

update(wizardcoder): update  description

update(wizardcoder): change to q4

fix(yarn-mistral): update parameters

fix(yarn-mistral): change to q4

update(wizardcoder): correct the size of q4 model

fix(model-extension): pump to 1.0.20

fix(tinyllama): change to q8

fix(phi-2): change to q8

fix(deepseek34b): change to q4

fix(mistral7b): update the stop token

fix(starling): remove recommended
2024-01-08 23:00:01 +07:00
NamH
9ca6487183
chore: clean up use os hook (#1418)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-01-07 13:16:56 +07:00
Hoang Ha
7f4be92700
Update model hub (#1383)
* fix(openchat): remove from featured model

* fix(model-extension): update to version 1.0.19

* fix(openchat): fix typo
2024-01-05 15:32:30 +07:00
NamH
764e4c6fe3
fix(Model): remove unsupported default model setting params (#1382)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-01-05 15:13:21 +07:00
Hoang Ha
84a09ae03f
Chore/update model hub (#1342)
* fix(mistral-ins): clean redundant parameters

* add(yarn-mistral): update new requested model

* fix(trinity-v1): delete trinity v1 from the hub

* add(tulu-2-70b): llama 70b alternative

* fix(lzlv-70b): delete lzlv-70b and changed to tulu-2

* fix(mistral-ins): upgrade model version to v0.2

* fix(model-extention): pump version to 1.0.18

* add(dolphin 8x7b): update the current  best moe finetuned model

* add(openchat): the best 7b model

* fix(tinyllama): pump version of the model to v1

* fix(stealth): upgrade stealth to v1.3

* Revert "fix(stealth): upgrade stealth to v1.3"

This reverts commit da24df3fb5d69f93d92cc4dd45f991d548aff6aa.

* fix(stealth): upgrade version to v1.3
2024-01-05 13:50:35 +07:00
Louis
1400ef053c
fix: model migration stopped working (#1378) 2024-01-05 13:50:17 +07:00
hiento09
f11a59bece
Add detect cuda version (#1351)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-04 22:53:21 +07:00
Louis
c07b418ff2
fix: Error occurred: Unexpected token "d", "data: ..." is not a valid JSON (#1332) 2024-01-04 18:22:07 +07:00
NamH
26eb1d9a67
feat(Model): #1028 made model.json optional (#1314)
* feat(Model): #1028 made model.json optional

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-01-04 16:50:32 +07:00
hiento09
c16da28186
correct type utf-8 (#1311)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-03 16:01:50 +07:00
hiento09
0a7e26d5fb
Fix memory on mac included cached and swap (#1298)
* Fix memory on mac included cached and swap

* set inteval monitor to 0.5s

---------

Co-authored-by: Hien To <hien@jan.ai>
2024-01-03 02:09:17 +07:00
hiento09
45fdadf1ea
Switch from systeminformation to os-utils to resolve bitdefender antivirus on windows, and reduce memory leak for monitor extension (#1282)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-02 13:39:22 +07:00
hiento09
b5078599ec
fix ci test run failed (#1267)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-31 12:10:10 +07:00
hiento09
5300172836
Fix wrong linux nitro path (#1266)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-31 11:37:55 +07:00
Hoang Ha
f68909bb20
Merge branch 'main' into add/model-list 2023-12-29 18:34:24 +07:00
Louis
a9579a3cbd
fix: remove redundant gpu detection prompt event (#1254) 2023-12-29 16:37:34 +07:00
Hoang Ha
d99fc186b9
Merge branch 'main' into add/model-list 2023-12-29 16:23:48 +07:00
hiento09
1ec8174700
Feature GPU detection for Jan on Windows and Linux (#1242)
* Add js function to generate gpu and cuda detection

* inference nitro manage via json file instead of bash and bat script

* Add /usr/lib/x86_64-linux-gnu/ to linux check gpu

* chore: add CPU - GPU toggle

* correct file path

* fix: exist file sync check

* fix: get resources path

* Fix error jan/engines create existed error

* Seting sync to file

* Fix error show notification for GPU

* Set notify default to true

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: Louis <louis@jan.ai>
2023-12-29 15:56:36 +07:00
hahuyhoang411
7fb2a60bca fix(package.json): update version to 1.0.17 2023-12-28 18:35:34 +07:00
Louis
5250061c11
feat: Jan Server, API and decoupled clients (#948)
* chore: expose fs apis

* chore: correct electron import path

* update download api

Signed-off-by: James <james@jan.ai>

* update chat_completion

Signed-off-by: James <james@jan.ai>

* fix electron import

Signed-off-by: James <james@jan.ai>

* feat: adding API support at 1337 (#991)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* feat: Add /chat/completion api and handler

* chore: add todo for modelList

* chore: read engine.json for openai chat_completion (#1030)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* refactor: move routes to shared node module

* refactor: exported modules from core with types (#1172)

* refactor: exported modules from core with types

* fix: fix file reading args

* refactor: fileManager handles

* fix: app issues with server refactoring

* refactor: shared server module (#1210)

* chore: resolve main

* chore: update makefile

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: hiro <vuonghoainam.work@gmail.com>
2023-12-28 17:57:39 +07:00
hiento09
cfbc5674fe
Remove un-use resource checking to free ram on windows - monitor extension (#1241)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-28 15:34:11 +07:00
Louis
7feaf9694d
feat: Deprecate model.json ready state in favor of .download ext (#1238)
* feat: Deprecate model.json ready state in favor of .download ext

* refactor: resolve ts ignore

* chore: fix warning

* fix: path polyfill on Windows
2023-12-28 14:06:13 +07:00
hiento09
d4b4540505
add sleep 500ms if platform is windows (#1215)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-26 19:15:38 +07:00
hiento09
7c0b45f0c5
Fix switch thread crash nitro windows linux (#1214)
* Add fix bug #1204 on windows

* nitro gpu exit on kill

* correct bat script syntax

* Remove wait 5 second before start nitro

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-26 17:33:20 +07:00
hahuyhoang411
2aab5ff355 fix(package.json): update version to 1.0.16 2023-12-21 20:55:26 +07:00
hiro
f0c889ce6b
Merge pull request #1128 from janhq/feat/gh_codeql
feat: Add codeQL analysis for push main and pr main
2023-12-21 15:43:27 +07:00
hiento09
4f93e14d16
Fix token speed slow in machine has multi gpus (#1157)
* Update bat script windows choose GPU has highest ram to start nitro

* Update bash script for linux to choose gpu has highest vram

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-21 15:38:21 +07:00
Louis
8d5fce9165
fix: malicious redirections & security risk 2023-12-21 14:10:33 +07:00
Louis
4e044df044
chore: bump model-extension version 2023-12-21 13:46:29 +07:00
Louis
e0370210a7
fix: #1096 yield error message upon thread switching (#1109) 2023-12-20 14:58:47 +07:00
NamH
84fb5ef346
fix(Thread): #1042 allow create new thread by clicking Use in Jan Hub (#1103)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2023-12-19 17:06:57 +07:00
Louis
4653030bc1
fix: #1097 streaming response is replaced by error message (#1099) 2023-12-19 16:42:13 +07:00
Louis
e3c1787d57
fix: app failed to load model sometime due to race condition (#1071) 2023-12-19 11:10:07 +07:00