80 Commits

Author SHA1 Message Date
Louis
0e48be67e8
feat: support multiple model binaries (#1659)
* feat: Support multiple model binaries

* fix: Update downloadModel with multiple binaries handler

* feat: Add 3 models with multiple binaries

* chore: fix model download

* fix: model file lookup & model path

* chore: add .prettierrc

* chore: refactor docs

* chore: bump model version

* fix(capybara): add filename

* fix(codeninja): add file name + llama model path

* fix(default): add llama model path

* fix(deepseek coder): add filename

* fix(deepseek 33B): add filename

* fix(dolphin mixtral): add filename

* fix(llama2-chat): add filename

* fix(llama2-70B): add filename

* fix(mistral 7b): add filename + model path

* fix(bakllava): correct size model

* fix(llava-7b): correct size model

* fix(llava-13b): correct size model

* fix(mixtral-8x7b): add file name + modelpath

* fix(noramaid-7b): add file name + modelpath

* fix(openchat-7b): add file name + modelpath

* fix(openhermes-7b): add file name + modelpath

* fix(phi2-3b): add file name + modelpath

* fix(phind): add file name + modelpath

* fix(solarslerp): add file name + modelpath

* fix(starling): add file name + modelpath

* fix(stealth): add file name + modelpath

* fix(tinyllama): add file name + modelpath

* fix(trinity): add file name + modelpath

* fix(tulu): add file name + modelpath

* fix(wizardcoder): add file name + modelpath

* fix(yi): add file name + modelpath

* update from source -> sources

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: hiro <vuonghoainam.work@gmail.com>
Co-authored-by: hahuyhoang411 <hahuyhoanghhh41@gmail.com>
Co-authored-by: James <james@jan.ai>
2024-01-25 14:05:33 +07:00
hiro
410c6f9a57 chore(nitro): 0.2.11 -> 0.2.12 2024-01-25 10:20:47 +07:00
hiro
9da59b4612 fix: Change initial value for reduce as 1 instead of 0 2024-01-23 17:35:34 +07:00
hiro
0608d77da5 chore: remove deps node-os-utils 2024-01-23 17:35:34 +07:00
hiro
5a9b0cf8b8 chore: add log to get data on testing 2024-01-23 17:35:34 +07:00
hiro
a0f6f00806 fix: Add fix for physical cpu core count 2024-01-23 17:35:34 +07:00
hiro
17479e7967 chore: reformat and add types 2024-01-23 17:35:34 +07:00
NamH
4cf47777e6
feat: allow user to move jan folder (#1649)
* feat: allow user to move jan folder

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-01-22 14:37:46 +07:00
Louis
a50ea4a634
refactor: deprecate extension type implementation (#1677)
* refactor: deprecate extension type implementation

* chore: update README.md

* refactor: EventName to use the events defined in /types
2024-01-22 10:26:49 +07:00
hiento09
42c416ebd5
fix/#1650-hogging-resources (#1663) 2024-01-18 19:10:37 +07:00
hiro
b94e1437c6 chore: Bump to 0.2.11 for queue 2024-01-18 08:42:13 +07:00
hiro
80903bb640 chore: Pump nitro version to 0.2.10 2024-01-18 08:28:41 +07:00
Louis
f4f861d0e9
refactor: introduce node module in nitro extension (#1630) 2024-01-17 11:28:54 +07:00
Louis
ca28fe51d1
fix: #1569 - Does not apply thread settings when loading model (#1576) 2024-01-14 22:02:36 +07:00
Louis
dbd874055d
fix: gpu check module export does not work in extension (#1536) 2024-01-11 20:54:53 +07:00
hiento09
a0e55cde8f
Feature integrate antivirus scanner to ci (#1529)
* Revert nitro to 0.2.6

* Update nitro to 0.2.7 and add ci antivirus scanner

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-11 18:26:36 +07:00
Louis
74ed081e4f
refactor: add app and nitro log - resolve dependencies issue (#1447)
* refactor: add app and nitro log - resolve dependencies issue

* fix: update guidance message on inference error

* chore: add timestamp to log files

* chore: add clear logs action
2024-01-10 16:43:28 +07:00
NamH
9183330480
fix(InferenceExtension): #1067 sync the nitro process state (#1493)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-01-10 14:15:17 +07:00
hiento09
31fdd89f0e
Revert nitro to 0.2.6 (#1491)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-10 13:47:39 +07:00
hiento09
1350413e4f
Bump nitro to 0.2.8 and change Jan App to support cuda >= 11.7 (#1476) 2024-01-10 00:09:18 +07:00
hiento09
6c9cc9bb86
Bump nitro to 0.2.7 (#1474) 2024-01-09 16:29:50 +07:00
Louis
f82cf0d014
chore: error message update (#1473) 2024-01-09 16:21:14 +07:00
hiento09
d19e8baf22
Bump nitro version to 0.2.6 (#1458) 2024-01-08 23:37:20 +07:00
NamH
764e4c6fe3
fix(Model): remove unsupported default model setting params (#1382)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-01-05 15:13:21 +07:00
hiento09
f11a59bece
Add detect cuda version (#1351)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-04 22:53:21 +07:00
Louis
c07b418ff2
fix: Error occurred: Unexpected token "d", "data: ..." is not a valid JSON (#1332) 2024-01-04 18:22:07 +07:00
hiento09
c16da28186
correct type utf-8 (#1311)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-03 16:01:50 +07:00
hiento09
45fdadf1ea
Switch from systeminformation to os-utils to resolve bitdefender antivirus on windows, and reduce memory leak for monitor extension (#1282)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-02 13:39:22 +07:00
hiento09
b5078599ec
fix ci test run failed (#1267)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-31 12:10:10 +07:00
hiento09
5300172836
Fix wrong linux nitro path (#1266)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-31 11:37:55 +07:00
Louis
a9579a3cbd
fix: remove redundant gpu detection prompt event (#1254) 2023-12-29 16:37:34 +07:00
hiento09
1ec8174700
Feature GPU detection for Jan on Windows and Linux (#1242)
* Add js function to generate gpu and cuda detection

* inference nitro manage via json file instead of bash and bat script

* Add /usr/lib/x86_64-linux-gnu/ to linux check gpu

* chore: add CPU - GPU toggle

* correct file path

* fix: exist file sync check

* fix: get resources path

* Fix error jan/engines create existed error

* Seting sync to file

* Fix error show notification for GPU

* Set notify default to true

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: Louis <louis@jan.ai>
2023-12-29 15:56:36 +07:00
Louis
5250061c11
feat: Jan Server, API and decoupled clients (#948)
* chore: expose fs apis

* chore: correct electron import path

* update download api

Signed-off-by: James <james@jan.ai>

* update chat_completion

Signed-off-by: James <james@jan.ai>

* fix electron import

Signed-off-by: James <james@jan.ai>

* feat: adding API support at 1337 (#991)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* feat: Add /chat/completion api and handler

* chore: add todo for modelList

* chore: read engine.json for openai chat_completion (#1030)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* refactor: move routes to shared node module

* refactor: exported modules from core with types (#1172)

* refactor: exported modules from core with types

* fix: fix file reading args

* refactor: fileManager handles

* fix: app issues with server refactoring

* refactor: shared server module (#1210)

* chore: resolve main

* chore: update makefile

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: hiro <vuonghoainam.work@gmail.com>
2023-12-28 17:57:39 +07:00
hiento09
cfbc5674fe
Remove un-use resource checking to free ram on windows - monitor extension (#1241)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-28 15:34:11 +07:00
Louis
7feaf9694d
feat: Deprecate model.json ready state in favor of .download ext (#1238)
* feat: Deprecate model.json ready state in favor of .download ext

* refactor: resolve ts ignore

* chore: fix warning

* fix: path polyfill on Windows
2023-12-28 14:06:13 +07:00
hiento09
d4b4540505
add sleep 500ms if platform is windows (#1215)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-26 19:15:38 +07:00
hiento09
7c0b45f0c5
Fix switch thread crash nitro windows linux (#1214)
* Add fix bug #1204 on windows

* nitro gpu exit on kill

* correct bat script syntax

* Remove wait 5 second before start nitro

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-26 17:33:20 +07:00
hiento09
4f93e14d16
Fix token speed slow in machine has multi gpus (#1157)
* Update bat script windows choose GPU has highest ram to start nitro

* Update bash script for linux to choose gpu has highest vram

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-21 15:38:21 +07:00
Louis
e0370210a7
fix: #1096 yield error message upon thread switching (#1109) 2023-12-20 14:58:47 +07:00
NamH
84fb5ef346
fix(Thread): #1042 allow create new thread by clicking Use in Jan Hub (#1103)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2023-12-19 17:06:57 +07:00
Louis
4653030bc1
fix: #1097 streaming response is replaced by error message (#1099) 2023-12-19 16:42:13 +07:00
Louis
e3c1787d57
fix: app failed to load model sometime due to race condition (#1071) 2023-12-19 11:10:07 +07:00
hiento09
9398b734a1
Posthog disable click event and increase timeout for nitro load model request (#1060)
Co-authored-by: Service Account <service@jan.ai>
2023-12-18 21:10:36 +07:00
hiento09
fde176955a
bump nitro version to 0.1.30 (#1036) 2023-12-15 17:39:49 +07:00
0xSage
c556427905 more copy nits 2023-12-15 16:33:49 +08:00
Faisal Amir
4f41dab9b5 feat: move stop inference button into the send button 2023-12-14 16:40:51 +07:00
hiro
257011309f fix: Kill nitro many times app onDispose 2023-12-13 16:47:34 +07:00
hiro
96fa392fba
Update extensions/inference-nitro-extension/src/module.ts
Co-authored-by: Louis <louis@jan.ai>
2023-12-13 16:40:56 +07:00
hiro
539f11e583 feat: Add api to kill nitro 2023-12-13 16:35:59 +07:00
hiro
7f60265b3e chore: Bump nitro to 0.1.27 to support api to kill process 2023-12-13 16:35:37 +07:00