15 Commits

Author SHA1 Message Date
hiento09
31fdd89f0e
Revert nitro to 0.2.6 (#1491)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-10 13:47:39 +07:00
hiento09
1350413e4f
Bump nitro to 0.2.8 and change Jan App to support cuda >= 11.7 (#1476) 2024-01-10 00:09:18 +07:00
hiento09
6c9cc9bb86
Bump nitro to 0.2.7 (#1474) 2024-01-09 16:29:50 +07:00
hiento09
d19e8baf22
Bump nitro version to 0.2.6 (#1458) 2024-01-08 23:37:20 +07:00
hiento09
f11a59bece
Add detect cuda version (#1351)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-01-04 22:53:21 +07:00
hiento09
5300172836
Fix wrong linux nitro path (#1266)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-31 11:37:55 +07:00
hiento09
7c0b45f0c5
Fix switch thread crash nitro windows linux (#1214)
* Add fix bug #1204 on windows

* nitro gpu exit on kill

* correct bat script syntax

* Remove wait 5 second before start nitro

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-26 17:33:20 +07:00
hiento09
4f93e14d16
Fix token speed slow in machine has multi gpus (#1157)
* Update bat script windows choose GPU has highest ram to start nitro

* Update bash script for linux to choose gpu has highest vram

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2023-12-21 15:38:21 +07:00
Louis
4653030bc1
fix: #1097 streaming response is replaced by error message (#1099) 2023-12-19 16:42:13 +07:00
hiento09
fde176955a
bump nitro version to 0.1.30 (#1036) 2023-12-15 17:39:49 +07:00
hiro
7f60265b3e chore: Bump nitro to 0.1.27 to support api to kill process 2023-12-13 16:35:37 +07:00
hiro
8f5c5e1e42 chore: Bump nitro to 0.1.26 2023-12-12 19:41:48 +07:00
hiro
f2eb8635da chore: Bumpt nitro bin version to version 0.1.23 2023-12-11 20:53:53 +07:00
hiro
6d3bf24d5c chore: remove gitkeep 2023-12-08 23:06:08 +07:00
hiro
c01737ff69 refactor: Change inference-extension to inference-nitro-extension 2023-12-08 23:06:08 +07:00