138 Commits

Author SHA1 Message Date
Hoang Ha
d14c3af99b
add: featured 2024-04-24 16:35:05 +07:00
Louis
da161cd159
fix: override cpu_threads setting from model.json (#2789) 2024-04-23 15:09:48 +07:00
Van Pham
67db45ff3c
chore: add model.json for Llama3 and other outdated model version (#2773)
* chore: add model.json for Llama3 and other outdated model version

* fix: consistency format

* fix: correct folder id

* update: bump version

* add: stop words

* fix: model.json

* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json

* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json

Based on suggested change

Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>

---------

Co-authored-by: Van-QA <van@jan.ai>
Co-authored-by: Hoang Ha <64120343+hahuyhoang411@users.noreply.github.com>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>
2024-04-22 21:40:22 +07:00
NamH
95632788e4
chore: default context length to 2048 (#2746) 2024-04-17 19:14:51 +07:00
NamH
a2cb1353cd
fix: cannot download phin34 model (#2745)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-17 18:36:02 +07:00
Van Pham
e43ee8ec2c
Bump nitro to 0.3.22 (#2740)
* Bump nitro to 0.3.22

* Update model.json for Command-r-34b

Remove Coming Soon and Unavailable
2024-04-17 01:00:16 +07:00
NamH
31397de2d1
Refactor/deprecate hugging face ext (#2620)
* refactor: deprecate huggingface extension

Signed-off-by: James <james@jan.ai>
2024-04-16 17:23:45 +07:00
Louis
9369ac3e8b
Merge branch 'dev' into main 2024-04-15 14:57:31 +07:00
Andreas Deininger
81e8889568
Fix typos (#2714) 2024-04-15 13:27:28 +07:00
Hoang Ha
b908ae2933
Chore: Change CommandR to unavailable (#2722)
* fix: move to comming soon

* fix: Q4 for consistancy

* version pump extension

* pump version model

* fix: highlight unsupported tag

---------

Co-authored-by: Louis <louis@jan.ai>
2024-04-15 12:57:52 +07:00
hiento09
aff6a7d11a
Bump nitro to -.3.16-hotfix (#2702)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-12 15:24:52 +07:00
Van Pham
8dbd2524b8
Revert to 0.3.16 due to Nitro issue (#2700) 2024-04-12 13:00:47 +07:00
Van Pham
4a9a9f27df
Revert to 0.3.14 due to Nitro issue (#2699) 2024-04-12 12:35:53 +07:00
Louis
fa9d8ab9a5
fix: switch between models get stuck at generating (#2698) 2024-04-12 12:34:22 +07:00
NamH
7d67087919
fix: add markdown support for extension description (#2691)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-11 17:43:59 +07:00
Louis
02c49e796d
fix: race condition issue - reading settings.json file (#2683)
* fix: race condition issue - reading settings.json file

* fix: cannot reset data while starting model

* chore: remove extension suffix
2024-04-11 15:37:46 +07:00
hiento09
ebdaaa6c10
bump nitro version to 0.3.21 (#2680)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-11 12:08:39 +07:00
Louis
065ed03099
fix: wrong monitoring system information type (#2679) 2024-04-11 11:07:31 +07:00
Louis
b19234ed71
chore: Extension should have product name in manifest (#2675)
* chore: Extension should have product name in manifest

* chore: typo
2024-04-11 09:50:58 +07:00
Louis
d93d74c86b
feat: nitro additional dependencies (#2674) 2024-04-11 09:13:02 +07:00
Louis
3f23de6c28
feat: move log into monitoring extension (#2662) 2024-04-10 14:35:15 +07:00
hiento09
2931a46799
Bump nitro to 0.3.19 (#2663)
Bump nitro to 0.3.19
2024-04-09 22:43:23 +07:00
hiento09
5fd6025175
Bump nitro version to 0.3.18 (#2652) 2024-04-09 12:31:10 +07:00
Louis
9479beb7d1
fix: unload model while loading cause unknown error (#2649)
* fix: unload model while loading cause unknown error

* chore: mask placeholder
2024-04-09 11:31:42 +07:00
NamH
e0d6049d66
chore: extension should register its own models (#2601)
* chore: extension should register its own models

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-05 14:18:58 +07:00
Louis
1eaf13b13e
fix: cancel loading model with stop action (#2607) 2024-04-04 10:57:54 +07:00
NamH
fa35aa6e14
feat: dynamically register extension settings (#2494)
* feat: add extesion settings

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-29 15:44:46 +07:00
Louis
9551996e34
chore: load, unload model and inference synchronously 2024-03-25 12:25:30 +07:00
Louis
acbec78dbf
fix: refactor inference engines to extends AIEngine (#2347)
* fix: refactor nitro to extends localoaiengine

* fix: refactor openai extension

* chore: refactor groq extension

* chore: refactor triton tensorrt extension

* chore: add tests

* chore: refactor engines
2024-03-22 09:35:14 +07:00
Louis
ff7ec39915
fix: incompatible browser dependency (#2439)
* fix: incompatible browser dependency

* fix: update model extension to use rollup

* fix: test timeout
2024-03-21 16:54:42 +07:00
Louis
d85d02693b
feat: Nitro-Tensorrt-LLM Extension (#2280)
* feat: tensorrt-llm-extension

* fix: loading

* feat: add download tensorrt llm runner

Signed-off-by: James <james@jan.ai>

* feat: update to rollupjs instead of webpack for monitoring extension

Signed-off-by: James <james@jan.ai>

* feat: move update nvidia info to monitor extension

Signed-off-by: James <james@jan.ai>

* allow download tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* allow download tensor rt based on gpu setting

Signed-off-by: James <james@jan.ai>

* update downloaded models

Signed-off-by: James <james@jan.ai>

* feat: add extension compatibility

* dynamic tensor rt engines

Signed-off-by: James <james@jan.ai>

* update models

Signed-off-by: James <james@jan.ai>

* chore: remove ts-ignore

* feat: getting installation state from extension

Signed-off-by: James <james@jan.ai>

* chore: adding type for decompress

Signed-off-by: James <james@jan.ai>

* feat: update according Louis's comment

Signed-off-by: James <james@jan.ai>

* feat: add progress for installing extension

Signed-off-by: James <james@jan.ai>

* chore: remove args from extension installation

* fix: model download does not work properly

* fix: do not allow user to stop tensorrtllm inference

* fix: extension installed style

* fix: download tensorrt does not update state

Signed-off-by: James <james@jan.ai>

* chore: replace int4 by fl16

* feat: modal for installing extension

Signed-off-by: James <james@jan.ai>

* fix: start download immediately after press install

Signed-off-by: James <james@jan.ai>

* fix: error switching between engines

* feat: rename inference provider to ai engine and refactor to core

* fix: missing ulid

* fix: core bundler

* feat: add cancel extension installing

Signed-off-by: James <james@jan.ai>

* remove mocking for mac

Signed-off-by: James <james@jan.ai>

* fix: show models only when extension is ready

* add tensorrt badge for model

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: add compatible check (#2342)

* fix: add compatible check

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: font

* fix: copy

* fix: broken monitoring extension

* chore: bump engine

* fix: copy

* fix: model copy

* fix: copy

* fix: model json

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* fix: vulkan support

* fix: installation button padding

* fix: empty script

* fix: remove hard code string

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
2024-03-14 14:07:22 +07:00
NamH
f36d740b1e
feat: add quick ask (#2197)
* feat: add quick ask

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-08 10:01:37 +07:00
Louis
42675891a6
chore: bump nitro 0.3.14 (#2183) 2024-02-28 11:25:30 +07:00
NamH
773963a456
feat: add import model (#2104)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-26 16:15:10 +07:00
Louis
3c8caf3345
fix: correct vulkan settings (#2128) 2024-02-22 21:18:39 +07:00
hiento09
2f4bffdbef
Bump nitro from 0.3.12 to 0.3.13 (#2124)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-22 18:00:37 +07:00
hiento09
a71c74d468
Fix: Linux vulkan binary path (#2123)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-22 14:48:00 +07:00
hiro
926f19bd9b
feat: Add nitro vulkan to support AMD GPU/ APU and Intel Arc GPU (#2056)
* feat: add vulkan support on windows and linux

* fix: correct vulkan settings

* fix: gpu settings and enable Vulkan support

* fix: vulkan support 1 device at a time only

* inference-nitro-extension add download vulkaninfo

---------

Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-22 11:19:36 +07:00
Louis
7fbc6cb6c0
fix: failed to bind port - nitro error message copy (#2101)
* fix: failed to bind port - nitro error message copy

* fix: copy
2024-02-20 13:54:21 +07:00
hiento09
2cbbe1bcd3
Fix bug #2005 docker blank website (#2093)
* Web: change API_BASE_URL to build time env

* Update Dockerfile and Docker Compose by adding env API_BASE_URL

* Update make clean

* INFERENCE_URL get from baseApiUrl

* Fix error settings/settings.json not found when start server at the first time

* Update README docker

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-19 23:30:59 +07:00
hiro
69244e6ced
chore: Update version.txt to 0.3.12 (#2057) 2024-02-16 23:41:11 +07:00
Louis
3412a23654
chore: prettier fix (#2019) 2024-02-15 08:38:05 +07:00
Louis
f0fd2c5a2a
fix: model path backward compatible (#2018) 2024-02-14 23:04:46 +07:00
Louis
3b51f3d1aa
chore: bump nitro 0.3.9 (#2016) 2024-02-14 16:05:27 +07:00
0xgokuz
875c2bc3c9
feat: Thread titles should auto-summarize Topic (#1976) 2024-02-10 19:16:42 +07:00
Louis
eb09399fbf
chore: reduce bundle size (#1970)
* chore: reduce bundle size

* chore: trimming langchainjs

* chore: trim pdf-parse
2024-02-09 19:23:56 +07:00
Louis
5890ade451
chore: server download progress + S3 (#1925)
* fix: reduce the number of api call

Signed-off-by: James <james@jan.ai>

* fix: download progress

Signed-off-by: James <james@jan.ai>

* chore: save blob

* fix: server boot up

* fix: download state not updating

Signed-off-by: James <james@jan.ai>

* fix: copy assets

* Add Dockerfile CPU for Jan Server and Jan Web

* Add Dockerfile GPU for Jan Server and Jan Web

* feat: S3 adapter

* Update check find count from ./pre-install and correct copy:asserts command

* server add bundleDependencies @janhq/core

* server add bundleDependencies @janhq/core

* fix: update success/failed download state (#1945)

* fix: update success/failed download state

Signed-off-by: James <james@jan.ai>

* fix: download model progress and state handling for both Desktop and Web

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* chore: refactor

* fix: load models empty first time open

* Add Docker compose

* fix: assistants onUpdate

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: NamH <NamNh0122@gmail.com>
2024-02-07 17:54:35 +07:00
hiento09
4471b2c941
feat: User Selectable GPUs and GPU-based Model Recommendations (#1730) 2024-02-06 17:31:46 +07:00
Louis
f43fae2e86
Merge pull request #1919 from janhq/main
Sync release 0.4.6 to dev
2024-02-05 10:08:39 +07:00
Louis
eaa3053d40
fix: openAIEmbedding now requires top level API Key configuration (#1902)
* fix: openAIEmbedding now requires top level API Key configuration

* chore: typo
2024-02-02 13:28:21 +07:00