222 Commits

Author SHA1 Message Date
Van Pham
8c3dd3a1af
chore: correct copy for Jan data folder to highlight folder intact (#2751) 2024-04-18 11:48:19 +07:00
NamH
95632788e4
chore: default context length to 2048 (#2746) 2024-04-17 19:14:51 +07:00
NamH
31397de2d1
Refactor/deprecate hugging face ext (#2620)
* refactor: deprecate huggingface extension

Signed-off-by: James <james@jan.ai>
2024-04-16 17:23:45 +07:00
Faisal Amir
49401bd8eb
fix: broken several ui when model name too long (#2728) 2024-04-15 21:52:36 +07:00
Louis
9369ac3e8b
Merge branch 'dev' into main 2024-04-15 14:57:31 +07:00
Faisal Amir
1619478250
enhancement: update input component dynamic text alignment (#2712) 2024-04-15 09:48:57 +07:00
Louis
997d0e72c5
fix: add new shortcut to create new thread (#2701) 2024-04-12 14:06:47 +07:00
NamH
7d67087919
fix: add markdown support for extension description (#2691)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-11 17:43:59 +07:00
Louis
065ed03099
fix: wrong monitoring system information type (#2679) 2024-04-11 11:07:31 +07:00
Louis
b19234ed71
chore: Extension should have product name in manifest (#2675)
* chore: Extension should have product name in manifest

* chore: typo
2024-04-11 09:50:58 +07:00
Louis
d93d74c86b
feat: nitro additional dependencies (#2674) 2024-04-11 09:13:02 +07:00
Louis
3f23de6c28
feat: move log into monitoring extension (#2662) 2024-04-10 14:35:15 +07:00
NamH
c0c038f4f7
chore: filter gguf files when importing model on windows (#2653)
Co-authored-by: Louis <louis@jan.ai>
2024-04-09 10:36:01 +07:00
Louis
f8cf93a906
chore: add GPU driver and toolkit status (#2628) 2024-04-08 09:50:16 +07:00
Louis
1eaf13b13e
fix: cancel loading model with stop action (#2607) 2024-04-04 10:57:54 +07:00
NamH
345c7d58e6
chore: some wordings in extension settings (#2573)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-02 15:31:20 +07:00
NamH
fa35aa6e14
feat: dynamically register extension settings (#2494)
* feat: add extesion settings

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-29 15:44:46 +07:00
NamH
3ecdb81881
fix: file explore on windows show empty when importing model (#2484)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-29 01:24:53 +07:00
NamH
67e285fa96
chore: remove rmdirsync from core api since it is deprecated (#2459)
* chore: remove rmdirsync from core api since it is deprecated

Signed-off-by: James <james@jan.ai>

* chore: remove mkdirsync

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-22 17:57:16 +07:00
NamH
419683d415
fix: move jan to not empty destination folder not functioning (#2464)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-22 16:03:39 +07:00
Louis
acbec78dbf
fix: refactor inference engines to extends AIEngine (#2347)
* fix: refactor nitro to extends localoaiengine

* fix: refactor openai extension

* chore: refactor groq extension

* chore: refactor triton tensorrt extension

* chore: add tests

* chore: refactor engines
2024-03-22 09:35:14 +07:00
NamH
b8d86df688
Fix/unable factory reset windows nitro running (#2422)
* fix: unable to factory reset when nitro is running on windows

---------

Signed-off-by: James <james@jan.ai>
2024-03-19 18:05:03 +07:00
Louis
489e8aab24
Sync release 0.4.9 to dev (#2407)
* fix: move tensorrt executable to engine (#2400)

* fix: move tensorrt executable to engine

Signed-off-by: James <james@jan.ai>

* some update

Signed-off-by: hiro <hiro@jan.ai>

* chore: bump tensorrt version

* fix: wrong destroy path

* fix: install extensions in parallel

* chore: update path for tensorrt engine (#2404)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* Release/v0.4.9 (#2421)

* fix: turn off experimental settings should also turn off quick ask (#2411)

* fix: app glitches 1s generating response before starting model (#2412)

* fix: disable experimental feature should also disable vulkan (#2414)

* fix: model load stuck on windows when can't get CPU core count (#2413)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* feat: TensorRT-LLM engine update support (#2415)

* fix: engine update

* chore: add remove prepopulated models

Signed-off-by: James <james@jan.ai>

* update tinyjensen url

Signed-off-by: James <james@jan.ai>

* update llamacorn

Signed-off-by: James <james@jan.ai>

* update Mistral 7B Instruct v0.1 int4

Signed-off-by: James <james@jan.ai>

* update tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: hiro <hiro@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* prettier

Signed-off-by: James <james@jan.ai>

* update mistral config

Signed-off-by: James <james@jan.ai>

* fix some lint

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>

* Tensorrt LLM disable turing support (#2418)

Co-authored-by: Hien To <tominhhien97@gmail.com>

* chore: add prompt template tensorrtllm (#2375)

* chore: add prompt template tensorrtllm

* Add Prompt template for mistral and correct model metadata

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>

* fix: correct tensorrt mistral model.json (#2419)

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-03-19 12:20:09 +07:00
Meta Spartan
0348aa3321
feat: Groq Inference Extension (#2263)
* feat: Groq Inference Extension

* Add Groq supported models

* Fix folder typo

* Add Groq options to interface and new API Key saving, tested working

* Fix linting
2024-03-18 06:40:20 +07:00
NamH
5f19983de1
fix: some regressions for tensorrt nightly build (#2380)
* fix: some regressions for tensorrt nightly build

Signed-off-by: James <james@jan.ai>
---------

Signed-off-by: hiro <hiro@jan.ai>
Signed-off-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-15 17:45:56 +07:00
Louis
758afdbeb4
fix: incompatible GPU error message (#2357)
* fix: incompatible GPU error message

* fix: change port
2024-03-14 22:11:55 +07:00
Faisal Amir
8120ad2a03
fix: badge or progress tensorRtExtensionItem (#2349) 2024-03-14 18:37:54 +07:00
Louis
d85d02693b
feat: Nitro-Tensorrt-LLM Extension (#2280)
* feat: tensorrt-llm-extension

* fix: loading

* feat: add download tensorrt llm runner

Signed-off-by: James <james@jan.ai>

* feat: update to rollupjs instead of webpack for monitoring extension

Signed-off-by: James <james@jan.ai>

* feat: move update nvidia info to monitor extension

Signed-off-by: James <james@jan.ai>

* allow download tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* allow download tensor rt based on gpu setting

Signed-off-by: James <james@jan.ai>

* update downloaded models

Signed-off-by: James <james@jan.ai>

* feat: add extension compatibility

* dynamic tensor rt engines

Signed-off-by: James <james@jan.ai>

* update models

Signed-off-by: James <james@jan.ai>

* chore: remove ts-ignore

* feat: getting installation state from extension

Signed-off-by: James <james@jan.ai>

* chore: adding type for decompress

Signed-off-by: James <james@jan.ai>

* feat: update according Louis's comment

Signed-off-by: James <james@jan.ai>

* feat: add progress for installing extension

Signed-off-by: James <james@jan.ai>

* chore: remove args from extension installation

* fix: model download does not work properly

* fix: do not allow user to stop tensorrtllm inference

* fix: extension installed style

* fix: download tensorrt does not update state

Signed-off-by: James <james@jan.ai>

* chore: replace int4 by fl16

* feat: modal for installing extension

Signed-off-by: James <james@jan.ai>

* fix: start download immediately after press install

Signed-off-by: James <james@jan.ai>

* fix: error switching between engines

* feat: rename inference provider to ai engine and refactor to core

* fix: missing ulid

* fix: core bundler

* feat: add cancel extension installing

Signed-off-by: James <james@jan.ai>

* remove mocking for mac

Signed-off-by: James <james@jan.ai>

* fix: show models only when extension is ready

* add tensorrt badge for model

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: add compatible check (#2342)

* fix: add compatible check

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: font

* fix: copy

* fix: broken monitoring extension

* chore: bump engine

* fix: copy

* fix: model copy

* fix: copy

* fix: model json

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* fix: vulkan support

* fix: installation button padding

* fix: empty script

* fix: remove hard code string

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
2024-03-14 14:07:22 +07:00
Louis
bff20ab87b
fix: put quick ask feature toggle under experimental feature (#2338) 2024-03-13 10:18:20 +07:00
Louis
af5bcea773
fix: gate quick ask with feature toggle (#2331) 2024-03-12 20:10:59 +07:00
NamH
f36d740b1e
feat: add quick ask (#2197)
* feat: add quick ask

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-08 10:01:37 +07:00
Louis
7556430693
fix: should not attach error messages to the completion request (#2258) 2024-03-06 16:15:46 +07:00
Faisal Amir
635435fbb8
Revert feat: temporary remove dark mode (#2221)
* Revert "feat: temporary remove dark mode :(  (#2168)"

This reverts commit 222b4ad897c275dab0eaec3c8a8472bf3df7afc4.

* fix: revert darkmode and fix darkmode for import model

* fix: prettier format import model

---------

Co-authored-by: Louis <louis@jan.ai>
2024-03-04 14:46:10 +07:00
NamH
b70e7fb917
fix: comment from QA regarding import model (#2213)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-03 20:38:01 +07:00
NamH
52f84dce9f
chore: clean up some redundant code (#2215)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-03 17:54:01 +07:00
NamH
d7070d8c4a
fix: some bugs for import model (#2181)
* fix: some bugs for import model

Signed-off-by: James <james@jan.ai>

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-27 23:59:37 +07:00
Faisal Amir
222b4ad897
feat: temporary remove dark mode :( (#2168)
* remove darkmode

* fix progress component background color
2024-02-27 20:39:57 +07:00
NamH
773963a456
feat: add import model (#2104)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-26 16:15:10 +07:00
hiro
926f19bd9b
feat: Add nitro vulkan to support AMD GPU/ APU and Intel Arc GPU (#2056)
* feat: add vulkan support on windows and linux

* fix: correct vulkan settings

* fix: gpu settings and enable Vulkan support

* fix: vulkan support 1 device at a time only

* inference-nitro-extension add download vulkaninfo

---------

Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-22 11:19:36 +07:00
Louis
780f957b9a
fix: expand assistant and model settings by default (#2081)
* fix: expand assistant and model settings by default

* fix: add proxy enabled toggle
2024-02-19 12:44:04 +07:00
Louis
96df4ad67f
fix: move jan data folder - error handling - no write permission granted (#2077) 2024-02-18 23:05:36 +07:00
Louis
56034d682a
fix: should stop running the model when GPU settings are changed (#2067) 2024-02-18 15:09:15 +07:00
Louis
a3aceb8f60
fix: settings page state loop and dark theme (#2065)
* fix: settings page state loop and dark theme

* fix: crash on visiting settings page
2024-02-18 14:24:54 +07:00
NamH
b7e94aac02
fix: disable gpu drop down box if there's no GPU ready (#2046)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-16 10:46:42 +07:00
NamH
f5934d5a60
fix: ui for disabled state of gpu acceleration (#2034) 2024-02-15 22:38:23 +07:00
Faisal Amir
6f520b4534
feat: imporve UI/UX gpu acceleration feature (#1990) 2024-02-13 15:36:42 +07:00
hiento09
4471b2c941
feat: User Selectable GPUs and GPU-based Model Recommendations (#1730) 2024-02-06 17:31:46 +07:00
NamH
01fec49798
fix: reduce the number of api call (#1896)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-05 13:13:39 +07:00
Louis
eaa3053d40
fix: openAIEmbedding now requires top level API Key configuration (#1902)
* fix: openAIEmbedding now requires top level API Key configuration

* chore: typo
2024-02-02 13:28:21 +07:00
Louis
9e4658f5b2
fix: show generate response on message send (#1895)
* fix: show generate response on message send

* chore: typo
2024-02-01 23:59:45 +07:00