231 Commits

Author SHA1 Message Date
Louis
8e8dfd4b37
refactor: introduce inference tools (#2493) 2024-03-25 23:26:05 +07:00
Louis
9551996e34
chore: load, unload model and inference synchronously 2024-03-25 12:25:30 +07:00
Louis
c2f6330daf
chore: log system information for debugging (#2453) 2024-03-22 12:34:44 +07:00
Louis
acbec78dbf
fix: refactor inference engines to extend AIEngine (#2347)
* fix: refactor nitro to extend localoaiengine

* fix: refactor openai extension

* chore: refactor groq extension

* chore: refactor triton tensorrt extension

* chore: add tests

* chore: refactor engines
2024-03-22 09:35:14 +07:00
Louis
ff7ec39915
fix: incompatible browser dependency (#2439)
* fix: incompatible browser dependency

* fix: update model extension to use rollup

* fix: test timeout
2024-03-21 16:54:42 +07:00
NamH
b8d86df688
Fix/unable factory reset windows nitro running (#2422)
* fix: unable to factory reset when nitro is running on windows

---------

Signed-off-by: James <james@jan.ai>
2024-03-19 18:05:03 +07:00
Louis
489e8aab24
Sync release 0.4.9 to dev (#2407)
* fix: move tensorrt executable to engine (#2400)

* fix: move tensorrt executable to engine

Signed-off-by: James <james@jan.ai>

* some update

Signed-off-by: hiro <hiro@jan.ai>

* chore: bump tensorrt version

* fix: wrong destroy path

* fix: install extensions in parallel

* chore: update path for tensorrt engine (#2404)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* Release/v0.4.9 (#2421)

* fix: turn off experimental settings should also turn off quick ask (#2411)

* fix: app glitches 1s generating response before starting model (#2412)

* fix: disable experimental feature should also disable vulkan (#2414)

* fix: model load stuck on windows when can't get CPU core count (#2413)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* feat: TensorRT-LLM engine update support (#2415)

* fix: engine update

* chore: add remove prepopulated models

Signed-off-by: James <james@jan.ai>

* update tinyjensen url

Signed-off-by: James <james@jan.ai>

* update llamacorn

Signed-off-by: James <james@jan.ai>

* update Mistral 7B Instruct v0.1 int4

Signed-off-by: James <james@jan.ai>

* update tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: hiro <hiro@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* prettier

Signed-off-by: James <james@jan.ai>

* update mistral config

Signed-off-by: James <james@jan.ai>

* fix some lint

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>

* Tensorrt LLM disable turing support (#2418)

Co-authored-by: Hien To <tominhhien97@gmail.com>

* chore: add prompt template tensorrtllm (#2375)

* chore: add prompt template tensorrtllm

* Add Prompt template for mistral and correct model metadata

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>

* fix: correct tensorrt mistral model.json (#2419)

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-03-19 12:20:09 +07:00
Meta Spartan
0348aa3321
feat: Groq Inference Extension (#2263)
* feat: Groq Inference Extension

* Add Groq supported models

* Fix folder typo

* Add Groq options to interface and new API Key saving, tested working

* Fix linting
2024-03-18 06:40:20 +07:00
NamH
6dfa23cd02
fix: unable to start web with quick ask (#2402)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-17 13:17:39 +07:00
NamH
03a0978f53
fix: some cosmetic issues: badges corner, recommended for tensorrt models (#2346)
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
2024-03-14 15:58:42 +07:00
Louis
d85d02693b
feat: Nitro-Tensorrt-LLM Extension (#2280)
* feat: tensorrt-llm-extension

* fix: loading

* feat: add download tensorrt llm runner

Signed-off-by: James <james@jan.ai>

* feat: update to rollupjs instead of webpack for monitoring extension

Signed-off-by: James <james@jan.ai>

* feat: move update nvidia info to monitor extension

Signed-off-by: James <james@jan.ai>

* allow download tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* allow download tensor rt based on gpu setting

Signed-off-by: James <james@jan.ai>

* update downloaded models

Signed-off-by: James <james@jan.ai>

* feat: add extension compatibility

* dynamic tensor rt engines

Signed-off-by: James <james@jan.ai>

* update models

Signed-off-by: James <james@jan.ai>

* chore: remove ts-ignore

* feat: getting installation state from extension

Signed-off-by: James <james@jan.ai>

* chore: adding type for decompress

Signed-off-by: James <james@jan.ai>

* feat: update according to Louis's comment

Signed-off-by: James <james@jan.ai>

* feat: add progress for installing extension

Signed-off-by: James <james@jan.ai>

* chore: remove args from extension installation

* fix: model download does not work properly

* fix: do not allow user to stop tensorrtllm inference

* fix: extension installed style

* fix: download tensorrt does not update state

Signed-off-by: James <james@jan.ai>

* chore: replace int4 by fl16

* feat: modal for installing extension

Signed-off-by: James <james@jan.ai>

* fix: start download immediately after press install

Signed-off-by: James <james@jan.ai>

* fix: error switching between engines

* feat: rename inference provider to ai engine and refactor to core

* fix: missing ulid

* fix: core bundler

* feat: add cancel extension installing

Signed-off-by: James <james@jan.ai>

* remove mocking for mac

Signed-off-by: James <james@jan.ai>

* fix: show models only when extension is ready

* add tensorrt badge for model

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: add compatible check (#2342)

* fix: add compatible check

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: font

* fix: copy

* fix: broken monitoring extension

* chore: bump engine

* fix: copy

* fix: model copy

* fix: copy

* fix: model json

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* fix: vulkan support

* fix: installation button padding

* fix: empty script

* fix: remove hard code string

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
2024-03-14 14:07:22 +07:00
Louis
e5369ed151
fix: do not migrate extensions from quick ask window (#2336) 2024-03-13 01:11:21 +07:00
Louis
af5bcea773
fix: gate quick ask with feature toggle (#2331) 2024-03-12 20:10:59 +07:00
Louis
b9b421a495
fix: quick app bugs (#2327) 2024-03-12 17:25:50 +07:00
NamH
f36d740b1e
feat: add quick ask (#2197)
* feat: add quick ask

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-08 10:01:37 +07:00
Louis
49ad855843
feat: prompt user to download an update manually (#2261) 2024-03-07 09:36:42 +07:00
Louis
5ca3069fa1
fix: image upload button and drag event are not enabled (#2248)
* fix: image upload button and drag event are not enabled

* fix: add tooltips for unsupported model
2024-03-06 09:09:38 +07:00
hiro
e6c10202e0
feat: Jan can see (#2069)
* feat: jan can see

feat: Add GPT-4 Vision model (Preview)

fix: Add visionModel as property in ModelInfo

fix: Fix condition to load local messages in useSetActiveThread hook

feat: Enable Image as input for chat

fix: Update model parameters in JSON files for remote GPT models

fix: Add thread as optional

fix: Add support for message as image

fix: Linter

fix: Update proxyModel to proxy_model and add textModel

chore: Change proxyModel to proxy_model

fix: Update settings with visionModel and textModel

fix: vision model passed through the retrieval tool

fix: linter

* fix: could not load image and request is not able to be sent

---------

Co-authored-by: Louis <louis@jan.ai>
2024-03-05 08:33:09 +07:00
Faisal Amir
635435fbb8
Revert feat: temporary remove dark mode (#2221)
* Revert "feat: temporary remove dark mode :(  (#2168)"

This reverts commit 222b4ad897c275dab0eaec3c8a8472bf3df7afc4.

* fix: revert darkmode and fix darkmode for import model

* fix: prettier format import model

---------

Co-authored-by: Louis <louis@jan.ai>
2024-03-04 14:46:10 +07:00
NamH
b70e7fb917
fix: comment from QA regarding import model (#2213)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-03 20:38:01 +07:00
NamH
52f84dce9f
chore: clean up some redundant code (#2215)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-03 17:54:01 +07:00
NamH
d7070d8c4a
fix: some bugs for import model (#2181)
* fix: some bugs for import model

Signed-off-by: James <james@jan.ai>

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-27 23:59:37 +07:00
Faisal Amir
883d631452
fix space between progress bar and title list of gpu (#2177) 2024-02-27 21:25:31 +07:00
Faisal Amir
222b4ad897
feat: temporary remove dark mode :( (#2168)
* remove darkmode

* fix progress component background color
2024-02-27 20:39:57 +07:00
Faisal Amir
45efcad233
fix: style list of gpu on system monitor (#2172) 2024-02-27 16:55:01 +07:00
Faisal Amir
b1e748b958
fix: system monitor expand overlap tooltip ribbon (#2158) 2024-02-27 13:24:32 +07:00
NamH
773963a456
feat: add import model (#2104)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-26 16:15:10 +07:00
Louis
90108745e2
Merge pull request #2151 from janhq/main
Sync release 0.4.7 to dev
2024-02-26 11:15:21 +07:00
Faisal Amir
563f37f884
fix: system monitor ui (#2135) 2024-02-23 17:12:26 +07:00
Louis
98b7fd2c3b
fix: chore UI (#2125) 2024-02-22 19:38:56 +07:00
Faisal Amir
56be7742e7
feat: revamp system monitor (#2097)
* feat: revamp system monitor

* feat: revamp system monitor ui

* remove system monitor page

* fix e2e test navigation

* added click outside system monitor

* update height content system monitor
2024-02-22 13:51:23 +07:00
Faisal Amir
94bee0b1b6
fix: flow edit message (#2113)
* fix flow edit message

* fix flow edit message

* remove unused import
2024-02-21 21:55:37 +07:00
Louis
cffc382caf
fix: tools section should be expanded by default (#2110) 2024-02-21 08:45:37 +07:00
Louis
7fbc6cb6c0
fix: failed to bind port - nitro error message copy (#2101)
* fix: failed to bind port - nitro error message copy

* fix: copy
2024-02-20 13:54:21 +07:00
Faisal Amir
6b88d4df4d
fix: remove caret down icon when tab selected into remote model (#2102) 2024-02-20 13:09:08 +07:00
Louis
780f957b9a
fix: expand assistant and model settings by default (#2081)
* fix: expand assistant and model settings by default

* fix: add proxy enabled toggle
2024-02-19 12:44:04 +07:00
Louis
47b890bba5
fix: message should only be interrupted when i start another thread (#2053)
* fix: message should only be interrupted when i start another thread

* fix: thread lost message streaming if navigate to another thread

* fix: state issue with useThreads
2024-02-16 17:34:23 +07:00
Louis
63cffca51e
chore: refactor watch system resource hook (#2048) 2024-02-16 13:22:21 +07:00
NamH
42da19a463
fix: download multiple binaries (#2043)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-16 11:32:14 +07:00
Louis
a96053ebef
fix: app should generate thread title with length restriction (#2037) 2024-02-15 22:38:13 +07:00
Van Pham
82b361a5be
feat: Initialize POM structure with fixtures on Playwright (#2015)
* feat: video recorder on failures

* feat: fixture for sample page class

* feat: video recorder on failures

* feat: fixture for sample page class

* feat: video recorder on failures

* feat: fixture for sample page class

* feat: Apply Screenshot on failures

* feat: set timeout by default

* chore: clean up import

* feat: video recorder on failures

* feat: fixture for sample page class

* feat: add wait for app update

* chore: correct timeout

* chore: correct timeout

* chore: test timeout

* chore: test timeout

* chore: test timeout

* chore: browser context config

* chore: temporarily disable the video recorder to bypass issue
2024-02-15 20:18:02 +07:00
Faisal Amir
ab77622e73
fix: content setting right panel default to collapse (#2026) 2024-02-15 15:53:15 +07:00
Louis
3ab23d571f
fix: local server blank parameters if there is no thread selected (#2028)
* fix: local server blank parameters if there is no thread selected

* fix: show errors only when the selected model is started while starting the server

* fix: show loading progress of selected model only
2024-02-15 15:48:45 +07:00
Louis
3412a23654
chore: prettier fix (#2019) 2024-02-15 08:38:05 +07:00
Louis
f2e31874e1
fix: resolve state update loop infinite rerendering (#2017)
* fix: resolve state update loop infinite rerendering

* fix: thread creation issue
2024-02-14 19:40:18 +07:00
Faisal Amir
6f520b4534
feat: improve UI/UX gpu acceleration feature (#1990) 2024-02-13 15:36:42 +07:00
Faisal Amir
7ae6e35746
feat: add edit messages users (#1974)
* feat: add edit message user

* fix: delete messages should delete the rest below

---------

Co-authored-by: Louis <louis@jan.ai>
2024-02-12 13:00:24 +07:00
Louis
ac18fc1181
fix: decouple thread summary update (#1994) 2024-02-11 20:22:12 +07:00
0xgokuz
875c2bc3c9
feat: Thread titles should auto-summarize Topic (#1976) 2024-02-10 19:16:42 +07:00
Faisal Amir
5ec4b8e532
feat: revamp ui dropdown list model option (#1977)
* feat: add modal troubleshooting guideline

* resolve inconsistent message hidden

* feat: revamp ui dropdown list model option

* display model id and copy button

* add function copy id model from dropdownlist

* add info concurrently send requests to one active local model
2024-02-10 00:51:16 +07:00