851 Commits

Author SHA1 Message Date
Louis
3b755f7e8c
Merge pull request #2591 from janhq/main
use `tabular-nums` for percentages in ModelCancelDownload (#2553)
2024-04-03 10:59:58 +07:00
Louis
3fded8fd20
fix: regenerate message should not include the assistant's previous response (#2583) 2024-04-02 22:17:35 +07:00
Louis
6f0ee23776
fix: missing load model error message (#2581) 2024-04-02 22:16:43 +07:00
Louis
9797e5b05f
fix: broken auto link (#2580) 2024-04-02 18:45:35 +07:00
Masataka Yakura
2a6f7745c8
use tabular-nums for percentages in ModelCancelDownload (#2553)
with this patch numbers in percentages have the same width so all the component widths stay the same

https://tailwindcss.com/docs/font-variant-numeric#tabular-figures
2024-04-02 17:54:05 +07:00
Louis
89653eda08
fix: disabling Vulkan causes users to be stuck in GPU-accelerated mode (#2578) 2024-04-02 17:35:15 +07:00
hiento09
7feaf0b3bd
Change npm registry to nexus for CI test and enable turbo remote cache (#2535)
* Change npm registry to nexus for CI test

* Change npm registry to nexus for CI test

* Add yarn.lock

* Remove clean step

* Revert to disable yarn.lock file

* Turn NPM Proxy to env

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-02 15:34:26 +07:00
NamH
345c7d58e6
chore: some wordings in extension settings (#2573)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-02 15:31:20 +07:00
Louis
c6198ae423
fix: quick ask blocks app update (#2525) 2024-04-02 12:32:12 +07:00
Louis
f6d3b53ab5
Merge branch 'main' into dev
# Conflicts:
#	web/screens/Chat/ErrorMessage/index.tsx
2024-04-02 11:09:59 +07:00
GenkaOk
dd8be07120
Set name assistant (#2529) 2024-04-01 11:32:06 +07:00
NamH
96af5fb85a
fix: quick ask improvement (#2543)
* docs: Update README.md

* fix: quick ask improvement

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: hieu-jan <150573299+henryh0x1@users.noreply.github.com>
Co-authored-by: James <james@jan.ai>
2024-03-30 08:59:52 +07:00
Louis
1451f678db
fix: file input update delay (#2540) 2024-03-29 22:02:48 +07:00
Louis
228a363914
fix: image model does not work when retrieval tool is enabled (#2538) 2024-03-29 16:07:49 +07:00
NamH
fa35aa6e14
feat: dynamically register extension settings (#2494)
* feat: add extesion settings

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-29 15:44:46 +07:00
NamH
132487acec
fix: show default error message from inference engine (#2533)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-29 11:53:34 +07:00
NamH
3ecdb81881
fix: file explore on windows show empty when importing model (#2484)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-29 01:24:53 +07:00
Louis
6e01e04c76
fix: should not show broken image (#2526) 2024-03-28 16:32:33 +07:00
NamH
1f8dc893ba
fix: remove files and memory when user clean thread (#2524)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-28 13:54:38 +07:00
Louis
784af8cf55
fix: model recommendation label is not based on VRAM (#2517) 2024-03-27 18:21:33 +07:00
Louis
fe730fbe42
fix: load model fails - generating button is not updated (#2498) 2024-03-27 15:52:06 +07:00
Louis
7857a6e75e
fix: upload document mid-thread does not work (#2504) 2024-03-26 22:22:54 +07:00
Louis
d290ae1977
fix: text overflow 2024-03-25 23:58:36 +07:00
Louis
8e8dfd4b37
refactor: introduce inference tools (#2493) 2024-03-25 23:26:05 +07:00
Louis
14a67463dc
chore: refactor core folder structure - module based 2024-03-25 16:20:06 +07:00
Louis
77cbdc2dcf
refactor: introduce message request builder (#2481) 2024-03-25 12:50:29 +07:00
Louis
9551996e34
chore: load, unload model and inference synchronously 2024-03-25 12:25:30 +07:00
NamH
67e285fa96
chore: remove rmdirsync from core api since it is deprecated (#2459)
* chore: remove rmdirsync from core api since it is deprecated

Signed-off-by: James <james@jan.ai>

* chore: remove mkdirsync

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-22 17:57:16 +07:00
Louis
254a79ccbe
fix: turborepo extensions (#2392)
* fix: turborepo extensions

Update package.json

Update Makefile

Update Makefile

Update Makefile

Update Makefile

Update Makefile

Update package.json

* chore: turbo cache

* fix: install extensions in parallel

* fix: timeout issue

* Turbo cache using s3

* Remove cache task

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: Service Account <service@jan.ai>
2024-03-22 17:53:20 +07:00
NamH
419683d415
fix: move jan to not empty destination folder not functioning (#2464)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-22 16:03:39 +07:00
Louis
5edc24d8e0
fix: deleting threads manually breaks model settings and document upload (#2456)
* fix: blank model settings and rag does not work

* fix: fallback support fro previous broken threads
2024-03-22 13:56:46 +07:00
Louis
c2f6330daf
chore: log system information for debugging (#2453) 2024-03-22 12:34:44 +07:00
Louis
acbec78dbf
fix: refactor inference engines to extends AIEngine (#2347)
* fix: refactor nitro to extends localoaiengine

* fix: refactor openai extension

* chore: refactor groq extension

* chore: refactor triton tensorrt extension

* chore: add tests

* chore: refactor engines
2024-03-22 09:35:14 +07:00
Louis
ff7ec39915
fix: incompatible browser dependency (#2439)
* fix: incompatible browser dependency

* fix: update model extension to use rollup

* fix: test timeout
2024-03-21 16:54:42 +07:00
Faisal Amir
f75d7e3ca7
fix: remove decodeURI for avoid URI malformed (#2416) 2024-03-20 10:08:18 +07:00
NamH
b8d86df688
Fix/unable factory reset windows nitro running (#2422)
* fix: unable to factory reset when nitro is running on windows

---------

Signed-off-by: James <james@jan.ai>
2024-03-19 18:05:03 +07:00
Louis
489e8aab24
Sync release 0.4.9 to dev (#2407)
* fix: move tensorrt executable to engine (#2400)

* fix: move tensorrt executable to engine

Signed-off-by: James <james@jan.ai>

* some update

Signed-off-by: hiro <hiro@jan.ai>

* chore: bump tensorrt version

* fix: wrong destroy path

* fix: install extensions in parallel

* chore: update path for tensorrt engine (#2404)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* Release/v0.4.9 (#2421)

* fix: turn off experimental settings should also turn off quick ask (#2411)

* fix: app glitches 1s generating response before starting model (#2412)

* fix: disable experimental feature should also disable vulkan (#2414)

* fix: model load stuck on windows when can't get CPU core count (#2413)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* feat: TensorRT-LLM engine update support (#2415)

* fix: engine update

* chore: add remove prepopulated models

Signed-off-by: James <james@jan.ai>

* update tinyjensen url

Signed-off-by: James <james@jan.ai>

* update llamacorn

Signed-off-by: James <james@jan.ai>

* update Mistral 7B Instruct v0.1 int4

Signed-off-by: James <james@jan.ai>

* update tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: hiro <hiro@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* prettier

Signed-off-by: James <james@jan.ai>

* update mistral config

Signed-off-by: James <james@jan.ai>

* fix some lint

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>

* Tensorrt LLM disable turing support (#2418)

Co-authored-by: Hien To <tominhhien97@gmail.com>

* chore: add prompt template tensorrtllm (#2375)

* chore: add prompt template tensorrtllm

* Add Prompt template for mistral and correct model metadata

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>

* fix: correct tensorrt mistral model.json (#2419)

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-03-19 12:20:09 +07:00
Meta Spartan
0348aa3321
feat: Groq Inference Extension (#2263)
* feat: Groq Inference Extension

* Add Groq supported models

* Fix folder typo

* Add Groq options to interface and new API Key saving, tested working

* Fix linting
2024-03-18 06:40:20 +07:00
NamH
6dfa23cd02
fix: unable to start web with quick ask (#2402)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-17 13:17:39 +07:00
NamH
5f19983de1
fix: some regressions for tensorrt nightly build (#2380)
* fix: some regressions for tensorrt nightly build

Signed-off-by: James <james@jan.ai>
---------

Signed-off-by: hiro <hiro@jan.ai>
Signed-off-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-15 17:45:56 +07:00
NamH
3e27e97110
fix: use model from model hub not load correct model in thread screen (#2368)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-15 10:33:59 +07:00
Louis
758afdbeb4
fix: incompatible GPU error message (#2357)
* fix: incompatible GPU error message

* fix: change port
2024-03-14 22:11:55 +07:00
Louis
aab8ee84af
fix: disable rag & stream settings from tensorrt model.json (#2351) 2024-03-14 19:04:31 +07:00
Faisal Amir
8120ad2a03
fix: badge or progress tensorRtExtensionItem (#2349) 2024-03-14 18:37:54 +07:00
NamH
d9c3852997
fix: ts error when declar var in case (#2348)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-14 17:16:14 +07:00
NamH
03a0978f53
fix: some costmetic issues: badges corner, recommended for tensorrt models(#2346)
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
2024-03-14 15:58:42 +07:00
Louis
d85d02693b
feat: Nitro-Tensorrt-LLM Extension (#2280)
* feat: tensorrt-llm-extension

* fix: loading

* feat: add download tensorrt llm runner

Signed-off-by: James <james@jan.ai>

* feat: update to rollupjs instead of webpack for monitoring extension

Signed-off-by: James <james@jan.ai>

* feat: move update nvidia info to monitor extension

Signed-off-by: James <james@jan.ai>

* allow download tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* allow download tensor rt based on gpu setting

Signed-off-by: James <james@jan.ai>

* update downloaded models

Signed-off-by: James <james@jan.ai>

* feat: add extension compatibility

* dynamic tensor rt engines

Signed-off-by: James <james@jan.ai>

* update models

Signed-off-by: James <james@jan.ai>

* chore: remove ts-ignore

* feat: getting installation state from extension

Signed-off-by: James <james@jan.ai>

* chore: adding type for decompress

Signed-off-by: James <james@jan.ai>

* feat: update according Louis's comment

Signed-off-by: James <james@jan.ai>

* feat: add progress for installing extension

Signed-off-by: James <james@jan.ai>

* chore: remove args from extension installation

* fix: model download does not work properly

* fix: do not allow user to stop tensorrtllm inference

* fix: extension installed style

* fix: download tensorrt does not update state

Signed-off-by: James <james@jan.ai>

* chore: replace int4 by fl16

* feat: modal for installing extension

Signed-off-by: James <james@jan.ai>

* fix: start download immediately after press install

Signed-off-by: James <james@jan.ai>

* fix: error switching between engines

* feat: rename inference provider to ai engine and refactor to core

* fix: missing ulid

* fix: core bundler

* feat: add cancel extension installing

Signed-off-by: James <james@jan.ai>

* remove mocking for mac

Signed-off-by: James <james@jan.ai>

* fix: show models only when extension is ready

* add tensorrt badge for model

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: add compatible check (#2342)

* fix: add compatible check

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: font

* fix: copy

* fix: broken monitoring extension

* chore: bump engine

* fix: copy

* fix: model copy

* fix: copy

* fix: model json

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* fix: vulkan support

* fix: installation button padding

* fix: empty script

* fix: remove hard code string

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
2024-03-14 14:07:22 +07:00
Louis
bff20ab87b
fix: put quick ask feature toggle under experimental feature (#2338) 2024-03-13 10:18:20 +07:00
Louis
e5369ed151
fix: do not migrate extensions from quick ask window (#2336) 2024-03-13 01:11:21 +07:00
Louis
af5bcea773
fix: gate quick ask with feature toggle (#2331) 2024-03-12 20:10:59 +07:00