200 Commits

Author SHA1 Message Date
Louis
254a79ccbe
fix: turborepo extensions (#2392)
* fix: turborepo extensions

Update package.json

Update Makefile

Update Makefile

Update Makefile

Update Makefile

Update Makefile

Update package.json

* chore: turbo cache

* fix: install extensions in parallel

* fix: timeout issue

* Turbo cache using s3

* Remove cache task

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: Service Account <service@jan.ai>
2024-03-22 17:53:20 +07:00
Louis
b8cee875b1
fix: app shows wrong toast on stopping inference (#2460) 2024-03-22 14:40:15 +07:00
Louis
c2f6330daf
chore: log system information for debugging (#2453) 2024-03-22 12:34:44 +07:00
Louis
acbec78dbf
fix: refactor inference engines to extends AIEngine (#2347)
* fix: refactor nitro to extends localoaiengine

* fix: refactor openai extension

* chore: refactor groq extension

* chore: refactor triton tensorrt extension

* chore: add tests

* chore: refactor engines
2024-03-22 09:35:14 +07:00
Louis
ff7ec39915
fix: incompatible browser dependency (#2439)
* fix: incompatible browser dependency

* fix: update model extension to use rollup

* fix: test timeout
2024-03-21 16:54:42 +07:00
NamH
b8d86df688
Fix/unable factory reset windows nitro running (#2422)
* fix: unable to factory reset when nitro is running on windows

---------

Signed-off-by: James <james@jan.ai>
2024-03-19 18:05:03 +07:00
Louis
489e8aab24
Sync release 0.4.9 to dev (#2407)
* fix: move tensorrt executable to engine (#2400)

* fix: move tensorrt executable to engine

Signed-off-by: James <james@jan.ai>

* some update

Signed-off-by: hiro <hiro@jan.ai>

* chore: bump tensorrt version

* fix: wrong destroy path

* fix: install extensions in parallel

* chore: update path for tensorrt engine (#2404)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* Release/v0.4.9 (#2421)

* fix: turn off experimental settings should also turn off quick ask (#2411)

* fix: app glitches 1s generating response before starting model (#2412)

* fix: disable experimental feature should also disable vulkan (#2414)

* fix: model load stuck on windows when can't get CPU core count (#2413)

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>

* feat: TensorRT-LLM engine update support (#2415)

* fix: engine update

* chore: add remove prepopulated models

Signed-off-by: James <james@jan.ai>

* update tinyjensen url

Signed-off-by: James <james@jan.ai>

* update llamacorn

Signed-off-by: James <james@jan.ai>

* update Mistral 7B Instruct v0.1 int4

Signed-off-by: James <james@jan.ai>

* update tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: hiro <hiro@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* prettier

Signed-off-by: James <james@jan.ai>

* update mistral config

Signed-off-by: James <james@jan.ai>

* fix some lint

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>

* Tensorrt LLM disable turing support (#2418)

Co-authored-by: Hien To <tominhhien97@gmail.com>

* chore: add prompt template tensorrtllm (#2375)

* chore: add prompt template tensorrtllm

* Add Prompt template for mistral and correct model metadata

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>

* fix: correct tensorrt mistral model.json (#2419)

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>

---------

Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-03-19 12:20:09 +07:00
Meta Spartan
0348aa3321
feat: Groq Inference Extension (#2263)
* feat: Groq Inference Extension

* Add Groq supported models

* Fix folder typo

* Add Groq options to interface and new API Key saving, tested working

* Fix linting
2024-03-18 06:40:20 +07:00
NamH
ed6bd14e02
chore: temporarily remove linux from tensorrt support (#2386)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-15 23:02:42 +07:00
NamH
5f19983de1
fix: some regressions for tensorrt nightly build (#2380)
* fix: some regressions for tensorrt nightly build

Signed-off-by: James <james@jan.ai>
---------

Signed-off-by: hiro <hiro@jan.ai>
Signed-off-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-15 17:45:56 +07:00
Louis
2d622614bf
Update models.json (#2382) 2024-03-15 16:37:00 +07:00
Louis
58e12f35c9
fix: wrong engine handling (#2363) 2024-03-14 23:59:42 +07:00
Louis
758afdbeb4
fix: incompatible GPU error message (#2357)
* fix: incompatible GPU error message

* fix: change port
2024-03-14 22:11:55 +07:00
Louis
0415786e20
fix: app does not recognize GPU first launch (#2350) 2024-03-14 19:28:38 +07:00
Louis
aab8ee84af
fix: disable rag & stream settings from tensorrt model.json (#2351) 2024-03-14 19:04:31 +07:00
Louis
d85d02693b
feat: Nitro-Tensorrt-LLM Extension (#2280)
* feat: tensorrt-llm-extension

* fix: loading

* feat: add download tensorrt llm runner

Signed-off-by: James <james@jan.ai>

* feat: update to rollupjs instead of webpack for monitoring extension

Signed-off-by: James <james@jan.ai>

* feat: move update nvidia info to monitor extension

Signed-off-by: James <james@jan.ai>

* allow download tensorrt

Signed-off-by: James <james@jan.ai>

* update

Signed-off-by: James <james@jan.ai>

* allow download tensor rt based on gpu setting

Signed-off-by: James <james@jan.ai>

* update downloaded models

Signed-off-by: James <james@jan.ai>

* feat: add extension compatibility

* dynamic tensor rt engines

Signed-off-by: James <james@jan.ai>

* update models

Signed-off-by: James <james@jan.ai>

* chore: remove ts-ignore

* feat: getting installation state from extension

Signed-off-by: James <james@jan.ai>

* chore: adding type for decompress

Signed-off-by: James <james@jan.ai>

* feat: update according to Louis's comment

Signed-off-by: James <james@jan.ai>

* feat: add progress for installing extension

Signed-off-by: James <james@jan.ai>

* chore: remove args from extension installation

* fix: model download does not work properly

* fix: do not allow user to stop tensorrtllm inference

* fix: extension installed style

* fix: download tensorrt does not update state

Signed-off-by: James <james@jan.ai>

* chore: replace int4 by fl16

* feat: modal for installing extension

Signed-off-by: James <james@jan.ai>

* fix: start download immediately after press install

Signed-off-by: James <james@jan.ai>

* fix: error switching between engines

* feat: rename inference provider to ai engine and refactor to core

* fix: missing ulid

* fix: core bundler

* feat: add cancel extension installing

Signed-off-by: James <james@jan.ai>

* remove mocking for mac

Signed-off-by: James <james@jan.ai>

* fix: show models only when extension is ready

* add tensorrt badge for model

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: add compatible check (#2342)

* fix: add compatible check

Signed-off-by: James <james@jan.ai>

* fix: copy

* fix: font

* fix: copy

* fix: broken monitoring extension

* chore: bump engine

* fix: copy

* fix: model copy

* fix: copy

* fix: model json

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* fix: vulkan support

* fix: installation button padding

* fix: empty script

* fix: remove hard code string

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
2024-03-14 14:07:22 +07:00
NamH
f36d740b1e
feat: add quick ask (#2197)
* feat: add quick ask

Signed-off-by: James <james@jan.ai>

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-08 10:01:37 +07:00
Louis
7556430693
fix: should not attach error messages to the completion request (#2258) 2024-03-06 16:15:46 +07:00
Louis
5ca3069fa1
fix: image upload button and drag event are not enabled (#2248)
* fix: image upload button and drag event are not enabled

* fix: add tooltips for unsupported model
2024-03-06 09:09:38 +07:00
NamH
510491253c
fix: error message being sent along with conversation when inference (#2242)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-05 22:09:38 +07:00
hiro
e6c10202e0
feat: Jan can see (#2069)
* feat: jan can see

feat: Add GPT-4 Vision model (Preview)

fix: Add visionModel as property in ModelInfo

fix: Fix condition to load local messages in useSetActiveThread hook

feat: Enable Image as input for chat

fix: Update model parameters in JSON files for remote GPT models

fix: Add thread as optional

fix: Add support for message as image

fix: Linter

fix: Update proxyModel to proxy_model and add textModel

chore: Change proxyModel to proxy_model

fix: Update settings with visionModel and textModel

fix: vision model passed through the retrieval tool

fix: linter

* fix: could not load image and request is not able to be sent

---------

Co-authored-by: Louis <louis@jan.ai>
2024-03-05 08:33:09 +07:00
Louis
42675891a6
chore: bump nitro 0.3.14 (#2183) 2024-02-28 11:25:30 +07:00
NamH
d7070d8c4a
fix: some bugs for import model (#2181)
* fix: some bugs for import model

Signed-off-by: James <james@jan.ai>

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-27 23:59:37 +07:00
hiento09
9078bbebea
Huggingface extension add codesign step for building on darwin (#2166)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-27 11:35:11 +07:00
hiento09
a28230caaa
Add run codesign for huggingface extension (#2163)
Co-authored-by: Service Account <service@jan.ai>
2024-02-27 11:01:38 +07:00
NamH
773963a456
feat: add import model (#2104)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-26 16:15:10 +07:00
Louis
90108745e2
Merge pull request #2151 from janhq/main
Sync release 0.4.7 to dev
2024-02-26 11:15:21 +07:00
Helloyunho
e86cd7e661
feat: add a simple way to convert Hugging Face model to GGUF (#1972)
* chore: add react developer tools to electron

* feat: add small convert modal

* feat: separate modals and add hugging face extension

* feat: fully implement hugging face converter

* fix: forgot to uncomment this...

* fix: typo

* feat: try hf-to-gguf script first and then use convert.py

HF-to-GGUF has support for some unusual models
maybe using convert.py first would be better but we can change the usage order later

* fix: pre-install directory changed

* fix: sometimes exit code is undefined

* chore: download additional files for qwen

* fix: event handling changed

* chore: add one more necessary package

* feat: download gguf-py from llama.cpp

* fix: cannot interpret wildcards on GNU tar

Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>

---------

Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
2024-02-26 10:57:53 +07:00
Louis
3c8caf3345
fix: correct vulkan settings (#2128) 2024-02-22 21:18:39 +07:00
hiento09
2f4bffdbef
Bump nitro from 0.3.12 to 0.3.13 (#2124)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-22 18:00:37 +07:00
hiento09
a71c74d468
Fix: Linux vulkan binary path (#2123)
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-22 14:48:00 +07:00
hiro
926f19bd9b
feat: Add nitro vulkan to support AMD GPU/ APU and Intel Arc GPU (#2056)
* feat: add vulkan support on windows and linux

* fix: correct vulkan settings

* fix: gpu settings and enable Vulkan support

* fix: vulkan support 1 device at a time only

* inference-nitro-extension add download vulkaninfo

---------

Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-22 11:19:36 +07:00
Louis
7fbc6cb6c0
fix: failed to bind port - nitro error message copy (#2101)
* fix: failed to bind port - nitro error message copy

* fix: copy
2024-02-20 13:54:21 +07:00
hiento09
2cbbe1bcd3
Fix bug #2005 docker blank website (#2093)
* Web: change API_BASE_URL to build time env

* Update Dockerfile and Docker Compose by adding env API_BASE_URL

* Update make clean

* INFERENCE_URL get from baseApiUrl

* Fix error settings/settings.json not found when starting the server for the first time

* Update README docker

---------

Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-02-19 23:30:59 +07:00
Louis
53006dd26e
fix: broken model.json update (#2099) 2024-02-19 22:07:12 +07:00
Hoang Ha
3af0ae1481
Merge pull request #1829 from janhq/chore/update-hub
chore: update models
2024-02-19 10:25:09 +07:00
hiro
69244e6ced
chore: Update version.txt to 0.3.12 (#2057) 2024-02-16 23:41:11 +07:00
NamH
42da19a463
fix: download multiple binaries (#2043)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-16 11:32:14 +07:00
NamH
b7e94aac02
fix: disable gpu drop down box if there's no GPU ready (#2046)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-16 10:46:42 +07:00
Louis
3412a23654
chore: prettier fix (#2019) 2024-02-15 08:38:05 +07:00
Louis
f0fd2c5a2a
fix: model path backward compatible (#2018) 2024-02-14 23:04:46 +07:00
Louis
3b51f3d1aa
chore: bump nitro 0.3.9 (#2016) 2024-02-14 16:05:27 +07:00
Louis
bcf6103da3
fix: app fails gracefully with clear error messages (#1993) 2024-02-11 15:44:53 +07:00
Louis
0db1763c2b
fix: retrieval stuck at generating response (#1988) 2024-02-11 08:27:26 +07:00
0xgokuz
875c2bc3c9
feat: Thread titles should auto-summarize Topic (#1976) 2024-02-10 19:16:42 +07:00
Louis
eb09399fbf
chore: reduce bundle size (#1970)
* chore: reduce bundle size

* chore: trimming langchainjs

* chore: trim pdf-parse
2024-02-09 19:23:56 +07:00
Louis
5890ade451
chore: server download progress + S3 (#1925)
* fix: reduce the number of api call

Signed-off-by: James <james@jan.ai>

* fix: download progress

Signed-off-by: James <james@jan.ai>

* chore: save blob

* fix: server boot up

* fix: download state not updating

Signed-off-by: James <james@jan.ai>

* fix: copy assets

* Add Dockerfile CPU for Jan Server and Jan Web

* Add Dockerfile GPU for Jan Server and Jan Web

* feat: S3 adapter

* Update check find count from ./pre-install and correct copy:asserts command

* server add bundleDependencies @janhq/core

* server add bundleDependencies @janhq/core

* fix: update success/failed download state (#1945)

* fix: update success/failed download state

Signed-off-by: James <james@jan.ai>

* fix: download model progress and state handling for both Desktop and Web

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>

* chore: refactor

* fix: load models empty first time open

* Add Docker compose

* fix: assistants onUpdate

---------

Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: NamH <NamNh0122@gmail.com>
2024-02-07 17:54:35 +07:00
hiento09
4471b2c941
feat: User Selectable GPUs and GPU-based Model Recommendations (#1730) 2024-02-06 17:31:46 +07:00
NamH
01fec49798
fix: reduce the number of api call (#1896)
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-02-05 13:13:39 +07:00
Louis
f43fae2e86
Merge pull request #1919 from janhq/main
Sync release 0.4.6 to dev
2024-02-05 10:08:39 +07:00