Louis
02c49e796d
fix: race condition issue - reading settings.json file ( #2683 )
...
* fix: race condition issue - reading settings.json file
* fix: cannot reset data while starting model
* chore: remove extension suffix
2024-04-11 15:37:46 +07:00
Louis
c0949b2d7e
fix: better kill process tensorrt-llm ( #2681 )
2024-04-11 12:47:41 +07:00
hiento09
ebdaaa6c10
bump nitro version to 0.3.21 ( #2680 )
...
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-11 12:08:39 +07:00
Louis
065ed03099
fix: wrong monitoring system information type ( #2679 )
2024-04-11 11:07:31 +07:00
NamH
ddb73d8131
fix: can't read the setting at first time ( #2677 )
...
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-11 10:56:47 +07:00
Louis
b19234ed71
chore: Extension should have product name in manifest ( #2675 )
...
* chore: Extension should have product name in manifest
* chore: typo
2024-04-11 09:50:58 +07:00
Louis
d93d74c86b
feat: nitro additional dependencies ( #2674 )
2024-04-11 09:13:02 +07:00
NamH
8917be5ef3
fix: add fallback as default endpoint for inference engine ( #2669 )
...
Co-authored-by: James <james@jan.ai>
2024-04-10 18:15:20 +07:00
Louis
3f23de6c28
feat: move log into monitoring extension ( #2662 )
2024-04-10 14:35:15 +07:00
hiento09
2931a46799
Bump nitro to 0.3.19 ( #2663 )
...
Bump nitro to 0.3.19
2024-04-09 22:43:23 +07:00
hiento09
5fd6025175
Bump nitro version to 0.3.18 ( #2652 )
2024-04-09 12:31:10 +07:00
Louis
9479beb7d1
fix: unload model while loading cause unknown error ( #2649 )
...
* fix: unload model while loading cause unknown error
* chore: mask placeholder
2024-04-09 11:31:42 +07:00
Inchoker
1244f03f66
feat: mistral inference engine extension ( #2569 )
...
* Add new feat: Inference Mistral extension
* change settings
* nitpicking fix
* fix model position and add mistral registerModel
* remove irrelevant changes
* change desc of mistral medium
Co-authored-by: Louis <louis@jan.ai>
* change desc of mistral small
Co-authored-by: Louis <louis@jan.ai>
* change desc of mistral large
Co-authored-by: Louis <louis@jan.ai>
* remove unpopular mistral model
* replace placeholder
* sort remaining models using size
---------
Co-authored-by: Jack Tri Le <Jack>
Co-authored-by: Louis <louis@jan.ai>
2024-04-09 11:18:03 +07:00
Louis
f8cf93a906
chore: add GPU driver and toolkit status ( #2628 )
2024-04-08 09:50:16 +07:00
hiento09
b3c8bab153
Correct condition checking cuda dependencies windows ( #2629 )
...
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-05 16:19:10 +07:00
NamH
e0d6049d66
chore: extension should register its own models ( #2601 )
...
* chore: extension should register its own models
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-05 14:18:58 +07:00
Louis
1eaf13b13e
fix: cancel loading model with stop action ( #2607 )
2024-04-04 10:57:54 +07:00
NamH
402f85f179
chore: update place holder for api key of groq and openai ( #2588 )
...
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-03 22:19:55 +07:00
hiento09
beb3473d4e
Move out from cloudflare r2 to aws s3 ( #2596 )
...
* Move out from cloudflare r2 to aws s3
* Remove clean cloudflare jobs
---------
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-03 16:10:01 +07:00
hiento09
a6cbc0b86f
Change release download url to cloudflare worker proxy and update download model tensorrt llm to aws s3 endpoint ( #2576 )
...
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-02 17:08:53 +07:00
hiento09
7feaf0b3bd
Change npm registry to nexus for CI test and enable turbo remote cache ( #2535 )
...
* Change npm registry to nexus for CI test
* Change npm registry to nexus for CI test
* Add yarn.lock
* Remove clean step
* Revert to disable yarn.lock file
* Turn NPM Proxy to env
---------
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-04-02 15:34:26 +07:00
NamH
345c7d58e6
chore: some wordings in extension settings ( #2573 )
...
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-04-02 15:31:20 +07:00
Louis
f6d3b53ab5
Merge branch 'main' into dev
...
# Conflicts:
# web/screens/Chat/ErrorMessage/index.tsx
2024-04-02 11:09:59 +07:00
Hoang Ha
30f34a41b7
Hotfix: model hub ID mismatch ( #2557 )
2024-04-01 16:20:37 +07:00
Hoang Ha
3e8ad2bde0
fix: Update package.json
2024-04-01 10:56:05 +07:00
Louis
228a363914
fix: image model does not work when retrieval tool is enabled ( #2538 )
2024-03-29 16:07:49 +07:00
NamH
fa35aa6e14
feat: dynamically register extension settings ( #2494 )
...
* feat: add extesion settings
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-29 15:44:46 +07:00
Louis
3b3eb119f0
fix: duplicate api definition ( #2522 )
2024-03-28 11:45:08 +07:00
Louis
75eea1fdb2
Merge branch 'dev'
...
# Conflicts:
# core/src/browser/core.ts
# core/src/browser/extensions/monitoring.ts
# core/src/browser/fs.ts
# core/src/extensions/ai-engines/LocalOAIEngine.ts
# extensions/monitoring-extension/src/node/index.ts
# extensions/tensorrt-llm-extension/src/index.ts
# extensions/tensorrt-llm-extension/src/node/index.ts
# web/hooks/useSendChatMessage.ts
2024-03-28 10:46:05 +07:00
NamH
5eed8a5eca
fix: rag is not working for nitro ( #2511 )
...
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-27 11:17:43 +07:00
Louis
84e1b09e84
fix: error invoking remote method readdirsync ( #2505 )
2024-03-27 05:51:42 +07:00
Louis
7857a6e75e
fix: upload document mid-thread does not work ( #2504 )
2024-03-26 22:22:54 +07:00
Louis
8e8dfd4b37
refactor: introduce inference tools ( #2493 )
2024-03-25 23:26:05 +07:00
Louis
9551996e34
chore: load, unload model and inference synchronously
2024-03-25 12:25:30 +07:00
NamH
67e285fa96
chore: remove rmdirsync from core api since it is deprecated ( #2459 )
...
* chore: remove rmdirsync from core api since it is deprecated
Signed-off-by: James <james@jan.ai>
* chore: remove mkdirsync
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-22 17:57:16 +07:00
Louis
3c0383f6d8
fix: app raises port not available error ( #2466 )
2024-03-22 17:53:33 +07:00
Louis
254a79ccbe
fix: turborepo extensions ( #2392 )
...
* fix: turborepo extensions
Update package.json
Update Makefile
Update Makefile
Update Makefile
Update Makefile
Update Makefile
Update package.json
* chore: turbo cache
* fix: install extensions in parallel
* fix: timeout issue
* Turbo cache using s3
* Remove cache task
---------
Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: Service Account <service@jan.ai>
2024-03-22 17:53:20 +07:00
Louis
b8cee875b1
fix: app shows wrong toast on stopping inference ( #2460 )
2024-03-22 14:40:15 +07:00
Louis
c2f6330daf
chore: log system information for debugging ( #2453 )
2024-03-22 12:34:44 +07:00
Louis
acbec78dbf
fix: refactor inference engines to extends AIEngine ( #2347 )
...
* fix: refactor nitro to extends localoaiengine
* fix: refactor openai extension
* chore: refactor groq extension
* chore: refactor triton tensorrt extension
* chore: add tests
* chore: refactor engines
2024-03-22 09:35:14 +07:00
Louis
ff7ec39915
fix: incompatible browser dependency ( #2439 )
...
* fix: incompatible browser dependency
* fix: update model extension to use rollup
* fix: test timeout
2024-03-21 16:54:42 +07:00
NamH
b8d86df688
Fix/unable factory reset windows nitro running ( #2422 )
...
* fix: unable to factory reset when nitro is running on windows
---------
Signed-off-by: James <james@jan.ai>
2024-03-19 18:05:03 +07:00
Louis
489e8aab24
Sync release 0.4.9 to dev ( #2407 )
...
* fix: move tensorrt executable to engine (#2400 )
* fix: move tensorrt executable to engine
Signed-off-by: James <james@jan.ai>
* some update
Signed-off-by: hiro <hiro@jan.ai>
* chore: bump tensorrt version
* fix: wrong destroy path
* fix: install extensions in parallel
* chore: update path for tensorrt engine (#2404 )
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
* Release/v0.4.9 (#2421 )
* fix: turn off experimental settings should also turn off quick ask (#2411 )
* fix: app glitches 1s generating response before starting model (#2412 )
* fix: disable experimental feature should also disable vulkan (#2414 )
* fix: model load stuck on windows when can't get CPU core count (#2413 )
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
* feat: TensorRT-LLM engine update support (#2415 )
* fix: engine update
* chore: add remove prepopulated models
Signed-off-by: James <james@jan.ai>
* update tinyjensen url
Signed-off-by: James <james@jan.ai>
* update llamacorn
Signed-off-by: James <james@jan.ai>
* update Mistral 7B Instruct v0.1 int4
Signed-off-by: James <james@jan.ai>
* update tensorrt
Signed-off-by: James <james@jan.ai>
* update
Signed-off-by: hiro <hiro@jan.ai>
* update
Signed-off-by: James <james@jan.ai>
* prettier
Signed-off-by: James <james@jan.ai>
* update mistral config
Signed-off-by: James <james@jan.ai>
* fix some lint
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
* Tensorrt LLM disable turing support (#2418 )
Co-authored-by: Hien To <tominhhien97@gmail.com>
* chore: add prompt template tensorrtllm (#2375 )
* chore: add prompt template tensorrtllm
* Add Prompt template for mistral and correct model metadata
---------
Co-authored-by: Hien To <tominhhien97@gmail.com>
* fix: correct tensorrt mistral model.json (#2419 )
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-03-19 12:20:09 +07:00
NamH
3a3bceb0c0
Release/v0.4.9 ( #2421 )
...
* fix: turn off experimental settings should also turn off quick ask (#2411 )
* fix: app glitches 1s generating response before starting model (#2412 )
* fix: disable experimental feature should also disable vulkan (#2414 )
* fix: model load stuck on windows when can't get CPU core count (#2413 )
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
* feat: TensorRT-LLM engine update support (#2415 )
* fix: engine update
* chore: add remove prepopulated models
Signed-off-by: James <james@jan.ai>
* update tinyjensen url
Signed-off-by: James <james@jan.ai>
* update llamacorn
Signed-off-by: James <james@jan.ai>
* update Mistral 7B Instruct v0.1 int4
Signed-off-by: James <james@jan.ai>
* update tensorrt
Signed-off-by: James <james@jan.ai>
* update
Signed-off-by: hiro <hiro@jan.ai>
* update
Signed-off-by: James <james@jan.ai>
* prettier
Signed-off-by: James <james@jan.ai>
* update mistral config
Signed-off-by: James <james@jan.ai>
* fix some lint
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
* Tensorrt LLM disable turing support (#2418 )
Co-authored-by: Hien To <tominhhien97@gmail.com>
* chore: add prompt template tensorrtllm (#2375 )
* chore: add prompt template tensorrtllm
* Add Prompt template for mistral and correct model metadata
---------
Co-authored-by: Hien To <tominhhien97@gmail.com>
* fix: correct tensorrt mistral model.json (#2419 )
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: hiento09 <136591877+hiento09@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
2024-03-19 10:06:47 +07:00
NamH
c81a33f382
fix: move tensorrt executable to engine ( #2400 )
...
* fix: move tensorrt executable to engine
Signed-off-by: James <james@jan.ai>
* some update
Signed-off-by: hiro <hiro@jan.ai>
* chore: bump tensorrt version
* fix: wrong destroy path
* fix: install extensions in parallel
* chore: update path for tensorrt engine (#2404 )
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Signed-off-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
2024-03-18 07:38:35 +07:00
Meta Spartan
0348aa3321
feat: Groq Inference Extension ( #2263 )
...
* feat: Groq Inference Extension
* Add Groq supported models
* Fix folder typo
* Add Groq options to interface and new API Key saving, tested working
* Fix linting
2024-03-18 06:40:20 +07:00
NamH
ed6bd14e02
chore: temporary remove linux from tensorrt support ( #2386 )
...
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-15 23:02:42 +07:00
NamH
5f19983de1
fix: some regressions for tensorrt nightly build ( #2380 )
...
* fix: some regressions for tensorrt nightly build
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: hiro <hiro@jan.ai>
Signed-off-by: James <james@jan.ai>
Co-authored-by: hiro <hiro@jan.ai>
Co-authored-by: James <james@jan.ai>
2024-03-15 17:45:56 +07:00
Louis
2d622614bf
Update models.json ( #2382 )
2024-03-15 16:37:00 +07:00
Louis
58e12f35c9
fix: wrong engine handling ( #2363 )
2024-03-14 23:59:42 +07:00