* chore: add model.json for Llama3 and other outdated model versions
* fix: format consistency
* fix: correct folder id
* update: bump version
* add: stop words
* fix: model.json
* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json
* Update extensions/inference-nitro-extension/resources/models/llama3-8b-instruct/model.json
Based on suggested change
Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>
---------
Co-authored-by: Van-QA <van@jan.ai>
Co-authored-by: Hoang Ha <64120343+hahuyhoang411@users.noreply.github.com>
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Nikolaus Kühn <nikolaus.kuehn@commercetools.com>
* fix: move to coming soon
* fix: Q4 for consistency
* bump extension version
* bump model version
* fix: highlight unsupported tag
---------
Co-authored-by: Louis <louis@jan.ai>
* chore: extension should register its own models
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
* feat: add extension settings
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
* feat: tensorrt-llm-extension
* fix: loading
* feat: add download tensorrt llm runner
Signed-off-by: James <james@jan.ai>
* feat: switch monitoring extension from webpack to rollupjs
Signed-off-by: James <james@jan.ai>
* feat: move nvidia info update to monitoring extension
Signed-off-by: James <james@jan.ai>
* allow download tensorrt
Signed-off-by: James <james@jan.ai>
* update
Signed-off-by: James <james@jan.ai>
* allow tensorrt download based on GPU setting
Signed-off-by: James <james@jan.ai>
* update downloaded models
Signed-off-by: James <james@jan.ai>
* feat: add extension compatibility
* dynamic tensor rt engines
Signed-off-by: James <james@jan.ai>
* update models
Signed-off-by: James <james@jan.ai>
* chore: remove ts-ignore
* feat: getting installation state from extension
Signed-off-by: James <james@jan.ai>
* chore: adding type for decompress
Signed-off-by: James <james@jan.ai>
* feat: update according to Louis's comment
Signed-off-by: James <james@jan.ai>
* feat: add progress for installing extension
Signed-off-by: James <james@jan.ai>
* chore: remove args from extension installation
* fix: model download does not work properly
* fix: do not allow user to stop tensorrtllm inference
* fix: extension installed style
* fix: download tensorrt does not update state
Signed-off-by: James <james@jan.ai>
* chore: replace int4 with fp16
* feat: modal for installing extension
Signed-off-by: James <james@jan.ai>
* fix: start download immediately after pressing install
Signed-off-by: James <james@jan.ai>
* fix: error switching between engines
* feat: rename inference provider to ai engine and refactor to core
* fix: missing ulid
* fix: core bundler
* feat: add cancel extension installing
Signed-off-by: James <james@jan.ai>
* remove mocking for mac
Signed-off-by: James <james@jan.ai>
* fix: show models only when extension is ready
* add tensorrt badge for model
Signed-off-by: James <james@jan.ai>
* fix: copy
* fix: add compatible check (#2342)
* fix: add compatible check
Signed-off-by: James <james@jan.ai>
* fix: copy
* fix: font
* fix: copy
* fix: broken monitoring extension
* chore: bump engine
* fix: copy
* fix: model copy
* fix: copy
* fix: model json
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
* fix: vulkan support
* fix: installation button padding
* fix: empty script
* fix: remove hard code string
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: NamH <NamNh0122@gmail.com>
* feat: add quick ask
Signed-off-by: James <james@jan.ai>
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
* feat: add vulkan support on windows and linux
* fix: correct vulkan settings
* fix: gpu settings and enable Vulkan support
* fix: vulkan supports only one device at a time
* inference-nitro-extension: add vulkaninfo download
---------
Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Hien To <tominhhien97@gmail.com>
* Web: change API_BASE_URL to a build-time env
* Update Dockerfile and Docker Compose by adding env API_BASE_URL
* Update make clean
* Get INFERENCE_URL from baseApiUrl
* Fix settings/settings.json not found error when starting the server for the first time
* Update README docker
---------
Co-authored-by: Hien To <tominhhien97@gmail.com>
* fix: reduce the number of api calls
Signed-off-by: James <james@jan.ai>
* fix: download progress
Signed-off-by: James <james@jan.ai>
* chore: save blob
* fix: server boot up
* fix: download state not updating
Signed-off-by: James <james@jan.ai>
* fix: copy assets
* Add Dockerfile CPU for Jan Server and Jan Web
* Add Dockerfile GPU for Jan Server and Jan Web
* feat: S3 adapter
* Update find count check for ./pre-install and correct the copy:asserts command
* server add bundleDependencies @janhq/core
* server add bundleDependencies @janhq/core
* fix: update success/failed download state (#1945)
* fix: update success/failed download state
Signed-off-by: James <james@jan.ai>
* fix: download model progress and state handling for both Desktop and Web
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Louis <louis@jan.ai>
* chore: refactor
* fix: model list empty on first open
* Add Docker compose
* fix: assistants onUpdate
---------
Signed-off-by: James <james@jan.ai>
Co-authored-by: James <james@jan.ai>
Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: NamH <NamNh0122@gmail.com>