Nicholai/jan - jan - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Akarshan Biswas	07d76dc871	feat: Allow specifying mmproj path during model loading The `loadOptions` interface in `AIEngine.ts` now includes an optional `mmprojPath` property. This allows users to provide a path to their MMProject file when loading a model, which is required for certain model types. The `llamacpp-extension/src/index.ts` has been updated to pass this option to the llamacpp server if provided.	2025-07-02 12:27:13 +07:00
Thien Tran	fbfaaf43c5	download CUDA libs if needed	2025-07-02 12:27:13 +07:00
Thien Tran	40cd7e962a	feat: download backend for llama.cpp extension (#5123 ) * wip * update * add download logic * add decompress. support delete file * download backend upon selecting setting * add some logging and nootes * add note on race condition * remove then catch * default to none backend. only download if it's not installed * merge version and backend. fetch version from GH * restrict scope of output_dir * add note on unpack	2025-07-02 12:27:13 +07:00
Akarshan Biswas	da23673a44	feat: Add API key generation for Llama.cpp This commit introduces API key generation for the Llama.cpp extension. The API key is now generated on the server side using HMAC-SHA256 and a secret key to ensure security and uniqueness. The frontend now passes the model ID and API secret to the server to generate the key. This addresses the requirement for secure model access and authorization.	2025-07-02 12:27:12 +07:00
Akarshan Biswas	d6edb1e944	If checking for proper ctx_len settings after refactoring	2025-07-02 12:27:12 +07:00
Thien Tran	39bb3f34d6	patch failing calls to cortex	2025-07-02 12:27:12 +07:00
Akarshan Biswas	31971e7821	(WIP)randomly generate api-key hash each session	2025-07-02 12:27:12 +07:00
Akarshan Biswas	1dd762f0cf	remove parseGGUFFileName function as it is not used	2025-07-02 12:27:12 +07:00
Akarshan Biswas	7481fae0df	remove ununsed imports and remove n_ctx key from loadOptions	2025-07-02 12:27:11 +07:00
Akarshan Biswas	77d861f56f	Fixup: change key to ctx_size to align with upstream and remove duplicate key	2025-07-02 12:27:11 +07:00
Thien Tran	d5c07acdb5	feat: add `LlamacppConfig` for llama.cpp extension to improve settings (#5121 ) * add engine settings * update load options * rename variable	2025-07-02 12:27:11 +07:00
Thien Tran	9bb4deeb78	update model config (import and list)	2025-07-02 12:27:11 +07:00
Thien Tran	77f6770333	update fileStat()	2025-07-02 12:27:10 +07:00
Akarshan Biswas	742e731e96	Add --reasoning_budget option	2025-07-02 12:27:10 +07:00
Akarshan Biswas	fe457a5368	slight modelbasepath refactoring	2025-07-02 12:27:10 +07:00
Akarshan Biswas	c5a0ee7f6e	refactor unload and implement a destructor to clean up sessions	2025-07-02 12:27:10 +07:00
Thien Tran	cd36b423b6	add basic model list	2025-07-02 12:27:10 +07:00
Thien Tran	d523166b61	implement delete	2025-07-02 12:27:09 +07:00
Thien Tran	ded9ae733a	feat: Model import (download + local import) for llama.cpp extension (#5087 ) * add pull and abortPull * add model import (download only) * write model.yaml. support local model import * remove cortex-related command * add TODO * remove cortex-related command	2025-07-02 12:27:09 +07:00
Akarshan Biswas	a7a2dcc8d8	refactor load/unload again; move types to core and refactor AIEngine abstract class	2025-07-02 12:27:09 +07:00
Akarshan Biswas	0e9a8a27e5	fixup from refactoring	2025-07-02 12:27:08 +07:00
Akarshan Biswas	bbbf4779df	refactor load/unload	2025-07-02 12:27:08 +07:00
Akarshan Biswas	021f8ae80f	Fixup: llama-server load	2025-07-02 12:27:08 +07:00
Akarshan Biswas	ed6f86d4b1	Change scripts to download only llama.cpp engine	2025-07-02 12:27:07 +07:00
Akarshan Biswas	a8abc9f9aa	Resolved conflicts by keeping HEAD changes	2025-07-02 12:27:07 +07:00
Akarshan Biswas	19274f7e69	update settings	2025-07-02 12:26:39 +07:00
Thien Tran	3f082372fd	add llamacpp-extension. can list some models	2025-07-02 12:26:39 +07:00
Louis	5caf9cc690	Merge pull request #5607 from menloresearch/dependabot/npm_and_yarn/extensions/download-extension/rimraf-6.0.1 chore(deps-dev): bump rimraf from 3.0.2 to 6.0.1 in /extensions/download-extension	2025-07-01 01:09:45 +07:00
Louis	9917716cc9	Merge pull request #5608 from menloresearch/dependabot/npm_and_yarn/extensions/conversational-extension/rimraf-6.0.1 chore(deps-dev): bump rimraf from 3.0.2 to 6.0.1 in /extensions/conversational-extension	2025-07-01 01:09:28 +07:00
dependabot[bot]	c358831808	chore(deps-dev): bump rimraf in /extensions/assistant-extension --- updated-dependencies: - dependency-name: rimraf dependency-version: 6.0.1 dependency-type: direct:development update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-30 08:51:12 +00:00
dependabot[bot]	1c9a64855e	chore(deps-dev): bump rimraf in /extensions/conversational-extension --- updated-dependencies: - dependency-name: rimraf dependency-version: 6.0.1 dependency-type: direct:development update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-30 08:49:18 +00:00
dependabot[bot]	3e22835049	chore(deps-dev): bump rimraf in /extensions/download-extension --- updated-dependencies: - dependency-name: rimraf dependency-version: 6.0.1 dependency-type: direct:development update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2025-06-30 08:44:31 +00:00
Louis	e1b6690763	Merge branch 'dev' into release/v0.6.3 # Conflicts: # web-app/src/containers/DropdownModelProvider.tsx # web-app/src/containers/SettingsMenu.tsx # web-app/src/containers/dialogs/DeleteModel.tsx # web-app/src/routes/__root.tsx # web-app/src/routes/settings/general.tsx # web-app/src/routes/threads/$threadId.tsx	2025-06-26 21:55:35 +07:00
Louis	c5aac9270c	Update extensions/inference-cortex-extension/src/index.ts Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-06-26 19:36:07 +07:00
Louis	d75db73a43	fix: handle context shift setting from old app versions	2025-06-26 19:32:34 +07:00
Faisal Amir	f5cfe8a537	✨enhancement: Added jan-nano-128k	2025-06-25 15:56:38 +07:00
dependabot[bot]	51b6c0a7de	chore(deps-dev): bump typescript in /extensions/download-extension (#5405 ) Bumps [typescript](https://github.com/microsoft/TypeScript) from 5.3.3 to 5.8.3. - [Release notes](https://github.com/microsoft/TypeScript/releases) - [Changelog](https://github.com/microsoft/TypeScript/blob/main/azure-pipelines.release-publish.yml) - [Commits](https://github.com/microsoft/TypeScript/compare/v5.3.3...v5.8.3) --- updated-dependencies: - dependency-name: typescript dependency-version: 5.8.3 dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-06-24 14:54:11 +07:00
Louis	52bb54d2d5	fix: glitch model download issue (#5322 )	2025-06-17 23:01:22 +07:00
Louis	86079074d2	🐛fix: update default extension settings (#5315 ) * fix: update default extension settings * chore: hide language setting on Prod	2025-06-17 17:37:31 +07:00
Louis	2899d58ad7	🐛fix: llama.cpp default NGL setting does not offload all layers to GPU (#5310 ) * 🐛fix: llama.cpp default NGL setting does not offload all layers to GPU * chore: cover more cases * chore: clean up * fix: should not show GPU section on Mac	2025-06-17 15:30:07 +07:00
Louis	9ab69b157b	🐛fix: Jan-nano repo name changed (#5274 )	2025-06-15 11:17:41 +07:00
Louis	2634659366	🐛fix: default onboarding model should use recommended quantizations (#5273 ) * 🐛fix: default onboarding model should use recommended quantizations * ✨enhancement: show context shift option in provider settings * 🔧chore: wording	2025-06-15 02:34:14 +07:00
Louis	a10dbef2c8	✨enhancement: Model sources should not be hard coded from frontend (#5270 )	2025-06-14 22:00:43 +07:00
Louis	1e17cc6ec7	✨enhancement: model run improvement (#5268 ) * fix: mcp tool error handling * fix: error message * fix: trigger download from recommend model * fix: can't scroll hub * fix: show progress * ✨enhancement: prompt users to increase context size * ✨enhancement: rearrange action buttons for a better UX * 🔧chore: clean up logics --------- Co-authored-by: Faisal Amir <urmauur@gmail.com>	2025-06-14 16:32:15 +07:00
Louis	23240f958e	Merge pull request #5262 from menloresearch/chore/sync-new-hub-data chore: sync new hub data	2025-06-13 17:05:49 +07:00
Louis	27c4918395	fix: default settings should leave empty (#5257 ) * fix: default settings should leave empty * fix: default settings * fix: remove some more default settings * fix: threads and cont * fix: data * fix: default setting * fix: settings * chore: bump cortex version * chore: bump to cortex 1.0.14 * chore: clean up * typoe * chore: fix dialog hang * fix: default parameter * chore: truncate edit model title * chore: update default provider settings * chore: fix typo --------- Co-authored-by: Faisal Amir <urmauur@gmail.com>	2025-06-13 00:01:25 +07:00
Louis	51a321219d	chore: fix model settings are not applied accordingly on change (#5231 ) * chore: fix model settings are not applied accordingly on change * chore: handle failed tool call * chore: stop inference and model on reject	2025-06-10 16:26:42 +07:00
Louis	6d6790d5e0	fix: clean up cortex processes (#5215 ) * fix: clean up cortex processes * chore: clean up * chore: does not need output shell	2025-06-09 22:23:07 +07:00
Louis	b85bce8e09	fix: migrate provider settings (#5205 ) * chore: fix migrate api key * chore: add logs * chore: deprecate migration step * chore: clean up * fix: key apply	2025-06-05 20:25:43 +07:00
Louis	6faca3e732	refactor: remove JS server package (#5192 ) * refactor: remove js server package * chore: migrate HF token data	2025-06-04 15:33:35 +07:00

1 2 3 4 5 ...

691 Commits