Nicholai/jan - jan - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Akarshan Biswas	1f1605bdf9	feat: Add support for overriding tensor buffer type (#6062 ) * feat: Add support for overriding tensor buffer type This commit introduces a new configuration option, `override_tensor_buffer_t`, which allows users to specify a regex for matching tensor names to override their buffer type. This is an advanced setting primarily useful for optimizing the performance of large models, particularly Mixture of Experts (MoE) models. By overriding the tensor buffer type, users can keep critical parts of the model, like the attention layers, on the GPU while offloading other parts, such as the expert feed-forward networks, to the CPU. This can lead to significant speed improvements for massive models. Additionally, this change refines the error message to be more specific when a model fails to load. The previous message "Failed to load llama-server" has been updated to "Failed to load model" to be more accurate. * chore: update FE to suppoer override-tensor --------- Co-authored-by: Faisal Amir <urmauur@gmail.com>	2025-08-07 10:31:34 +05:30
Faisal Amir	5d001dfd5a	✨feat: jinja template customize per model instead provider level (#6053 )	2025-08-05 21:21:41 +07:00
Louis	48004024ee	Merge pull request #6020 from cmppoon/fix-mcp-servers-edit-json fix connected servers status not in sync when edit mcp json	2025-08-05 11:06:05 +07:00
Faisal Amir	641df474fd	fix: Generate A Response button does not show context size error dialog (#6029 ) * fix: Generate A Response button does not show context size error dialog * chore: remove as a child button params	2025-08-05 08:34:06 +07:00
Chaiyapruek Muangsiri	477651e5d5	fix connected servers status not in sync when edit mcp json	2025-08-05 08:08:59 +07:00
Faisal Amir	787c4ee073	fix: wrong desc setting cont_batching (#6034 )	2025-08-02 21:48:43 +07:00
Faisal Amir	3acb61b5ed	fix: react state loop from hooks useMediaQuery (#6031 ) * fix: react state loop from hooks useMediaQuerry * chore: update test cases hooks media query	2025-08-02 21:48:40 +07:00
Louis	9573329d06	Merge pull request #6004 from menloresearch/release/v0.6.6 Sync release/v0.6.6 into dev	2025-07-31 21:34:52 +07:00
Louis	4bcfa84d75	Merge pull request #6008 from menloresearch/hotfix/regression-issue-with-colon-in-model-name hotfix: regression issue with colon in model name	2025-07-31 17:55:28 +07:00
Louis	25fa4901c2	Merge pull request #5997 from menloresearch/release/v0.6.6 Sync Release/v0.6.6 into dev	2025-07-31 10:25:09 +07:00
Louis	76bcf33f80	fix: generate response button disappear on tool call (#5988 ) * fix: generate a response button should appear when an incomplete tool call message is present * fix: wording * fix: do not send duplicate messages on regenerating * fix: tests	2025-07-30 21:04:12 +07:00
cmuangs	d2f99c36f5	fix thread sorting issue (#5976 )	2025-07-30 18:15:29 +07:00
Faisal Amir	63cb4fbf3b	fix: assistant with last used and fix metadata (#5955 ) * fix: assistant with last used and fix metadata * chore: revert instruction and desc * chore: fix current assistant state * chore: updae metadata message assistant * chore: update test case	2025-07-29 09:50:07 +07:00
Faisal Amir	1c74bfd5ef	fix: update edge case experimental feature MCP (#5951 ) * fix: update edge case experimental feature MCP * Update web-app/src/routes/settings/mcp-servers.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-07-28 21:31:51 +07:00
Louis	fdaa3b1992	fix: openrouter unselect itself (#5943 ) * fix: selected openrouter model does not work * test: add tests to cover new change	2025-07-28 10:33:23 +07:00
Louis	1fc37a9349	fix: migrate app settings to the new version (#5936 ) * fix: migrate app settings to the new version * fix: edge cases * fix: migrate HF import model on Windows * fix hardware page broken after downgraded * test: correct test * fix: backward compatible hardware info	2025-07-27 21:13:05 +07:00
Faisal Amir	54d44ce741	fix: update default GPU toggle, and simplify state (#5937 )	2025-07-27 14:36:08 +07:00
Faisal Amir	7dec980630	fix: persist model capabilities refresh app (#5918 )	2025-07-25 20:27:51 +07:00
Louis	0c53ad0e16	fix: models hub should show latest data only (#5925 ) * fix: models hub should show latest data only * test: correct expected result	2025-07-25 17:34:14 +07:00
Akarshan Biswas	a1af70f7a9	feat: Enhance Llama.cpp backend management with persistence (#5886 ) * feat: Enhance Llama.cpp backend management with persistence This commit introduces significant improvements to how the Llama.cpp extension manages and updates its backend installations, focusing on user preference persistence and smarter auto-updates. Key changes include: * Persistent Backend Type Preference: The extension now stores the user's preferred backend type (e.g., `cuda`, `cpu`, `metal`) in `localStorage`. This ensures that even after updates or restarts, the system attempts to use the user's previously selected backend type, if available. * Intelligent Auto-Update: The auto-update mechanism has been refined to prioritize updating to the *latest version of the currently selected backend type*** rather than always defaulting to the "best available" backend (which might change). This respects user choice while keeping the chosen backend type up-to-date. * Improved Initial Installation/Configuration: For fresh installations or cases where the `version_backend` setting is invalid, the system now intelligently determines and installs the best available backend, then persists its type. * Refined Old Backend Cleanup: The `removeOldBackends` function has been renamed to `removeOldBackend` and modified to specifically clean up older versions of the currently selected backend type, preventing the accumulation of unnecessary files while preserving other backend types the user might switch to. * Robust Local Storage Handling: New private methods (`getStoredBackendType`, `setStoredBackendType`, `clearStoredBackendType`) are introduced to safely interact with `localStorage`, including error handling for potential `localStorage` access issues. * Version Filtering Utility: A new utility `findLatestVersionForBackend` helps in identifying the latest available version for a specific backend type from a list of supported backends. These changes provide a more stable, user-friendly, and maintainable backend management experience for the Llama.cpp extension. Fixes: #5883 * fix: cortex models migration should be done once * feat: Optimize Llama.cpp backend preference storage and UI updates This commit refines the Llama.cpp extension's backend management by: * Optimizing `localStorage` Writes: The system now only writes the backend type preference to `localStorage` if the new value is different from the currently stored one. This reduces unnecessary `localStorage` operations. * Ensuring UI Consistency on Initial Setup: When a fresh installation or an invalid backend configuration is detected, the UI settings are now explicitly updated to reflect the newly determined `effectiveBackendString`, ensuring the displayed setting matches the active configuration. These changes improve performance by reducing redundant storage operations and enhance user experience by maintaining UI synchronization with the backend state. * Revert "fix: provider settings should be refreshed on page load (#5887)" This reverts commit ce6af62c7df4a7e7ea8c0896f307309d6bf38771. * fix: add loader version backend llamacpp * fix: wrong key name * fix: model setting issues * fix: virtual dom hub * chore: cleanup * chore: hide device ofload setting --------- Co-authored-by: Louis <louis@jan.ai> Co-authored-by: Faisal Amir <urmauur@gmail.com>	2025-07-24 18:33:35 +07:00
Faisal Amir	399671488c	fix: gpu detected from backend version (#5882 ) * fix: gpu detected from backend version * chore: remove readonly props from dynamic field	2025-07-24 10:45:48 +07:00
Louis	6599d91660	fix: bring back HF repo ID search in Hub (#5880 ) * fix: bring back HF search input * test: fix useModelSources tests for updated addSource signature	2025-07-24 09:46:13 +07:00
Louis	d6ad797769	fix: llama.cpp backend shows blank list sometime (#5876 )	2025-07-23 20:04:38 +07:00
Louis	af116dd7dc	fix: jan should have a general assistant instruction (#5872 ) * fix: default Jan assistant prompt * test: update tests	2025-07-23 13:55:20 +07:00
Faisal Amir	fd26270e78	🐛fix/update vulkan active syntax (#5869 )	2025-07-23 11:45:54 +07:00
Louis	3e30c61fb0	fix: app should refresh local provider models list on launch (#5868 )	2025-07-23 08:36:09 +07:00
Faisal Amir	1d443e1f7d	fix: support load model configurations (#5843 ) * fix: support load model configurations * chore: remove log * chore: sampling params add from send completion * chore: remove comment * chore: remove comment on predefined file * chore: update test model service	2025-07-22 19:52:12 +07:00
Faisal Amir	7b3b6cc8be	🐛fix: delete all should not include fav thread (#5864 )	2025-07-22 19:51:59 +07:00
Faisal Amir	25952f293c	✨enhancement: auto focus always allow action from tool approval dialog and add req parameters (#5836 ) * ✨enhancement: auto focus always allow action from tool approval dialog * chore: error handling tools parameters * chore: update test button focus cases	2025-07-22 12:17:53 +07:00
Louis	05b9d4e9fd	feat: add claude-4 (#5829 ) * feat: add claude-4 * fix: sorting order	2025-07-21 12:30:56 +07:00
Louis	bc4fe52f8d	fix: llama.cpp integration model load and chat experience (#5823 ) * fix: stop generating should not stop running models * fix: ensure backend ready before loading model * fix: backend setting should not block onLoad	2025-07-21 09:29:26 +07:00
Akarshan	59ad2eb784	Merge branch 'dev' into release/v0.6.6	2025-07-18 18:29:20 +05:30
Louis	8d84c3b884	feat: add model load error handling to improve UX (#5802 ) * feat: model load error handling * chore: clean up * test: add tests * fix: provider name	2025-07-18 08:25:54 +05:30
Louis	3eaa3424e1	fix: fetch models from custom provider causes app to crash	2025-07-16 15:36:45 +07:00
Louis	9872a6e82a	test: add missing unit tests	2025-07-15 22:29:28 +07:00
Louis	03bcd02002	test: add missing unit tests	2025-07-12 22:46:27 +07:00
Louis	864ad50880	test: add missing tests	2025-07-12 21:29:51 +07:00
Louis	c5fd964bf2	test: add missing tests	2025-07-12 20:15:45 +07:00
Louis	b8259e7794	feat: add HF token setting	2025-07-11 00:05:52 +07:00
Louis	ca6f4f8977	test: fix failed tests	2025-07-10 16:25:47 +07:00
Faisal Amir	1422d94fac	🐛fix: make three dots default show 3 dots and can trigger with right click (#5712 ) * 🐛fix: default show 3 dots * ✨enhancement: enable resizable left panel (#5713) * ✨enhancement: enable resizable left panel * Update web-app/src/hooks/useLeftPanel.ts Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-07-07 11:14:43 +07:00
Faisal Amir	a0be23b500	enhancement: show readme on detail each model (#5705 ) * 🧹cleanup: linter and log * Update web-app/src/routes/hub/$modelId.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-07-07 09:54:16 +07:00
Faisal Amir	19fc399ae1	enhancement: gpu list based on backend	2025-07-03 23:18:50 +07:00
Louis	66bae2adb8	chore: clean up	2025-07-02 12:29:02 +07:00
Louis	9b730058b4	feat: use hardware information api	2025-07-02 12:29:02 +07:00
Louis	c6ac9f1d2a	feat: sync hub with model catalog	2025-07-02 12:29:01 +07:00
Louis	8bd4a3389f	refactor: frontend uses new engine extension # Conflicts: # extensions/model-extension/resources/default.json # web-app/src/containers/dialogs/DeleteProvider.tsx # web-app/src/routes/hub.tsx	2025-07-02 12:28:24 +07:00
Louis	7223f6fc3f	Merge pull request #5552 from menloresearch/dev sync: apply latest changes into release/v0.6.4	2025-06-26 09:02:27 -07:00
Louis	16aab0d661	fix: increase context size window does not popup first time	2025-06-26 16:40:55 +07:00
Faisal Amir	9bbf9a590c	✨enhancement: support base layout responsive UI (#5472 ) * ✨enhancement: support base layout responsive UI * Update web-app/src/containers/LeftPanel.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update web-app/src/containers/ThreadList.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * ✨enhancement: responsive assistant screen (#5502) * ✨enhancement: support base layout responsive UI * Update web-app/src/containers/LeftPanel.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update web-app/src/containers/ThreadList.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * ✨enhancement: responsive assistant screen * Update web-app/src/containers/dialogs/AddEditAssistant.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * ✨enhancement: sort assistant * Update web-app/src/routes/assistant.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * ✨enhancement: responsive hub screen (#5507) * ✨enhancement: support base layout responsive UI * Update web-app/src/containers/LeftPanel.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update web-app/src/containers/ThreadList.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * ✨enhancement: responsive assistant screen * Update web-app/src/containers/dialogs/AddEditAssistant.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * ✨enhancement: sort assistant * Update web-app/src/routes/assistant.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * ✨enhancement: responsive hub screen * 🧹cleanup: multiple key and useless for hub translation --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-06-26 15:01:50 +07:00

1 2 3

134 Commits