15 Commits

Author SHA1 Message Date
Faisal Amir
787c4ee073
fix: wrong description for cont_batching setting (#6034) 2025-08-02 21:48:43 +07:00
Akarshan Biswas
4d4cf896af
fix: Persist 'Auto-Unload Old Models' setting in llama.cpp (#5906)
The 'Auto-Unload Old Models' setting in the llama.cpp extension failed to persist due to a typo in its key name within `settings.json`. The key was incorrectly `auto_unload_models` instead of `auto_unload`.

This commit corrects the key name to `auto_unload`, ensuring that user-configured changes to this setting are properly saved, retrieved, and persist across application restarts.

This resolves the issue where the setting would revert to its previous value after being changed.
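
As an illustration (not code from the commit; the setting-entry shape here is hypothetical), the fix amounts to looking the value up under the key that actually exists in `settings.json`:

```typescript
// Hypothetical sketch of the one-key fix. Only the corrected key name
// 'auto_unload' (previously the mistyped 'auto_unload_models') comes from
// the commit; the SettingEntry shape is assumed for illustration.
const AUTO_UNLOAD_KEY = 'auto_unload' // was 'auto_unload_models'

interface SettingEntry {
  key: string
  controllerProps: { value: boolean }
}

function readAutoUnload(settings: SettingEntry[]): boolean {
  // With the mistyped key, find() returned undefined and the stored value
  // was never picked up, so the setting silently fell back to its default.
  const entry = settings.find((s) => s.key === AUTO_UNLOAD_KEY)
  return entry?.controllerProps.value ?? true
}
```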
2025-07-25 11:03:15 +05:30
Akarshan Biswas
a1af70f7a9
feat: Enhance Llama.cpp backend management with persistence (#5886)
* feat: Enhance Llama.cpp backend management with persistence

This commit introduces significant improvements to how the Llama.cpp extension manages and updates its backend installations, focusing on user preference persistence and smarter auto-updates.

Key changes include:

* **Persistent Backend Type Preference:** The extension now stores the user's preferred backend type (e.g., `cuda`, `cpu`, `metal`) in `localStorage`. This ensures that even after updates or restarts, the system attempts to use the user's previously selected backend type, if available.
* **Intelligent Auto-Update:** The auto-update mechanism has been refined to prioritize updating to the **latest version of the *currently selected backend type*** rather than always defaulting to the "best available" backend (which might change). This respects user choice while keeping the chosen backend type up-to-date.
* **Improved Initial Installation/Configuration:** For fresh installations or cases where the `version_backend` setting is invalid, the system now intelligently determines and installs the best available backend, then persists its type.
* **Refined Old Backend Cleanup:** The `removeOldBackends` function has been renamed to `removeOldBackend` and modified to specifically clean up *older versions of the currently selected backend type*, preventing the accumulation of unnecessary files while preserving other backend types the user might switch to.
* **Robust Local Storage Handling:** New private methods (`getStoredBackendType`, `setStoredBackendType`, `clearStoredBackendType`) are introduced to safely interact with `localStorage`, including error handling for potential `localStorage` access issues.
* **Version Filtering Utility:** A new utility `findLatestVersionForBackend` helps in identifying the latest available version for a specific backend type from a list of supported backends.

These changes provide a more stable, user-friendly, and maintainable backend management experience for the Llama.cpp extension.

Fixes: #5883
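
A minimal sketch of the storage helpers and version filter named above (the method names come from this commit message; the storage key, value shapes, and version ordering are assumptions):

```typescript
// Hypothetical storage key; the real key name is not given in the commit.
const STORAGE_KEY = 'llamacpp_backend_type'

function getStoredBackendType(): string | null {
  try {
    return localStorage.getItem(STORAGE_KEY)
  } catch {
    return null // e.g. storage disabled: behave as if no preference is stored
  }
}

function setStoredBackendType(backendType: string): void {
  try {
    localStorage.setItem(STORAGE_KEY, backendType)
  } catch (e) {
    console.warn('Could not persist backend type preference:', e)
  }
}

function clearStoredBackendType(): void {
  try {
    localStorage.removeItem(STORAGE_KEY)
  } catch {
    // Ignore: failing to clear an unreadable preference is harmless.
  }
}

// Pick the newest release that ships the requested backend type.
// Assumes version strings compare lexicographically (plausible for
// fixed-width tags such as llama.cpp's bXXXX builds); a real
// implementation may need a proper version comparator.
function findLatestVersionForBackend(
  supported: { version: string; backend: string }[],
  backendType: string
): string | undefined {
  return supported
    .filter((b) => b.backend === backendType)
    .map((b) => b.version)
    .sort()
    .pop()
}
```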

* fix: cortex models migration should be done once

* feat: Optimize Llama.cpp backend preference storage and UI updates

This commit refines the Llama.cpp extension's backend management by:

* **Optimizing `localStorage` Writes:** The system now only writes the backend type preference to `localStorage` if the new value is different from the currently stored one. This reduces unnecessary `localStorage` operations.
* **Ensuring UI Consistency on Initial Setup:** When a fresh installation or an invalid backend configuration is detected, the UI settings are now explicitly updated to reflect the newly determined `effectiveBackendString`, ensuring the displayed setting matches the active configuration.

These changes improve performance by reducing redundant storage operations and enhance user experience by maintaining UI synchronization with the backend state.
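
Sketched against the helpers above (the guard itself is the only behavior this commit describes; the wrapper is hypothetical):

```typescript
// Write-if-changed: skip the localStorage write when the stored preference
// is already up to date.
function setStoredBackendTypeIfChanged(backendType: string): void {
  if (getStoredBackendType() === backendType) return // value unchanged, skip write
  setStoredBackendType(backendType)
}
```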

* Revert "fix: provider settings should be refreshed on page load (#5887)"

This reverts commit ce6af62c7df4a7e7ea8c0896f307309d6bf38771.

* fix: add loader version backend llamacpp

* fix: wrong key name

* fix: model setting issues

* fix: virtual dom hub

* chore: cleanup

* chore: hide device offload setting

---------

Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Faisal Amir <urmauur@gmail.com>
2025-07-24 18:33:35 +07:00
Faisal Amir
399671488c
fix: gpu detected from backend version (#5882)
* fix: gpu detected from backend version

* chore: remove readonly props from dynamic field
2025-07-24 10:45:48 +07:00
Faisal Amir
43b7eb6e18
🐛fix: remove sampling parameters from llamacpp extension (#5871) 2025-07-23 12:13:42 +07:00
Akarshan Biswas
92703bceb2
refactor: move thinking toggle to runtime settings for dynamic control (#5800)
* refactor: move thinking toggle to runtime settings for per-message control

Replaces the static `reasoning_budget` config with a dynamic `enable_thinking` flag under `chat_template_kwargs`, allowing models like Jan-nano and Qwen3 to enable/disable thinking behavior at runtime, even mid-conversation.
Requires a UI update (see the request sketch below).

* remove engine argument
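
For illustration, a hedged sketch of what the runtime toggle looks like in a request (the `chat_template_kwargs.enable_thinking` field comes from the commit; the endpoint, port, and model name are assumptions):

```typescript
// Hypothetical request against a local llama.cpp server: thinking is toggled
// per request via chat_template_kwargs rather than fixed at model load time.
async function ask(content: string, thinking: boolean): Promise<Response> {
  return fetch('http://127.0.0.1:8080/v1/chat/completions', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      model: 'jan-nano', // assumed model name
      messages: [{ role: 'user', content }],
      chat_template_kwargs: { enable_thinking: thinking }, // flip mid-conversation
    }),
  })
}
```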
2025-07-17 20:18:24 +05:30
Akarshan Biswas
96ba42e411
feat: Add missing ctx-shift toggle (#5765)
* feat: Add missing ctx_shift

* fix typo

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>

* refine description

---------

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
2025-07-14 11:51:34 +05:30
Akarshan
ffef7b9cab
enhancement: Add custom Jinja chat template option
Adds a new configuration option `chat_template` to the Llama.cpp extension, allowing users to define a custom Jinja chat template for the model.

The template can be provided via a new input field in the settings, and if set, it will be passed to the Llama.cpp backend using the `--chat-template` argument. This enhances flexibility for users who require specific chat formatting beyond the GGUF default.

The `chat_template` field is added to the `LlamacppConfig` type and appended to the command arguments only when provided. The placeholder text shows an example of a Jinja template structure.
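
A minimal sketch of that conditional (the `chat_template` key and `--chat-template` flag are from the commit; the config and argument-building shapes are assumed):

```typescript
interface LlamacppConfig {
  chat_template?: string
  // ...other llama.cpp settings
}

// Append --chat-template only when the user supplied a custom Jinja template,
// so the GGUF's built-in template remains the default otherwise.
function buildTemplateArgs(cfg: LlamacppConfig): string[] {
  const args: string[] = []
  if (cfg.chat_template) {
    args.push('--chat-template', cfg.chat_template)
  }
  return args
}
```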
2025-07-03 23:38:16 +07:00
Akarshan
40f1fd4ffd
feat: Auto update backend implementation 2025-07-03 19:32:12 +05:30
Akarshan
0cbf35dc77
Add auto unload setting to llamacpp-extension 2025-07-02 12:28:25 +07:00
Thien Tran
1ae7c0b59a
update version/backend format. fix bugs around load() 2025-07-02 12:27:15 +07:00
Thien Tran
40cd7e962a
feat: download backend for llama.cpp extension (#5123)
* wip

* update

* add download logic

* add decompress. support delete file

* download backend upon selecting setting

* add some logging and notes

* add note on race condition

* remove then catch

* default to none backend. only download if it's not installed

* merge version and backend. fetch version from GH

* restrict scope of output_dir

* add note on unpack
2025-07-02 12:27:13 +07:00
Akarshan Biswas
742e731e96
Add --reasoning_budget option 2025-07-02 12:27:10 +07:00
Akarshan Biswas
19274f7e69
update settings 2025-07-02 12:26:39 +07:00
Thien Tran
3f082372fd
add llamacpp-extension. can list some models 2025-07-02 12:26:39 +07:00