Nicholai/jan - jan

Author	SHA1	Message	Date
Faisal Amir	ba4dc6d1eb	enhancement: update ui dialog update llamacpp backend	2025-09-11 09:52:09 +05:30
Akarshan Biswas	7a174e621a	feat: Smart model management (#6390) * feat: Smart model management * New UI option – `memory_util` added to `settings.json` with a dropdown (high / medium / low) to let users control how aggressively the engine uses system memory. * Configuration updates – `LlamacppConfig` now includes `memory_util`; the extension class stores it in a new `memoryMode` property and handles updates through `updateConfig`. * System memory handling * Introduced `SystemMemory` interface and `getTotalSystemMemory()` to report combined VRAM + RAM. * Added helper methods `getKVCachePerToken`, `getLayerSize`, and a new `ModelPlan` type. * Smart model‑load planner – `planModelLoad()` computes: * Number of GPU layers that can fit in usable VRAM. * Maximum context length based on KV‑cache size and the selected memory utilization mode (high/medium/low). * Whether KV‑cache must be off‑loaded to CPU and the overall loading mode (GPU, Hybrid, CPU, Unsupported). * Detailed logging of the planning decision. * Improved support check – `isModelSupported()` now: * Uses the combined VRAM/RAM totals from `getTotalSystemMemory()`. * Applies an 80% usable‑memory heuristic. * Returns GREEN only when both weights and KV‑cache fit in VRAM, YELLOW when they fit only in total memory or require CPU off‑load, and RED when the model cannot fit at all. * Cleanup – Removed unused `GgufMetadata` import; updated imports and type definitions accordingly. * Documentation/comments – Added explanatory JSDoc comments for the new methods and clarified the return semantics of `isModelSupported`. * chore: migrate no_kv_offload from llamacpp setting to model setting * chore: add UI auto optimize model setting * feat: improve model loading planner with mmproj support and smarter memory budgeting * Extend `ModelPlan` with optional `noOffloadMmproj` flag to indicate when a multimodal projector can stay in VRAM. * Add `mmprojPath` parameter to `planModelLoad` and calculate its size, attempting to keep it on GPU when possible. * Refactor system memory detection: * Use `used_memory` (actual free RAM) instead of total RAM for budgeting. * Introduced `usableRAM` placeholder for future use. * Rewrite KV‑cache size calculation: * Properly handle GQA models via `attention.head_count_kv`. * Compute bytes per token as `nHeadKV * headDim * 2 * 2 * nLayer`. * Replace the old 70 % VRAM heuristic with a more flexible budget: * Reserve a fixed VRAM amount and apply an overhead factor. * Derive usable system RAM from total memory minus VRAM. * Implement a robust allocation algorithm: * Prioritize placing the mmproj in VRAM. * Search for the best balance of GPU layers and context length. * Fallback strategies for hybrid and pure‑CPU modes with detailed safety checks. * Add extensive validation of model size, KV‑cache size, layer size, and memory mode. * Improve logging throughout the planning process for easier debugging. * Adjust final plan return shape to include the new `noOffloadMmproj` field. * remove unused variable --------- Co-authored-by: Faisal Amir <urmauur@gmail.com>	2025-09-11 09:48:03 +05:30
Faisal Amir	86dcfc10cf	enhancement: rollback edit capabilities for local model	2025-09-10 19:43:44 +07:00
Faisal Amir	5e30e10bf4	Merge pull request #6388 from menloresearch/feat/import-vision-model feat: allow user import model include mmproj file	2025-09-09 09:41:58 +07:00
Faisal Amir	836990b7d9	chore: update fn check mmproj file	2025-09-08 11:10:00 +07:00
Faisal Amir	1b035fd2f1	feat: allow user import model include mmproj file	2025-09-08 00:00:46 +07:00
Faisal Amir	a49008e02d	enhancement: responsive dialog modals	2025-09-06 21:48:09 +07:00
Dinh Long Nguyen	d490174544	feat: Web use jan model (#6374 ) * call jan api * fix lint * ci: add jan server web * chore: add Dockerfile * clean up ui ux and support for reasoning fields, make app spa * add logo * chore: update tag for preview image * chore: update k8s service name * chore: update image tag and image name * fixed test --------- Co-authored-by: Minh141120 <minh.itptit@gmail.com> Co-authored-by: Nguyen Ngoc Minh <91668012+Minh141120@users.noreply.github.com>	2025-09-05 16:18:30 +07:00
Dinh Long Nguyen	a30eb7f968	feat: Jan Web (reusing Jan Desktop UI) (#6298 ) * add platform guards * add service management * fix types * move to zustand for servicehub * update App Updater * update tauri missing move * update app updater * refactor: move PlatformFeatures to separate const file 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * change tauri fetch name * update implementation * update extension fetch * make web version run properly * disabled unused web settings * fix all tests * fix lint * fix tests * add mock for extension * fix build * update make and mise * fix tsconfig for web-extensions * fix loader type * cleanup * fix test * update error handling + mcp should be working * Update mcp init * use separate is_web_app build property * Remove fixed model catalog url * fix additional tests * fix download issue (event emitter not implemented correctly) * Update Title html * fix app logs * update root tsx render timing --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-05 01:47:46 +07:00
Faisal Amir	cb4641e4ad	feat: allow see apikey when server local status running	2025-09-03 17:55:52 +07:00
Faisal Amir	75d189900c	fix: mcp cleanup dropdown tool available and sort list	2025-08-27 18:08:23 +07:00
Faisal Amir	e73a710c06	fix/update-ui-info	2025-08-25 16:45:59 +07:00
Louis	5c4deff215	Merge pull request #6260 from menloresearch/fix/bring-back-manual-model-capability-edit fix: bring back manual model capability edit modal	2025-08-21 16:31:17 +07:00
Louis	9bc243c3f7	Merge branch 'dev' into fix/enable-back-app-language-setting	2025-08-21 12:53:21 +07:00
Louis	8e7378b70f	Merge pull request #6255 from menloresearch/fix/remove-experimental-toggle fix: remove experimental toggle	2025-08-21 12:51:25 +07:00
Faisal Amir	7b9e752301	Merge pull request #6250 from menloresearch/feat/local-api-server feat: run on startup setting for local api server	2025-08-21 12:43:13 +07:00
Louis	65cb473d25	fix: enable back app language setting	2025-08-21 12:30:30 +07:00
Louis	8de5c1709b	fix: test	2025-08-21 12:01:45 +07:00
Louis	cfbc6b9150	fix: remove experimental toggle	2025-08-21 11:54:34 +07:00
Louis	6850dda108	feat: MCP server error handling	2025-08-20 23:42:12 +07:00
Faisal Amir	39df7b22b9	chore: rename key runOnStartup from hooks useLocalApiServer	2025-08-20 22:37:45 +07:00
Faisal Amir	cfa68c5500	feat: run on startup setting for local api server	2025-08-20 21:56:53 +07:00
Faisal Amir	f96ff52506	enhancement: remove validate file extension from select file	2025-08-20 12:54:08 +07:00
Faisal Amir	b828d3f84f	chore: handle toaster failed import model	2025-08-19 22:07:30 +07:00
Faisal Amir	6c612d8eba	chore: separate function handle import model	2025-08-19 22:01:14 +07:00
Faisal Amir	6ee044d106	fix: validation import model	2025-08-19 21:29:18 +07:00
Faisal Amir	872284b770	enhancement: offload model when provider not active	2025-08-19 14:18:39 +07:00
Louis	bfe671d7b4	feat: #5917 - model tool use capability should be auto detected	2025-08-19 09:51:36 +07:00
Louis	362324cb87	Merge pull request #6188 from menloresearch/feat/mcp-enhancement feat: mcp enhancement	2025-08-18 09:55:44 +07:00
Faisal Amir	b1b2ca1987	Merge pull request #6006 from menloresearch/feat/fav-model 🚀feat: allow user mark model as favorite	2025-08-17 23:14:26 +07:00
Louis	c8d9592ab8	chore: mcp group server, action and import json	2025-08-15 11:37:21 +07:00
Minh141120	aa8fb0464c	Merge branch 'dev' into fix/feature-toggle-auto-updater	2025-08-14 13:42:27 +07:00
Louis	16bfd6eafb	fix: full url search	2025-08-14 11:33:03 +07:00
Louis	4350d4c9a0	fix: feature toggle for auto updater	2025-08-14 09:58:46 +07:00
Louis	f3dd26e499	fix: uvx and npx dirs should not be relocated	2025-08-11 14:33:58 +07:00
Akarshan Biswas	0cfc745954	feat: Introduce structured error handling for llamacpp extension (#6087) * feat: Introduce structured error handling for llamacpp extension This commit introduces a structured error handling system for the `llamacpp` extension. Instead of returning simple string errors, we now use a custom `LlamacppError` struct with a specific `ErrorCode` enum. This allows the frontend to display more user-friendly and actionable error messages based on the code, rather than raw debug logs. The changes include: - A new `ErrorCode` enum to categorize errors (e.g., `OutOfMemory`, `ModelArchNotSupported`, `BinaryNotFound`). - A `LlamacppError` struct to encapsulate the code, a user-facing message, and optional detailed logs. - A static method `from_stderr` that intelligently parses llama.cpp's standard error output to identify and map common issues like Out of Memory errors to a specific error code. - Refactored `ServerError` enum to wrap the new `LlamacppError` and provide a consistent serialization format for the Tauri frontend. - Updated all relevant functions (`load_llama_model`, `get_devices`) to return the new structured error type, ensuring a more robust and predictable error flow. - A reduced timeout for model loading from 300 to 180 seconds. This work lays the groundwork for a more intuitive and helpful user experience, as the application can now provide clear guidance to users when a model fails to load. * Update src-tauri/src/core/utils/extensions/inference_llamacpp_extension/server.rs Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update src-tauri/src/core/utils/extensions/inference_llamacpp_extension/server.rs Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * chore: update FE handle error object from extension * chore: fix property type --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> Co-authored-by: Faisal Amir <urmauur@gmail.com>	2025-08-07 23:28:25 +05:30
Faisal Amir	4d67418b0d	fix: update ux recommend backend label into desc setting (#6088)	2025-08-07 22:14:23 +07:00
Faisal Amir	f58332e9b5	Merge branch 'dev' into feat/fav-model	2025-08-07 18:11:44 +07:00
Faisal Amir	d8e1fef3f0	🐛fix/onboarding-loop (#6054)	2025-08-07 18:11:22 +07:00
Faisal Amir	e3ba37ba15	🚀feat: allow user mark model as favorite	2025-08-05 14:26:12 +07:00
Chaiyapruek Muangsiri	da0cf10f91	remove unnecessary try catch block	2025-08-05 08:08:59 +07:00
Chaiyapruek Muangsiri	477651e5d5	fix connected servers status not in sync when edit mcp json	2025-08-05 08:08:59 +07:00
Faisal Amir	59a17d4a2a	fix/remove-auto-refresh-model (#6002)	2025-07-31 14:07:31 +07:00
Faisal Amir	f58d745585	fix: title tooltip MCP edit json (#5987) * fix/title-tooltip-mcp-json * fix: title tooltip delete mcp	2025-07-30 21:00:55 +07:00
Louis	812a8082b8	fix: factory reset fail with access denied error (#5952) * fix: factory reset fail due to access denied error * fix: unused import * fix: tests	2025-07-28 23:20:45 +07:00
Faisal Amir	1c74bfd5ef	fix: update edge case experimental feature MCP (#5951) * fix: update edge case experimental feature MCP * Update web-app/src/routes/settings/mcp-servers.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-07-28 21:31:51 +07:00
Faisal Amir	54d44ce741	fix: update default GPU toggle, and simplify state (#5937)	2025-07-27 14:36:08 +07:00
Faisal Amir	b89d9d090f	fix: update ui version_backend, mem usage hardware (#5932) * fix: update ui version_backend, mem usage hardware * chore: hidden gpu from system monitor on mac * chore: fix gpus vram	2025-07-26 18:36:18 +07:00
Akarshan Biswas	8ec4a36826	fix: Frontend updates when llama.cpp backend auto-downloads (#5926)	2025-07-26 08:48:29 +07:00
Faisal Amir	2e870ad4d0	fix: calculation memory on hardware and system monitor (#5922)	2025-07-26 08:47:59 +07:00

150 Commits
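
Note on commit 7a174e621a: the message gives the KV-cache cost as `nHeadKV * headDim * 2 * 2 * nLayer` bytes per token and describes a GREEN / YELLOW / RED support check with an 80% usable-memory heuristic. The sketch below is one possible reading of those two bullets, not the code from the commit. The function names, parameter names, and the interpretation of the two factors of 2 (one for keys plus values, one for 2-byte f16 elements) are assumptions made for illustration.

```typescript
// Illustrative sketch only: every identifier here is hypothetical and is
// not taken from the Jan codebase.

const BYTES_PER_ELEMENT = 2; // assumption: f16 KV cache, 2 bytes per value

/**
 * KV-cache bytes needed per token, following the formula quoted in the
 * commit message: nHeadKV * headDim * 2 * 2 * nLayer.
 * The first 2 is read here as "keys and values", the second as bytes per element.
 */
function kvCacheBytesPerToken(nHeadKV: number, headDim: number, nLayer: number): number {
  return nHeadKV * headDim * 2 * BYTES_PER_ELEMENT * nLayer;
}

type Support = "GREEN" | "YELLOW" | "RED";

/**
 * Classify whether a model fits, assuming a fixed usable fraction of memory
 * (0.8 mirrors the 80% heuristic mentioned in the commit message).
 * GREEN: weights + KV cache fit in usable VRAM.
 * YELLOW: they fit only in usable VRAM + RAM combined.
 * RED: they do not fit at all.
 */
function classifySupport(
  modelBytes: number,
  kvCacheBytes: number,
  vramBytes: number,
  ramBytes: number,
  usableFraction: number = 0.8,
): Support {
  const required = modelBytes + kvCacheBytes;
  const usableVram = vramBytes * usableFraction;
  const usableTotal = (vramBytes + ramBytes) * usableFraction;
  if (required <= usableVram) return "GREEN";
  if (required <= usableTotal) return "YELLOW";
  return "RED";
}

// Example: a GQA model with 8 KV heads, head dimension 128, 32 layers.
const GiB = 1024 ** 3;
const perToken = kvCacheBytesPerToken(8, 128, 32); // 131072 bytes (128 KiB)
const kvAt8k = perToken * 8192;                    // 1 GiB at an 8192-token context
const verdict = classifySupport(4 * GiB, kvAt8k, 8 * GiB, 16 * GiB);
console.log(perToken, kvAt8k / GiB, verdict);
```

Using the KV head count rather than the full attention head count is what makes the estimate smaller for GQA models, which matches the commit's note about `attention.head_count_kv`. The later bullets of the same commit describe replacing a fixed-percentage heuristic with a reserved-VRAM budget, so the fixed 0.8 fraction here should be read as the simpler, earlier form of the check.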