Nicholai/jan - jan - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Akarshan	8b15fe4ef2	feat: Simplify backend architecture This commit introduces a functional flag for embedding models and refactors the backend detection logic for cleaner implementation. Key changes: - Embedding Support: The loadLlamaModel API and SessionInfo now include an isEmbedding: boolean flag. This allows the core process to differentiate and correctly initialize models intended for embedding tasks. - Backend Naming Simplification (Refactor): Consolidated the CPU-specific backend tags (e.g., win-noavx-x64, win-avx2-x64) into generic *-common_cpus-x64 variants (e.g., win-common_cpus-x64). This streamlines supported backend detection. - File Structure Update: Changed the download path for CUDA runtime libraries (cudart) to place them inside the specific backend's directory (/build/bin/) rather than a shared lib folder, improving asset isolation.	2025-10-29 08:02:09 +05:30
Akarshan	0c5fbc102c	refactor: Simplify Tauri plugin calls and enhance 'Flash Attention' setting This commit introduces significant improvements to the llama.cpp extension, focusing on the 'Flash Attention' setting and refactoring Tauri plugin interactions for better code clarity and maintenance. The backend interaction is streamlined by removing the unnecessary `libraryPath` argument from the Tauri plugin commands for loading models and listing devices. * Simplified API Calls: The `loadLlamaModel`, `unloadLlamaModel`, and `get_devices` functions in both the extension and the Tauri plugin now manage the library path internally based on the backend executable's location. * Decoupled Logic: The extension (`src/index.ts`) now uses the new, simplified Tauri plugin functions, which enhances modularity and reduces boilerplate code in the extension. * Type Consistency: Added `UnloadResult` interface to `guest-js/index.ts` for consistency. * Updated UI Control: The 'Flash Attention' setting in `settings.json` is changed from a boolean checkbox to a string-based dropdown, offering 'auto', 'on', and 'off' options. * Improved Logic: The extension logic in `src/index.ts` is updated to correctly handle the new string-based `flash_attn` configuration. It now passes the string value (`'auto'`, `'on'`, or `'off'`) directly as a command-line argument to the llama.cpp backend, simplifying the version-checking logic previously required for older llama.cpp versions. The old, complex logic tied to specific backend versions is removed. This refactoring cleans up the extension's codebase and moves environment and path setup concerns into the Tauri plugin where they are most relevant.	2025-10-29 08:00:57 +05:30
Minh141120	15c426aefc	chore: update org name	2025-10-28 17:26:27 +07:00
Dinh Long Nguyen	340042682a	ui ux enhancement	2025-10-09 03:48:51 +07:00
Dinh Long Nguyen	6dd2d2d6c1	Merge branch 'dev' into feat/file-attachment	2025-10-09 02:21:22 +07:00
Akarshan	7762cea10a	feat: Distinguish and preserve embedding model sessions This commit introduces a new field, `is_embedding`, to the `SessionInfo` structure to clearly mark sessions running dedicated embedding models. Key changes: - Adds `is_embedding` to the `SessionInfo` interface in `AIEngine.ts` and the Rust backend. - Updates the `loadLlamaModel` command signatures to pass this new flag. - Modifies the llama.cpp extension's auto-unload logic to explicitly filter out and not unload any currently loaded embedding models when a new text generation model is loaded. This is a critical performance fix to prevent the embedding model (e.g., used for RAG) from being repeatedly reloaded. Also includes minor code style cleanup/reformatting in `jan-provider-web/provider.ts` for improved readability.	2025-10-08 20:03:35 +05:30
Faisal Amir	610b741db2	Merge pull request #6763 from menloresearch/chore/turn-off-zoomHotkeysEnabled chore: turn off zoomHotkeysEnabled	2025-10-08 19:16:34 +07:00
Minh141120	1905f9a9ce	chore: move license to resources	2025-10-08 16:55:24 +07:00
Dinh Long Nguyen	ff93dc3c5c	Merge branch 'dev' into feat/file-attachment	2025-10-08 16:34:45 +07:00
Dinh Long Nguyen	510c4a5188	working attachments	2025-10-08 16:08:40 +07:00
Minh141120	c7d1a3c65d	chore: update license path	2025-10-08 15:48:16 +07:00
Faisal Amir	f224d18d7f	chore: turn off zoomHotkeysEnabled	2025-10-08 12:54:04 +07:00
Louis	26006c143e	fix: build	2025-10-07 19:33:49 +07:00
Nguyen Ngoc Minh	816d60b22a	Merge pull request #6721 from menloresearch/chore/use-custom-nsis-template chore: use custom nsis template # Conflicts: # Makefile # package.json # src-tauri/tauri.windows.conf.json	2025-10-07 18:05:14 +07:00
Louis	fe2c2a8687	Merge branch 'dev' into release/v0.7.0 # Conflicts: # web-app/src/containers/DropdownModelProvider.tsx # web-app/src/containers/ThreadList.tsx # web-app/src/containers/__tests__/DropdownModelProvider.displayName.test.tsx # web-app/src/hooks/__tests__/useModelProvider.test.ts # web-app/src/hooks/useChat.ts # web-app/src/lib/utils.ts	2025-10-06 20:42:05 +07:00
Faisal Amir	17dced03c0	chore: check support blur on FE	2025-10-06 10:55:17 +07:00
Faisal Amir	39b1ba4691	chore: check support blur using hardware api	2025-10-06 10:55:17 +07:00
Faisal Amir	8c7ad408a9	chore: fix desktop capabilities	2025-10-06 10:55:17 +07:00
Faisal Amir	f0c4784b7b	chore: update permission windows	2025-10-06 10:55:17 +07:00
Faisal Amir	aa0c4b0d1b	fix: theme native system and check os support blur	2025-10-06 10:55:17 +07:00
Vanalite	fa61163350	fix: Fix openssl issue on mobile after merging	2025-10-05 14:40:39 +07:00
Vanalite	41a93690a1	Merge remote-tracking branch 'origin/dev' into mobile/persistence_store	2025-10-04 12:28:11 +07:00
Vanalite	b628b3d9ab	fix: Fix tests in threads with a proper mock folder	2025-10-03 14:17:59 +07:00
Dinh Long Nguyen	5adaf62975	fix: extensions missing on Unix dev (#6724) * fix: extensions missing on Unix dev * re add bun uv for mcp	2025-10-03 13:54:37 +07:00
Vanalite	1747e0ad41	Merge remote-tracking branch 'origin/dev' into mobile/persistence_store # Conflicts: # src-tauri/src/core/extensions/commands.rs	2025-10-02 20:59:34 +07:00
Vanalite	08d527366e	feat: organize code for proper import Move platform checker for db access to helper Add test for threads controller	2025-10-02 20:53:46 +07:00
Vanalite	9720ad368e	feat: use sql for mobile storage	2025-10-02 18:09:33 +07:00
Roushan Kumar Singh	eccaa282e0	refactor: resolve rust analyzer warnings and improve code quality (#6696) - Update string formatting to use modern interpolation syntax - Simplify expressions and remove unnecessary intermediate variables - Improve logging statements for better readability - Clean up code across core modules (app, downloads, mcp, server, etc.)	2025-10-02 15:01:06 +07:00
Akarshan Biswas	0f0ba43b7f	feat: Adjust RAM/VRAM calculation for unified memory systems (#6687) * feat: Adjust RAM/VRAM calculation for unified memory systems This commit refactors the logic for calculating total RAM and total VRAM in `is_model_supported` and `plan_model_load` commands, specifically targeting systems with unified memory (like modern macOS devices where the GPU list may be empty). The changes are as follows: * Total RAM Calculation: If no GPUs are detected (`sys_info.gpus.is_empty()` is true), total RAM is now set to 0. This avoids confusing total system memory with dedicated GPU memory when planning model placement. * Total VRAM Calculation: If no GPUs are detected, total VRAM is still calculated as the system's total memory (RAM), as this shared memory acts as VRAM on unified memory architectures. This adjustment improves the accuracy of memory availability checks and model planning on unified memory systems. * fix: total usable memory in case there is no system vram reported * chore: temporarily change to self-hosted runner mac * ci: revert back to github hosted runner macos --------- Co-authored-by: Louis <louis@jan.ai> Co-authored-by: Minh141120 <minh.itptit@gmail.com>	2025-10-01 18:58:14 +07:00
Roushan Kumar Singh	247db95bad	resolve TypeScript and Rust warnings (#6612) * chore: fix warnings * fix: add missing scrollContainerRef dependencies to React hooks * fix: typo * fix: remove unsupported fetch option and enable AsyncIterable types - Removed `connectTimeout` from fetch init (not supported in RequestInit) - Updated tsconfig to target ES2018 * chore: refactor rename * fix(hooks): update dependency arrays for useThreadScrolling effects * Add type.d.ts to extend requestinit with connectionTimeout * remove commented unused import	2025-10-01 16:06:41 +07:00
Vanalite	262a1a9544	Merge remote-tracking branch 'origin/dev' into mobile/dev # Conflicts: # src-tauri/src/core/setup.rs # src-tauri/src/lib.rs # web-app/src/hooks/useChat.ts	2025-10-01 09:52:01 +07:00
Dinh Long Nguyen	9a72a2d5d5	fix tauri test	2025-09-30 22:43:14 +07:00
Nghia Doan	c5a5968bf8	Merge pull request #6643 from menloresearch/fix/model-name-change fix: Apply model name change correctly	2025-09-30 22:41:05 +07:00
Dinh Long Nguyen	d50226b4dd	add missing closing test	2025-09-30 22:36:52 +07:00
Dinh Long Nguyen	817680565e	remove test conflict	2025-09-30 22:33:51 +07:00
Dinh Long Nguyen	84f46dc997	Merge branch 'dev' into feat/sync-release=to-dev	2025-09-30 22:31:20 +07:00
Louis	3c7eb64353	fix: mcp bin path (#6667) * fix: mcp bin path * chore: clean up unused structs * fix: bin name * fix: tests	2025-09-30 22:29:15 +07:00
Dinh Long Nguyen	e6bc1182a6	Merge branch 'dev' into feat/sync-release=to-dev	2025-09-30 22:04:27 +07:00
Vanalite	6bd623c020	fix: Fix cargo test	2025-09-30 17:19:58 +07:00
Vanalite	a62852f384	fix: Restore default permission on desktop build Restore desktop capabilities Restore linter correctness Restore different capabilities on each platform	2025-09-30 17:01:09 +07:00
Minh141120	508cbe16f8	refactor: remove redundant resource	2025-09-30 15:49:14 +07:00
Minh141120	0b8f3e01fb	feat: add msi installer for windows	2025-09-30 15:32:29 +07:00
Vanalite	43d20e2a32	fix: revert the modification of vulkan	2025-09-30 14:50:54 +07:00
Akarshan	34b254e2d8	fix: Improve KV cache estimation robustness The KV cache size calculation in estimate_kv_cache_internal now includes a fallback mechanism for models that do not explicitly define key_length and value_length in the GGUF metadata. If these attention keys are missing, the head dimension (and thus key/value length) is calculated using the formula embedding_length / total_heads. This improves robustness and compatibility with GGUF models that don't have the proper keys in metadata. Also adds logging of the full model metadata for easier debugging of the estimation process.	2025-09-30 11:14:18 +05:30
Nguyen Ngoc Minh	d315522c5a	Merge pull request #6618 from github-roushan/show-supported-files Show supported files	2025-09-30 12:19:22 +07:00
Vanalite	549c962248	fix: Fix nvidia and vulkan after upgrade to be compatible with mobile compiling too	2025-09-30 09:44:21 +07:00
Louis	54d17c9c72	fix: migrate new mcp server config (#6651)	2025-09-30 00:07:57 +07:00
Vanalite	5e57caee43	Merge remote-tracking branch 'origin/dev' into mobile/dev # Conflicts: # extensions/yarn.lock # package.json # src-tauri/plugins/tauri-plugin-hardware/src/vendor/vulkan.rs # src-tauri/src/lib.rs # yarn.lock	2025-09-29 22:22:00 +07:00
Nghia Doan	70ac13e536	Merge pull request #6643 from menloresearch/fix/model-name-change fix: Apply model name change correctly	2025-09-29 22:15:13 +07:00
Louis	5fd249c72d	refactor: deprecate Vulkan external binaries (#6638) * refactor: deprecate vulkan binary refactor: clean up vulkan lib chore: cleanup chore: clean up chore: clean up fix: build * fix: skip binaries download env * Update src-tauri/utils/src/system.rs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update src-tauri/utils/src/system.rs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-09-29 17:47:59 +07:00
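
Commit 34b254e2d8 in the log above describes a fallback for KV cache estimation: when the GGUF metadata lacks key_length or value_length, the head dimension is derived as embedding_length / total_heads. The sketch below illustrates that rule only and is not the repository's estimate_kv_cache_internal. The function names `lookup` and `head_dims`, the flat `HashMap<String, u64>` metadata representation, and the exact key spellings (`<arch>.attention.key_length`, `<arch>.attention.head_count`, and so on) are assumptions made for illustration.

```rust
use std::collections::HashMap;

/// Look up an architecture-prefixed metadata value, e.g. "llama.embedding_length".
/// The flat string-to-u64 map is an assumption; real GGUF metadata is typed.
fn lookup(meta: &HashMap<String, u64>, arch: &str, suffix: &str) -> Option<u64> {
    meta.get(&format!("{arch}.{suffix}")).copied()
}

/// Return (key_length, value_length) per attention head.
/// Explicit metadata values win; a missing one falls back to
/// embedding_length / head_count, as the commit message describes.
fn head_dims(meta: &HashMap<String, u64>, arch: &str) -> Option<(u64, u64)> {
    // Derived head dimension, available only when both inputs exist
    // and the head count is non-zero.
    let derived = match (
        lookup(meta, arch, "embedding_length"),
        lookup(meta, arch, "attention.head_count"),
    ) {
        (Some(embd), Some(heads)) if heads > 0 => Some(embd / heads),
        _ => None,
    };

    let key = lookup(meta, arch, "attention.key_length").or(derived)?;
    let value = lookup(meta, arch, "attention.value_length").or(derived)?;
    Some((key, value))
}
```

With an embedding length of 4096 and 32 heads and no explicit lengths, `head_dims` yields 128 for both key and value. If neither the explicit keys nor the two fallback inputs are present, it returns `None` so the caller can decide how to report the failure.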


380 Commits
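
Commit 0f0ba43b7f in the log above changes how total RAM and total VRAM are computed when no GPUs are detected: total RAM is set to 0 and total VRAM is taken to be the system's total memory. A minimal sketch of that branch follows. The `SysInfo` and `Gpu` structs, their field names, the MiB unit, and the `memory_totals` function are hypothetical. The multi-GPU branch (summing per-GPU memory) is also an assumption, since the commit message only specifies the empty-GPU case. A follow-up bullet in the same commit ("total usable memory in case there is no system vram reported") may refine this behaviour further.

```rust
/// Hypothetical stand-in for a detected GPU; field name and unit are assumed.
struct Gpu {
    total_memory_mib: u64,
}

/// Hypothetical stand-in for the system info used by the planning commands.
struct SysInfo {
    total_memory_mib: u64,
    gpus: Vec<Gpu>,
}

/// Return (total_ram_mib, total_vram_mib) for model load planning.
fn memory_totals(sys: &SysInfo) -> (u64, u64) {
    if sys.gpus.is_empty() {
        // Unified memory case from the commit message: system memory is
        // counted once, as VRAM, and RAM is reported as 0.
        (0, sys.total_memory_mib)
    } else {
        // Assumed behaviour for discrete GPUs: RAM is system memory and
        // VRAM is the sum over all detected GPUs.
        let vram = sys.gpus.iter().map(|g| g.total_memory_mib).sum();
        (sys.total_memory_mib, vram)
    }
}
```

Reporting RAM as 0 in the unified case keeps the same physical memory from being counted in both pools when a planner adds the two figures together.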