Nicholai/jan - jan - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Faisal Amir	6bb66b2b93	fix: handle checking compatible gated model	2025-08-28 12:57:11 +07:00
Faisal Amir	7801f9c330	fix: update copy mmproj setting desc	2025-08-22 15:27:07 +07:00
Akarshan Biswas	510c70bdf7	feat: Add model compatibility check and memory estimation (#6243 ) * feat: Add model compatibility check and memory estimation This commit introduces a new feature to check if a given model is supported based on available device memory. The change includes: - A new `estimateKVCache` method that calculates the required memory for the model's KV cache. It uses GGUF metadata such as `block_count`, `head_count`, `key_length`, and `value_length` to perform the calculation. - An `isModelSupported` method that combines the model file size and the estimated KV cache size to determine the total memory required. It then checks if any available device has sufficient free memory to load the model. - An updated error message for the `version_backend` check to be more user-friendly, suggesting a stable internet connection as a potential solution for backend setup failures. This functionality helps prevent the application from attempting to load models that would exceed the device's memory capacity, leading to more stable and predictable behavior. fixes: #5505 * Update extensions/llamacpp-extension/src/index.ts Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update extensions/llamacpp-extension/src/index.ts Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Extend this to available system RAM if GGML device is not available * fix: Improve model metadata and memory checks This commit refactors the logic for checking if a model is supported by a system's available memory. Key changes: - Remote model support: The `read_gguf_metadata` function can now fetch metadata from a remote URL by reading the file in chunks. - Improved KV cache size calculation: The KV cache size is now estimated more accurately by using `attention.key_length` and `attention.value_length` from the GGUF metadata, with a fallback to `embedding_length`. - Granular memory check statuses: The `isModelSupported` function now returns a more specific status (`'RED'`, `'YELLOW'`, `'GREEN'`) to indicate whether the model weights or the KV cache are too large for the available memory. - Consolidated logic: The logic for checking local and remote models has been consolidated into a single `isModelSupported` function, improving code clarity and maintainability. These changes provide more robust and informative model compatibility checks, especially for models hosted on remote servers. * Update extensions/llamacpp-extension/src/index.ts Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Make ctx_size optional and use sum free memory across ggml devices * feat: hub and dropdown model selection handle model compatibility * feat: update bage model info color * chore: enable detail page to get compatibility model * chore: update copy * chore: update shrink indicator UI --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> Co-authored-by: Faisal Amir <urmauur@gmail.com>	2025-08-21 16:13:50 +05:30
Dinh Long Nguyen	32a2ca95b6	feat: gguf file size + hash validation (#5266 ) (#6259 ) * feat: gguf file size + hash validation * fix tests fe * update cargo tests * handle asyn download for both models and mmproj * move progress tracker to models * handle file download cancelled * add cancellation mid hash run	2025-08-21 16:17:58 +07:00
Faisal Amir	07b1101736	chore: enable attachment icon for remote provider	2025-08-19 22:46:45 +07:00
Faisal Amir	80dc491f9d	chore: conditianal attachment and drag file to chat input	2025-08-19 22:46:45 +07:00
Faisal Amir	fdc8e07f86	chore: update model setting include offload-mmproj	2025-08-19 20:00:08 +07:00
Faisal Amir	00674ec0d5	chore: update test case model service	2025-08-19 19:51:33 +07:00
Faisal Amir	050d38c0ab	fix: convert search hgf repo include mmproj models	2025-08-19 19:51:32 +07:00
Louis	d7cf258a40	fix: tool indicator in hub	2025-08-19 10:40:11 +07:00
Louis	bfe671d7b4	feat: #5917 - model tool use capability should be auto detected	2025-08-19 09:51:36 +07:00
Faisal Amir	a66d83c598	Merge pull request #6172 from menloresearch/fix/model-id-special-char fix: handle modelId special char	2025-08-14 12:33:58 +07:00
Louis	526e532e2d	fix: normalize model id from source preparation	2025-08-14 10:50:50 +07:00
Faisal Amir	ace8214d4d	chore: make utils sanitize modelId	2025-08-14 09:42:47 +07:00
Faisal Amir	b338849952	fix: handle modelId special char	2025-08-14 09:18:03 +07:00
Faisal Amir	5266583e5b	enhancement: Add support for mmproj models (#6150 )	2025-08-13 10:05:25 +07:00
Louis	c355649759	fix: HF token is not used while searching repositories	2025-08-11 15:29:50 +07:00
Louis	7f0c605651	fix: Jan hub repo detail and deep link	2025-08-05 13:44:40 +07:00
Louis	6599d91660	fix: bring back HF repo ID search in Hub (#5880 ) * fix: bring back HF search input * test: fix useModelSources tests for updated addSource signature	2025-07-24 09:46:13 +07:00
Faisal Amir	1d443e1f7d	fix: support load model configurations (#5843 ) * fix: support load model configurations * chore: remove log * chore: sampling params add from send completion * chore: remove comment * chore: remove comment on predefined file * chore: update test model service	2025-07-22 19:52:12 +07:00
Louis	c550f6cf0d	Merge pull request #5809 from menloresearch/refactor/simplify-proxy-settings refactor: simplify proxy settings by removing unused SSL verification options	2025-07-19 16:34:37 +07:00
Louis	8ca507c01c	feat: proxy support for the new downloader (#5795 ) * feat: proxy support for the new downloader * test: remove outdated test * ci: clean up	2025-07-17 23:10:21 +07:00
Akarshan Biswas	dee98f41d1	✨ Feat: Improved llamacpp Server Stability and Diagnostics (#5761 ) * feat: Improve llamacpp server error reporting and model load stability This commit introduces significant improvements to how the llamacpp server process is managed and how its errors are reported. Key changes: - Enhanced Error Reporting: The llamacpp server's stdout and stderr are now piped and captured. If the llamacpp process exits prematurely or fails to start, its stderr output is captured and returned as a `LlamacppError`. This provides much more specific and actionable diagnostic information for users and developers. - Increased Model Load Timeout: The `waitForModelLoad` timeout has been increased from 30 seconds to 240 seconds (4 minutes). This addresses issues where larger models or slower systems would prematurely time out during the model loading phase. - API Secret Update: The internal API secret for the llamacpp extension has been updated from 'Jan' to 'JustAskNow'. - Version Bump: The application version in `tauri.conf.json` has been incremented to `0.6.901`. * fix: should not spam load requests * test: add test to cover the fix * refactor: clean up * test: add more test case --------- Co-authored-by: Louis <louis@jan.ai>	2025-07-14 11:55:44 +05:30
Faisal Amir	a0be23b500	enhancement: show readme on detail each model (#5705 ) * 🧹cleanup: linter and log * Update web-app/src/routes/hub/$modelId.tsx Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-07-07 09:54:16 +07:00
Louis	2bdbce2e40	refactor: clean up unused apis	2025-07-02 12:29:02 +07:00
Louis	c6ac9f1d2a	feat: sync hub with model catalog	2025-07-02 12:29:01 +07:00
Louis	5edc773535	fix: wait for model start	2025-07-02 12:28:25 +07:00
Louis	8bd4a3389f	refactor: frontend uses new engine extension # Conflicts: # extensions/model-extension/resources/default.json # web-app/src/containers/dialogs/DeleteProvider.tsx # web-app/src/routes/hub.tsx	2025-07-02 12:28:24 +07:00
Louis	035cc0f79c	Sync Release/v0.6.0 into dev (#5293 ) * chore: enable shortcut zoom (#5261) * chore: enable shortcut zoom * chore: update shortcut setting * fix: thinking block (#5263) * Merge pull request #5262 from menloresearch/chore/sync-new-hub-data chore: sync new hub data * ✨enhancement: model run improvement (#5268) * fix: mcp tool error handling * fix: error message * fix: trigger download from recommend model * fix: can't scroll hub * fix: show progress * ✨enhancement: prompt users to increase context size * ✨enhancement: rearrange action buttons for a better UX * 🔧chore: clean up logics --------- Co-authored-by: Faisal Amir <urmauur@gmail.com> * fix: glitch download from onboarding (#5269) * ✨enhancement: Model sources should not be hard coded from frontend (#5270) * 🐛fix: default onboarding model should use recommended quantizations (#5273) * 🐛fix: default onboarding model should use recommended quantizations * ✨enhancement: show context shift option in provider settings * 🔧chore: wording * 🔧 config: add to gitignore * 🐛fix: Jan-nano repo name changed (#5274) * 🚧 wip: disable showSpeedToken in ChatInput * 🐛 fix: commented out the wrong import * fix: masking value MCP env field (#5276) * ✨ feat: add token speed to each message that persist * ♻️ refactor: to follow prettier convention * 🐛 fix: exclude deleted field * 🧹 clean: all the missed console.log * ✨enhancement: out of context troubleshooting (#5275) * ✨enhancement: out of context troubleshooting * 🔧refactor: clean up * ✨enhancement: add setting chat width container (#5289) * ✨enhancement: add setting conversation width * ✨enahncement: cleanup log and change improve accesibility * ✨enahcement: move const beta version * 🐛fix: optional additional_information gpu (#5291) * 🐛fix: showing release notes for beta and prod (#5292) * 🐛fix: showing release notes for beta and prod * ♻️refactor: make an utils env * ♻️refactor: hide MCP for production * ♻️refactor: simplify the boolean expression fetch release note --------- Co-authored-by: Faisal Amir <urmauur@gmail.com> Co-authored-by: LazyYuuki <huy2840@gmail.com> Co-authored-by: Bui Quang Huy <34532913+LazyYuuki@users.noreply.github.com>	2025-06-16 17:27:42 +07:00
Faisal Amir	7b59aa32f9	chore: onboarding local model (#5234 ) * chore: simple onboarding local model * chore: update new model and improve flow e2e onboarding local model * fix: default tool support models --------- Co-authored-by: Louis <louis@jan.ai>	2025-06-11 18:38:07 +07:00
Louis	7dc51c5e0f	fix: relocate jan data folder (#5179 ) * fix: relocate jan data folder failed * fix: avoid infinite recursion * chore: kill background processes to unblock factory reset * chore: stop models before reset factory * chore: clean up * chore: clean up * fix: show error * chore: get active models should not have retry	2025-06-03 21:23:42 +07:00
Faisal Amir	1b3f16b3e1	feat: start and stop model (#5133 ) * feat: start and stop model * refactor: clean up start models --------- Co-authored-by: Louis <louis@jan.ai>	2025-05-29 13:23:12 +07:00
Louis	4672754b81	chore: persist assistants settings (#5127 ) * chore: assistant settings * chore: fix model sources issue after deleted models * chore: assistants as files * chore: clean up	2025-05-28 19:33:13 +07:00
Louis	26154941ca	fix: chore UI issues (#5116 ) * fix: app log outputs with debug level * fix: reasoning block still show loading indicator when stopped * chore: fix mistral AI base url	2025-05-27 19:38:21 +07:00
Louis	cdd13594a3	fix: model import name issues (#5093 )	2025-05-24 14:48:38 +07:00
Louis	942f2f51b7	chore: send chat completion with messages history (#5070 ) * chore: send chat completion with messages history * chore: handle abort controllers * chore: change max attempts setting * chore: handle stop running models in system monitor screen * Update web-app/src/services/models.ts Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * chore: format time * chore: handle stop model load action --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-22 20:13:50 +07:00
Louis	570bb8290f	chore: add model information in System Monitoring (#5062 ) * chore: add model information in System Monitoring * chore: handle empty models case * chore: fix type	2025-05-22 16:07:08 +07:00
Faisal Amir	88ea8b0638	chore: show model name as filename	2025-05-20 22:25:40 +07:00
Faisal Amir	c54e8c80b6	chore: model import from llama.cpp provider	2025-05-20 15:48:45 +07:00
Louis	2bc8fccaf0	chore: allow users to enable/disable MCP servers (#5015 )	2025-05-19 13:15:37 +07:00
Louis	2345ff172d	chore: update model handlers on the new frontend (#5011 ) * chore: provide model handlers to new frontend * chore: add API server function to the new front end	2025-05-19 10:39:43 +07:00
Louis	c1091ce812	feat: new frontend with model download function (#5008 )	2025-05-18 20:00:17 +07:00
Faisal Amir	852ea84cd8	epic: Jan with new UI/UX (#4964 ) * chore: initial new FE setup * chore: update namespace text-left-panel foreground variable * chore: enable dynamic mainview color * chore: remove greetings new chat * chore: fix chat input style * chore: simplify hook useAppearance * chore: enable internationalization * chore: prepare vn locale * chore: keyboardshortcut layout * chore: update keyboard shortcut exclude pathname * chore: update state active setting route * chore: fix update theme by system * chore: handle dynamic primary color * chore: fix left panel navigation active state and styled item privacy analytic * chore: reorder general setting being a first * chore: add function reset appearance * chore: update scrollbar * chore: update delete thread with dialog confirmation * chore: update state dialog inside dropdown menu * chore: wip thread detail or chat page * chore: wip model dropdown * chore: prepare model dropdown select * chore: update model providers setting * chore: show provider on model dropdown based isActive toogle * chore: update layout model provider * chore: update state active on storage * chore: update gap of item dropdown model * chore: update select model base on id * chore: update edit model capabilities * chore: add dialog to add model * chore: update sheet for model setting * chore: add sheet setting each model * chore: make dynamic syntax highlight * chore: fix menu setting appearance theme * chore: markdown render support emoji * chore: markdown support latex * chore: change codeblock default theme * chore: update ui codeblock * chore: custom render link taget new window * chore: fix copy button codeblock * chore: update accent and desctructive color * chore: setup user chat message * chore: prepare some page settings * chore: simple list extension and prepare mcp, local api, and hardware * chore: mcp-serve * chore: MCP server UI * chore: update local api server config * chore: adjust chat input * chore: update local api server log * chore: prepare hub page * chore: remove help page * chore: update mock * chore: prepare http proxy setting UI * chore: adjust local api server and title every action * fix: chore FE package (#4962) * fix: update command which referred to non-existent web app * fix: added commented out macos platform for now * fix: remove the platform name as macos * fix: remove unnecessary line for platform name in HeaderPage component * fix: update dev script to specify port 3000 for Vite * feat: model providers and chat completion * enhancement: threads performance * fix: thread content update * chore: clean up threads * fix: performance issue with streaming and state loop * fix: streaming * fix: react markdow * feat: extension manager * chore: add nodePolyfills include path * chore: improve performance avoid unhandle rejection * chore: update pre margin bottom * chore: swith thread should be deafult scroll to bottom * chore: wip scroll to bottom * chore: add model loader * chore: add platform utils * feat: threads functionality * chore: setup toaster * chore: persist threads deletion * fix: create thread with new message * chore: create new thread should change route path * chore: navigate after delet dialog thread * chore: thread favorites and orders * chore: dismiss deleting modal on delete * chore: remove undefined properties * chore: remove deprecated run step * chore: fix delete thread * chore: create empty thread content on started streaming * chore: correct messages store key * chore: stuck at generating state * chore: preapre chat toolbar * chore: introduce in-memory app state * chore: update extensions migration logic * chore: remove redundant extensions migration gate * chore: message toolbar user and assistant * chore: add logo gemini * feat: remote providers with model capabilities * chore: maintain provider settings * chore: move speed token into chat input * chore: temp harcoded model loader * chore: make chat text selectable and truncate model list * chore: update shortcut UI * Feat/implement threads (#4977) * chore: add fuse.js library for enhanced search functionality * feat: implement thread filtering with Fuse.js for improved search capabilities * fix: update the fuseOptions * feat: add search functionality to LeftPanel and refactor thread retrieval logic * refactor: optimize thread filtering and improve search functionality in LeftPanel * fix: more edits * refactor: remove duplicate import of useAppState in StreamingContent component * chore: update navigate after delete all thread * chore: pass prop speedToken from new chat input * chore: persist provider general settings * chore: styling search left panel * chore: cleanup margin * chore: update size icon * chore: improve chat input * chore: imprve list markdown * chore: animate border * feat: local model provider work * chore: persist manually added model * chore: prepare download management ui and show version on general setting * chore: improve pre tag * chore: remove buton install extension and improve light theme download * chore: add missing hardware information handler * chore: cleanup small ui * chore: update default provider settings * fix: missing fs commands * chore: correct provider models * chore: prepare delete model * chore: handle thinking block * chore: fix conditional message toolbar * chore: pophover download select none * enhancement: add prune mode * chore: model settings * chore: bump engine version tauri * chore: update style thinking * chore: add indicator and toogle mcp server * chore: wip hub * chore: update model settings * chore: mvp hub * chore: add function rename title * chore: update function delete message * chore: update rename title * chore: update model settings * chore: persist MCP configs * refactor: clean up utils * chore: add tools to completion request * chore: clean up * chore: ignore assets --------- Co-authored-by: Ivan Leo <ivanleomk@gmail.com> Co-authored-by: Louis <louis@jan.ai>	2025-05-15 19:38:59 +07:00

43 Commits