Go to file

Akarshan Biswas a1af70f7a9

feat: Enhance Llama.cpp backend management with persistence (#5886 )

* feat: Enhance Llama.cpp backend management with persistence

This commit introduces significant improvements to how the Llama.cpp extension manages and updates its backend installations, focusing on user preference persistence and smarter auto-updates.

Key changes include:

* **Persistent Backend Type Preference:** The extension now stores the user's preferred backend type (e.g., `cuda`, `cpu`, `metal`) in `localStorage`. This ensures that even after updates or restarts, the system attempts to use the user's previously selected backend type, if available.
* **Intelligent Auto-Update:** The auto-update mechanism has been refined to prioritize updating to the **latest version of the *currently selected backend type*** rather than always defaulting to the "best available" backend (which might change). This respects user choice while keeping the chosen backend type up-to-date.
* **Improved Initial Installation/Configuration:** For fresh installations or cases where the `version_backend` setting is invalid, the system now intelligently determines and installs the best available backend, then persists its type.
* **Refined Old Backend Cleanup:** The `removeOldBackends` function has been renamed to `removeOldBackend` and modified to specifically clean up *older versions of the currently selected backend type*, preventing the accumulation of unnecessary files while preserving other backend types the user might switch to.
* **Robust Local Storage Handling:** New private methods (`getStoredBackendType`, `setStoredBackendType`, `clearStoredBackendType`) are introduced to safely interact with `localStorage`, including error handling for potential `localStorage` access issues.
* **Version Filtering Utility:** A new utility `findLatestVersionForBackend` helps in identifying the latest available version for a specific backend type from a list of supported backends.

These changes provide a more stable, user-friendly, and maintainable backend management experience for the Llama.cpp extension.

Fixes: #5883

* fix: cortex models migration should be done once

* feat: Optimize Llama.cpp backend preference storage and UI updates

This commit refines the Llama.cpp extension's backend management by:

* **Optimizing `localStorage` Writes:** The system now only writes the backend type preference to `localStorage` if the new value is different from the currently stored one. This reduces unnecessary `localStorage` operations.
* **Ensuring UI Consistency on Initial Setup:** When a fresh installation or an invalid backend configuration is detected, the UI settings are now explicitly updated to reflect the newly determined `effectiveBackendString`, ensuring the displayed setting matches the active configuration.

These changes improve performance by reducing redundant storage operations and enhance user experience by maintaining UI synchronization with the backend state.

* Revert "fix: provider settings should be refreshed on page load (#5887)"

This reverts commit ce6af62c7df4a7e7ea8c0896f307309d6bf38771.

* fix: add loader version backend llamacpp

* fix: wrong key name

* fix: model setting issues

* fix: virtual dom hub

* chore: cleanup

* chore: hide device ofload setting

---------

Co-authored-by: Louis <louis@jan.ai>
Co-authored-by: Faisal Amir <urmauur@gmail.com>

2025-07-24 18:33:35 +07:00

.devcontainer

refactor: pin linuxdeploy in make/yarn build process instead of github workflow

2025-07-10 04:50:12 +00:00

.github

chore: update cua mac runner (#5888 )

2025-07-24 16:25:02 +07:00

.husky

chore: enhance onboarding screen's models (#4723 )

2025-02-25 09:36:55 +07:00

.vscode

test: init e2e test with selenium and CI work (#5591 )

2025-06-29 17:12:16 +07:00

autoqa

fix: app should refresh local provider models list on launch (#5868 )

2025-07-23 08:36:09 +07:00

core

fix: llama.cpp backend shows blank list sometime (#5876 )

2025-07-23 20:04:38 +07:00

docs

Update mcp.mdx (#5771 )

2025-07-20 15:20:53 +07:00

extensions

feat: Enhance Llama.cpp backend management with persistence (#5886 )

2025-07-24 18:33:35 +07:00

pre-install

chore: server download progress + S3 (#1925 )

2024-02-07 17:54:35 +07:00

scripts

test: migrate jest to vitest

2025-07-10 21:14:21 +07:00

specs

chore: janhq to menloresearch

2025-03-18 13:06:17 +07:00

src-tauri

feat: add support for querying available backend devices (#5877 )

2025-07-23 19:20:12 +05:30

web-app

feat: Enhance Llama.cpp backend management with persistence (#5886 )

2025-07-24 18:33:35 +07:00

.dockerignore

255: Cloud native

2023-10-30 23:20:10 +07:00

.gitignore

feat: add autoqa (#5779 )

2025-07-18 15:22:31 +07:00

.prettierignore

fix: model path backward compatible (#2018 )

2024-02-14 23:04:46 +07:00

.prettierrc

fix: model path backward compatible (#2018 )

2024-02-14 23:04:46 +07:00

.yarnrc.yml

chore: upgrade to turbo v2 and reduce ci quality gate runtime (#4324 )

2024-12-29 17:46:15 +07:00

ai.menlo.jan.desktop

chore: add flathub submission (#4391 )

2025-01-04 22:47:05 +07:00

ai.menlo.jan.metainfo.xml

chore: janhq to menloresearch

2025-03-18 13:06:17 +07:00

CONTRIBUTING.md

chore: janhq to menloresearch

2025-03-18 13:06:17 +07:00

demo.gif

docs: Update README.md (#1248 )

2023-12-29 11:30:16 +07:00

JanBanner.png

Add files via upload

2024-10-28 23:09:25 +07:00

LICENSE

chore: Jan's code is now under the Apache license (#5042 )

2025-05-20 22:18:59 +07:00

Makefile

Merge branch 'dev' into release/v0.6.6

2025-07-22 13:18:00 +07:00

mise.toml

chore: sync make build with dev (#5847 )

2025-07-22 11:12:14 +07:00

package.json

feat: Enhance Llama.cpp backend management with persistence (#5886 )

2025-07-24 18:33:35 +07:00

README.md

Improve dev experience by using mise & sccache (#5265 )

2025-06-21 12:49:32 +07:00

testRunner.js

test: add web helpers, services, utils tests (#3669 )

2024-09-20 14:24:51 +07:00

vitest.config.ts

test: add missing unit tests

2025-07-15 22:29:28 +07:00

yarn.lock

chore(deps): bump @radix-ui/react-hover-card from 1.1.11 to 1.1.14 (#5603 )

2025-07-20 15:20:18 +07:00

README.md

Jan - Local AI Assistant

GitHub commit activity Github Last Commit Github Contributors GitHub closed issues Discord

Getting Started - Docs - Changelog - Bug reports - Discord

Jan is a ChatGPT-alternative that runs 100% offline on your device. Our goal is to make it easy for a layperson to download and run LLMs and use AI with full control and privacy.

⚠️ Jan is in active development.

Installation

Because clicking a button is still the easiest way to get started:

Platform	Stable	Beta	Nightly
Windows	jan.exe	jan.exe	jan.exe
macOS	jan.dmg	jan.dmg	jan.dmg
Linux (deb)	jan.deb	jan.deb	jan.deb
Linux (AppImage)	jan.AppImage	jan.AppImage	jan.AppImage

Download from jan.ai or GitHub Releases.

Demo

Features

Local AI Models: Download and run LLMs (Llama, Gemma, Qwen, etc.) from HuggingFace
Cloud Integration: Connect to OpenAI, Anthropic, Mistral, Groq, and others
Custom Assistants: Create specialized AI assistants for your tasks
OpenAI-Compatible API: Local server at localhost:1337 for other applications
Model Context Protocol: MCP integration for enhanced capabilities
Privacy First: Everything runs locally when you want it to

Build from Source

For those who enjoy the scenic route:

Prerequisites

Node.js ≥ 20.0.0
Yarn ≥ 1.22.0
Make ≥ 3.81
Rust (for Tauri)

Run with Make

git clone https://github.com/menloresearch/jan
cd jan
make dev

This handles everything: installs dependencies, builds core components, and launches the app.

Available make targets:

make dev - Full development setup and launch
make build - Production build
make test - Run tests and linting
make clean - Delete everything and start fresh

Run with Mise (easier)

You can also run with mise, which is a bit easier as it ensures Node.js, Rust, and other dependency versions are automatically managed:

git clone https://github.com/menloresearch/jan
cd jan

# Install mise (if not already installed)
curl https://mise.run | sh

# Install tools and start development
mise install    # installs Node.js, Rust, and other tools
mise dev        # runs the full development setup

Available mise commands:

mise dev - Full development setup and launch
mise build - Production build
mise test - Run tests and linting
mise clean - Delete everything and start fresh
mise tasks - List all available tasks

Manual Commands

yarn install
yarn build:core
yarn build:extensions
yarn dev

System Requirements

Minimum specs for a decent experience:

macOS: 13.6+ (8GB RAM for 3B models, 16GB for 7B, 32GB for 13B)
Windows: 10+ with GPU support for NVIDIA/AMD/Intel Arc
Linux: Most distributions work, GPU acceleration available

For detailed compatibility, check our installation guides.

Troubleshooting

When things go sideways (they will):

Check our troubleshooting docs
Copy your error logs and system specs
Ask for help in our Discord #🆘|jan-help channel

We keep logs for 24 hours, so don't procrastinate on reporting issues.

Contributing

Contributions welcome. See CONTRIBUTING.md for the full spiel.

Contact

Bugs: GitHub Issues
Business: hello@jan.ai
Jobs: hr@jan.ai
General Discussion: Discord

Trust & Safety

Friendly reminder: We're not trying to scam you.

We won't ask for personal information
Jan is completely free (no premium version exists)
We don't have a cryptocurrency or ICO
We're bootstrapped and not seeking your investment (yet)

License

Apache 2.0 - Because sharing is caring.

Acknowledgements

Built on the shoulders of giants:

Languages

TypeScript 54.9%

JavaScript 34.1%

Rust 8.6%

Python 1.5%

Shell 0.4%

Other 0.5%

README.md

Jan - Local AI Assistant

Installation

Demo

Features

Build from Source

Prerequisites

Run with Make

Run with Mise (easier)

Manual Commands

System Requirements

Troubleshooting

Contributing

Links

Contact

Trust & Safety

License

Acknowledgements