Go to file

Akarshan Biswas 1f1605bdf9

feat: Add support for overriding tensor buffer type (#6062 )

* feat: Add support for overriding tensor buffer type

This commit introduces a new configuration option, `override_tensor_buffer_t`, which allows users to specify a regex for matching tensor names to override their buffer type. This is an advanced setting primarily useful for optimizing the performance of large models, particularly Mixture of Experts (MoE) models.

By overriding the tensor buffer type, users can keep critical parts of the model, like the attention layers, on the GPU while offloading other parts, such as the expert feed-forward networks, to the CPU. This can lead to significant speed improvements for massive models.

Additionally, this change refines the error message to be more specific when a model fails to load. The previous message "Failed to load llama-server" has been updated to "Failed to load model" to be more accurate.

* chore: update FE to suppoer override-tensor

---------

Co-authored-by: Faisal Amir <urmauur@gmail.com>

2025-08-07 10:31:34 +05:30

.devcontainer

refactor: pin linuxdeploy in make/yarn build process instead of github workflow

2025-07-10 04:50:12 +00:00

.github

ci: deprecate jan docs new release workflow in favor of jan-docs

2025-08-07 00:04:21 +07:00

.husky

chore: enhance onboarding screen's models (#4723 )

2025-02-25 09:36:55 +07:00

autoqa

fix: app should refresh local provider models list on launch (#5868 )

2025-07-23 08:36:09 +07:00

core

✨feat: recommended label llamacpp setting (#6052 )

2025-08-06 12:24:21 +10:00

docs

Add gpt-oss local installation blog post (#6075 )

2025-08-07 09:48:05 +07:00

extensions

feat: Add support for overriding tensor buffer type (#6062 )

2025-08-07 10:31:34 +05:30

pre-install

chore: server download progress + S3 (#1925 )

2024-02-07 17:54:35 +07:00

scripts

test: migrate jest to vitest

2025-07-10 21:14:21 +07:00

specs

chore: janhq to menloresearch

2025-03-18 13:06:17 +07:00

src-tauri

chore: add deep_link register_all

2025-08-06 12:24:21 +10:00

web-app

feat: Add support for overriding tensor buffer type (#6062 )

2025-08-07 10:31:34 +05:30

website

fixed components in troubleshooting tab

2025-08-06 12:49:01 +10:00

.dockerignore

255: Cloud native

2023-10-30 23:20:10 +07:00

.gitignore

feat(docs): Migrate to dual Nextra/Astro deployment & recreate products section

2025-07-31 18:52:00 +10:00

.prettierignore

fix: model path backward compatible (#2018 )

2024-02-14 23:04:46 +07:00

.prettierrc

fix: model path backward compatible (#2018 )

2024-02-14 23:04:46 +07:00

.yarnrc.yml

chore: upgrade to turbo v2 and reduce ci quality gate runtime (#4324 )

2024-12-29 17:46:15 +07:00

ai.menlo.jan.desktop

chore: add flathub submission (#4391 )

2025-01-04 22:47:05 +07:00

ai.menlo.jan.metainfo.xml

chore: janhq to menloresearch

2025-03-18 13:06:17 +07:00

CONTRIBUTING.md

chore: janhq to menloresearch

2025-03-18 13:06:17 +07:00

demo.gif

docs: Update README.md (#1248 )

2023-12-29 11:30:16 +07:00

JanBanner.png

Add files via upload

2024-10-28 23:09:25 +07:00

LICENSE

chore: sync dev to release/v0.5.18 (#5106 )

2025-05-26 16:02:30 +07:00

Makefile

Merge branch 'dev' into release/v0.6.6

2025-07-22 13:18:00 +07:00

mise.toml

chore: sync make build with dev (#5847 )

2025-07-22 11:12:14 +07:00

package.json

fix: run dev should reinstall extensions

2025-08-06 12:24:21 +10:00

README.md

Update README.md

2025-08-06 00:24:10 +10:00

testRunner.js

test: add web helpers, services, utils tests (#3669 )

2024-09-20 14:24:51 +07:00

vitest.config.ts

test: add missing unit tests

2025-07-15 22:29:28 +07:00

yarn.lock

chore(deps): bump @radix-ui/react-hover-card from 1.1.11 to 1.1.14 (#5603 )

2025-07-20 15:20:18 +07:00

README.md

Jan - Local AI Assistant

GitHub commit activity Github Last Commit Github Contributors GitHub closed issues Discord

Getting Started - Docs - Changelog - Bug reports - Discord

Jan is an AI assistant that can run 100% offline on your device. Download and run LLMs with full control and privacy.

Installation

The easiest way to get started is by downloading one of the following versions for your respective operating system:

Platform	Stable	Nightly
Windows	jan.exe	jan.exe
macOS	jan.dmg	jan.dmg
Linux (deb)	jan.deb	jan.deb
Linux (AppImage)	jan.AppImage	jan.AppImage

Download from jan.ai or GitHub Releases.

Features

Local AI Models: Download and run LLMs (Llama, Gemma, Qwen, etc.) from HuggingFace
Cloud Integration: Connect to OpenAI, Anthropic, Mistral, Groq, and others
Custom Assistants: Create specialized AI assistants for your tasks
OpenAI-Compatible API: Local server at localhost:1337 for other applications
Model Context Protocol: MCP integration for enhanced capabilities
Privacy First: Everything runs locally when you want it to

Build from Source

For those who enjoy the scenic route:

Prerequisites

Node.js ≥ 20.0.0
Yarn ≥ 1.22.0
Make ≥ 3.81
Rust (for Tauri)

Run with Make

git clone https://github.com/menloresearch/jan
cd jan
make dev

This handles everything: installs dependencies, builds core components, and launches the app.

Available make targets:

make dev - Full development setup and launch
make build - Production build
make test - Run tests and linting
make clean - Delete everything and start fresh

Run with Mise (easier)

You can also run with mise, which is a bit easier as it ensures Node.js, Rust, and other dependency versions are automatically managed:

git clone https://github.com/menloresearch/jan
cd jan

# Install mise (if not already installed)
curl https://mise.run | sh

# Install tools and start development
mise install    # installs Node.js, Rust, and other tools
mise dev        # runs the full development setup

Available mise commands:

mise dev - Full development setup and launch
mise build - Production build
mise test - Run tests and linting
mise clean - Delete everything and start fresh
mise tasks - List all available tasks

Manual Commands

yarn install
yarn build:core
yarn build:extensions
yarn dev

System Requirements

Minimum specs for a decent experience:

macOS: 13.6+ (8GB RAM for 3B models, 16GB for 7B, 32GB for 13B)
Windows: 10+ with GPU support for NVIDIA/AMD/Intel Arc
Linux: Most distributions work, GPU acceleration available

For detailed compatibility, check our installation guides.

Troubleshooting

If things go sideways:

Check our troubleshooting docs
Copy your error logs and system specs
Ask for help in our Discord #🆘|jan-help channel

Contributing

Contributions welcome. See CONTRIBUTING.md for the full spiel.

Contact

Bugs: GitHub Issues
Business: hello@jan.ai
Jobs: hr@jan.ai
General Discussion: Discord

License

Apache 2.0 - Because sharing is caring.

Acknowledgements

Built on the shoulders of giants:

Languages

TypeScript 54.9%

JavaScript 34.1%

Rust 8.6%

Python 1.5%

Shell 0.4%

Other 0.5%

README.md

Jan - Local AI Assistant

Installation

Features

Build from Source

Prerequisites

Run with Make

Run with Mise (easier)

Manual Commands

System Requirements

Troubleshooting

Contributing

Links

Contact

License

Acknowledgements