12 Commits

Author SHA1 Message Date
vansangpfiev
dea29ff8c4
fix: Jan supports cortex with llama-server (#4904)
* fix: update logic to pull llama-cpp engine

* chore: bump cortex version

* fix: chat macos

* chore: bump llama.cpp

* chore: download windows

* chore: download linux

* chore: bump cortex to 1.0.13-rc6

* fix: stop cortex before exit

* chore: bump llama.cpp engine to b5350

---------

Co-authored-by: vansangpfiev <sang@jan.ai>
2025-05-15 17:11:18 +07:00
Louis
75d3dd2de0
fix: qwen3 - weird token output - reasoning content should not be in completion request (#4983)
* fix: qwen3 - weird token output - reasoning content should not be in completion request

* chore: bump engine version to llama.cpp b5219
2025-05-13 21:08:16 +07:00
Louis
9d64e9bef4
chore: merge release/v0.5.16 into dev (#4833)
* chore: bump cortex 1.0.11-rc10

* chore: bump to latest cortex release

* feat: Cortex API Authorization

* chore: correct CI CD repo name

* chore: correct new menloresearch repo name

* feat: rotate api token for each run (#4820)

* feat: rotate api token for each run

* chore: correct github repo url

* chore: correct github api url

* chore: should not filter out models first launch

* chore: bump cortex release

* chore: should get hardware information on launch (#4821)

* chore: should have an option to not revalidate hardware information

* chore: cortex.cpp gpu activation could cause a race condition (#4825)

* fix: jan beta logo displayed in jan release (#4828)

---------

Co-authored-by: David <davidpt.janai@gmail.com>
Co-authored-by: Nguyen Ngoc Minh <91668012+Minh141120@users.noreply.github.com>
2025-03-24 11:53:30 +07:00
Louis
c0f4517c34
chore: bump llama.cpp to support gemma3
2025-03-12 22:11:43 +08:00
Louis
455d320d35
fix: engine version update - cortex version bump - update tests (#4787)
2025-03-10 13:26:48 +07:00
Louis
b990b259bc
refactor: clean up legacy predefined models (#4730)
* refactor: clean up legacy predefined models

* chore: fix onboarding filtered models
2025-02-25 12:40:37 +07:00
Louis
badc632ee4
refactor: different Jan builds should have different Cortex server port (#4699)
* refactor: different Jan instances should have different Cortex server port configurations

* chore: update workflow to use env input

* chore: update env for cortex port setting
2025-02-20 15:53:16 +07:00
Louis
c32c9e1dcb
chore: add DeepSeek R1 Distill models to Hub (#4586)
2025-02-05 23:10:40 +07:00
Louis
144b836935
fix: app does not reload model when prompt template is updated (#4537)
* fix: app does not reload model when prompt template is updated

* chore: lint fix
2025-01-29 21:50:17 +07:00
Louis
a4649fa9cc
chore: bump engine version to 0.1.46
2025-01-16 15:26:15 +07:00
Louis
a3077ea4a5
chore: bump engine version to 0.1.43
2025-01-13 09:15:45 +07:00
Louis
b8bd846744
chore: refactor extensions build script for optimization
2024-12-30 22:10:48 +07:00