Louis
c5aac9270c
Update extensions/inference-cortex-extension/src/index.ts
...
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
2025-06-26 19:36:07 +07:00
Louis
d75db73a43
fix: handle context shift setting from old app versions
2025-06-26 19:32:34 +07:00
Louis
86079074d2
🐛 fix: update default extension settings ( #5315 )
...
* fix: update default extension settings
* chore: hide language setting on Prod
2025-06-17 17:37:31 +07:00
Louis
2899d58ad7
🐛 fix: llama.cpp default NGL setting does not offload all layers to GPU ( #5310 )
...
* 🐛 fix: llama.cpp default NGL setting does not offload all layers to GPU
* chore: cover more cases
* chore: clean up
* fix: should not show GPU section on Mac
2025-06-17 15:30:07 +07:00
Louis
2634659366
🐛 fix: default onboarding model should use recommended quantizations ( #5273 )
...
* 🐛 fix: default onboarding model should use recommended quantizations
* ✨ enhancement: show context shift option in provider settings
* 🔧 chore: wording
2025-06-15 02:34:14 +07:00
Louis
1e17cc6ec7
✨ enhancement: model run improvement ( #5268 )
...
* fix: mcp tool error handling
* fix: error message
* fix: trigger download from recommend model
* fix: can't scroll hub
* fix: show progress
* ✨ enhancement: prompt users to increase context size
* ✨ enhancement: rearrange action buttons for a better UX
* 🔧 chore: clean up logics
---------
Co-authored-by: Faisal Amir <urmauur@gmail.com>
2025-06-14 16:32:15 +07:00
Louis
27c4918395
fix: default settings should leave empty ( #5257 )
...
* fix: default settings should leave empty
* fix: default settings
* fix: remove some more default settings
* fix: threads and cont
* fix: data
* fix: default setting
* fix: settings
* chore: bump cortex version
* chore: bump to cortex 1.0.14
* chore: clean up
* typo
* chore: fix dialog hang
* fix: default parameter
* chore: truncate edit model title
* chore: update default provider settings
* chore: fix typo
---------
Co-authored-by: Faisal Amir <urmauur@gmail.com>
2025-06-13 00:01:25 +07:00
Louis
51a321219d
chore: fix model settings are not applied accordingly on change ( #5231 )
...
* chore: fix model settings are not applied accordingly on change
* chore: handle failed tool call
* chore: stop inference and model on reject
2025-06-10 16:26:42 +07:00
Louis
6d6790d5e0
fix: clean up cortex processes ( #5215 )
...
* fix: clean up cortex processes
* chore: clean up
* chore: does not need output shell
2025-06-09 22:23:07 +07:00
Louis
b85bce8e09
fix: migrate provider settings ( #5205 )
...
* chore: fix migrate api key
* chore: add logs
* chore: deprecate migration step
* chore: clean up
* fix: key apply
2025-06-05 20:25:43 +07:00
Louis
6faca3e732
refactor: remove JS server package ( #5192 )
...
* refactor: remove js server package
* chore: migrate HF token data
2025-06-04 15:33:35 +07:00
Louis
7dc51c5e0f
fix: relocate jan data folder ( #5179 )
...
* fix: relocate jan data folder failed
* fix: avoid infinite recursion
* chore: kill background processes to unblock factory reset
* chore: stop models before reset factory
* chore: clean up
* chore: clean up
* fix: show error
* chore: get active models should not have retry
2025-06-03 21:23:42 +07:00
Louis
ecef9d7df6
feat: handle open Jan on HF GGUF repo ( #5173 )
...
* feat: handle open Jan on HF GGUF repo
* chore: reset retry attempts
2025-06-03 01:09:36 +07:00
Louis
1a0f643d87
refactor: clean up repo ( #5165 )
...
* chore: remove legacy themes
* refactor: clean up dependencies
* chore: remove cuda 11 dependency - fix linux LD_LIBRARY_PATH
* fix: load models issue on Linux
# Conflicts:
# src-tauri/src/core/setup.rs
* chore: do not download cuda 11 by default
* chore: remove cuda 11 from installer
* fix: cuda lookup on Linux
2025-06-02 19:03:06 +07:00
Louis
a1111033d9
chore: allow users to set model offload ( #5134 )
...
* chore: allow users to set model offload
* chore: apply model.yaml configurations to default model settings
* chore: fallback default value
2025-05-29 13:29:32 +07:00
Louis
942f2f51b7
chore: send chat completion with messages history ( #5070 )
...
* chore: send chat completion with messages history
* chore: handle abort controllers
* chore: change max attempts setting
* chore: handle stop running models in system monitor screen
* Update web-app/src/services/models.ts
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
* chore: format time
* chore: handle stop model load action
---------
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
2025-05-22 20:13:50 +07:00
Louis
570bb8290f
chore: add model information in System Monitoring ( #5062 )
...
* chore: add model information in System Monitoring
* chore: handle empty models case
* chore: fix type
2025-05-22 16:07:08 +07:00
Louis
0627f29059
chore: enable / disable proxy configurations ( #5050 )
...
* chore: enable / disable proxy configurations
* Update web-app/src/routes/settings/https-proxy.tsx
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update web-app/src/lib/completion.ts
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
2025-05-21 14:18:25 +07:00
Louis
d5393e4563
feat: add custom OpenAI provider ( #5033 )
...
* feat: add custom OpenAI provider
* chore: add HF token setting
* chore: move HF token setting to llama.cpp provider - later deprecate model extension
2025-05-20 14:30:51 +07:00
Louis
f6433544af
feat: handle stop streaming message, scroll to bottom and model loads ( #5023 )
2025-05-19 23:32:55 +07:00
Louis
e9f37e98d1
refactor: remove hardcoded provider names ( #4995 )
...
* refactor: remove hardcoded provider names
* chore: continue the replacement
2025-05-15 22:10:43 +07:00
Louis
bf3f22c854
fix: should not spawn many llama.cpp servers for the same model ( #4994 )
...
* fix: should not spawn many llama.cpp servers for the same model
* chore: test step placeholder for the new revamp
* chore: coverage check should not fail pipeline
2025-05-15 21:27:41 +07:00
Louis
68d7dec23b
chore: maintain electron build
2025-05-15 17:10:01 +07:00
Louis
f0487bea8a
refactor: setup extensions, engines and sidecar
2025-05-15 17:09:59 +07:00
Louis
27beb46801
feat: tauri toolkit
2025-05-15 17:09:58 +07:00
Louis
9d64e9bef4
chore: merge release/v0.5.16 into dev ( #4833 )
...
* chore: bump cortex 1.0.11-rc10
* chore: bump to latest cortex release
* feat: Cortex API Authorization
* chore: correct CI CD repo name
* chore: correct new menloresearch repo name
* feat: rotate api token for each run (#4820 )
* feat: rotate api token for each run
* chore: correct github repo url
* chore: correct github api url
* chore: should not filter out models first launch
* chore: bump cortex release
* chore: should get hardware information on launch (#4821 )
* chore: should have an option to not revalidate hardware information
* chore: cortex.cpp gpu activation could cause a race condition (#4825 )
* fix: jan beta logo displayed in jan release (#4828 )
---------
Co-authored-by: David <davidpt.janai@gmail.com>
Co-authored-by: Nguyen Ngoc Minh <91668012+Minh141120@users.noreply.github.com>
2025-03-24 11:53:30 +07:00
Louis
455d320d35
fix: engine version update - cortex version bump - update tests ( #4787 )
2025-03-10 13:26:48 +07:00
Louis
4f7a8d58ca
refactor: clean up legacy vision model settings ( #4777 )
2025-03-06 09:53:31 +07:00
Louis
b990b259bc
refactor: clean up legacy predefined models ( #4730 )
...
* refactor: clean up legacy predefined models
* chore: fix onboarding filtered models
2025-02-25 12:40:37 +07:00
Faisal Amir
daa7c0ca21
feat: better hardware setting ( #4471 )
...
* feat: better hardware setting
* chore: update layout
* feat: better hardware setting
* chore: fix title section
* chore: added hardware engine management
* chore: integrate gpus and enable set gpu activate
* chore: update calculate ram and vram
* chore: update calculate vram and ram used
* fix: set active gpus
* chore: fix progress bar spacing
* chore: always update cache vram gpu
* chore: update cpu usage percentage
* chore: fix type usage cpu
* chore: update ram cpus usage getsystemmonitor from new api hardware engine management system
* test: update test case data using hardware management extension
* chore: resolve conflict lock json
* chore: cleanup app services
* chore: update type OperationSystemInfo
* chore: update app service
* chore: show list gpus on system monitor
* chore: remove monitoring extension
* chore: update test case app service
* chore: remove unused hooks useGpusSetting
* chore: remove monitor from source index
* chore: fix test core
* chore: update gpu and cpu info on engine management ext
* chore: fix app service test
* chore: update test appService include cpu info
* chore: filter gpus show or hide on system monitor based activated gpu
* chore: remove unused run_mode
* chore: remove tensort
* chore: update check gpu run_mode
* chore: handle undefined gpus
* chore: cleanup PR
* chore: cleanup process node error
* chore: fix type
2025-02-03 22:01:08 +07:00
Faisal Amir
2a0601f75a
feat: remote engine management ( #4364 )
...
* feat: remote engine management
* chore: fix linter issue
* chore: remove unused imports
* fix: populate engines, models and legacy settings (#4403 )
* fix: populate engines, models and legacy settings
* chore: legacy logics update configured remote engine
* fix: check exist path before reading
* fix: engines and models persist - race condition
* chore: update issue state
* test: update test cases
* chore: bring back Cortex extension settings
* chore: setup button gear / plus based apikey
* chore: fix remote engine from welcome screen
* chore: resolve linter issue
* chore: support request headers template
* chore: update engines using header_template instead of api_key_template
* chore: update models on changes
* fix: anthropic response template
* chore: fix welcome screen and debounce update value input
* chore: update engines list on changes
* chore: update engines list on change
* chore: update desc form add modal remote engines
* chore: bump cortex version to latest RC
* chore: fix linter
* fix: transform payload of Anthropic and OpenAI
* fix: typo
* fix: openrouter model id for auto routing
* chore: remove remote engine URL setting
* chore: add cohere engine and model support
* fix: should not clean on app launch - models list display issue
* fix: local engine check logic
* chore: bump app version to latest release 0.5.13
* test: fix failed tests
---------
Co-authored-by: Louis <louis@jan.ai>
2025-01-14 17:29:56 +07:00
Louis
b4d2176280
Merge pull request #4408 from janhq/main
...
Sync release 0.5.13 into dev
2025-01-06 21:01:49 +07:00
Louis
27e40c35d8
chore: add cpu_threads settings in cortex extension
2025-01-02 13:03:16 +07:00
Faisal Amir
a6a0cb325b
feat: local engine management ( #4334 )
...
* feat: local engine management
* chore: move remote engine into engine page instead extension page
* chore: set default engine from extension
* chore: update endpoint update engine
* chore: update event onEngineUpdate
* chore: filter out engine download
* chore: update version env
* chore: select default engine variant base on user device specs
* chore: symlink engine variants
* chore: rolldown.config in mjs format
* chore: binary codesign
* fix: download state in footer bar and variant status
* chore: update yarn.lock
* fix: rimraf failure
* fix: setup-node@v3 for built-in cache
* fix: cov pipeline
* fix: build syntax
* chore: fix build step
* fix: create engines folder on launch
* chore: update ui delete engine variant with modal confirmation
* chore: fix linter
* chore: add installing progress for Local Engine download
* chore: wording
---------
Co-authored-by: Louis <louis@jan.ai>
2024-12-30 17:27:51 +07:00
Louis
fb0a916c50
fix: 4155 - Queue up health check on server restarting
2024-12-03 16:17:46 +07:00
Louis
541a37b8f1
fix: 4171 - Model loading gets stuck on stop
2024-12-02 14:04:13 +07:00
Louis
ad8484533d
Merge pull request #4132 from janhq/fix/4094-gpu-mode-not-working-after-first-factory-reset
...
fix: 4094 - gpu mode toggle on by default but not affect for the first time launch
2024-11-26 23:46:06 +07:00
Louis
3854d43e8a
fix: should queue engine set on changes
2024-11-26 22:58:30 +07:00
Louis
2478cf7bc3
Merge pull request #4130 from janhq/chore/4123-update-default-engine-request-body
...
feat: update Jan’s client request to adapt API changes from Cortex
2024-11-26 21:38:14 +07:00
Louis
d55b1dd2f8
fix: 4094 - gpu mode toggle on by default but not affect for the first time launch
2024-11-26 19:50:23 +07:00
Louis
638dbf178e
fix: 4012 - active model should be gone as soon as cortex.cpp server is killed
2024-11-26 19:03:26 +07:00
Louis
1108e6c221
test: correct test cases
2024-11-26 15:25:26 +07:00
Louis
f709c7723f
fix: factory reset hang on wiping data
2024-11-21 23:10:48 +07:00
Louis
33bfd0eed8
fix: disable timeout on model load
2024-11-20 20:39:43 +07:00
Louis
239e5f52e8
fix: force vision models to run with legacy settings
2024-11-20 12:43:10 +07:00
Louis
10e1201083
Merge pull request #4049 from janhq/fix/openai-o1-model-parameters
...
fix: correct OpenAI o1 model parameters
2024-11-20 01:09:58 +07:00
Louis
0b3847a8b1
chore: ensure server is started before letting other requests go thru
2024-11-19 23:45:08 +07:00
Louis
af20587846
fix: handle symlink model import failure - fallback to legacy model run
2024-11-19 22:39:35 +07:00
Louis
bd850fb357
chore: reduce destroy attempts
2024-11-19 13:27:49 +07:00
Louis
363008d37f
fix: model import edge cases
2024-11-19 11:44:19 +07:00