144 Commits

Author SHA1 Message Date
Louis
ef3c090360
fix: Phi 3 Mini Instruct model download url replacement 2024-11-21 16:17:27 +07:00
Louis
daefa99a9d chore: keep the deps folder with gitkeep placeholder file 2024-11-21 11:26:21 +07:00
Louis
8590709480 fix: correct download script to relocate dll files properly 2024-11-21 10:54:44 +07:00
Louis
06df084342
fix: temporary create a deps folder to have cortex.cpp work 2024-11-21 09:16:15 +07:00
Louis
33bfd0eed8
fix: disable timeout on model load 2024-11-20 20:39:43 +07:00
Louis
239e5f52e8
fix: force vision models to run with legacy settings 2024-11-20 12:43:10 +07:00
Louis
3ea2d9c0ae
fix: bump cortex.cpp to latest version - recursively scanning models folder should also include remote models 2024-11-20 12:33:42 +07:00
Louis
10e1201083
Merge pull request #4049 from janhq/fix/openai-o1-model-parameters
fix: correct OpenAI o1 model parameters
2024-11-20 01:09:58 +07:00
Louis
0b3847a8b1 chore: ensure server is started before letting other requests go thru 2024-11-19 23:45:08 +07:00
Louis
af20587846
fix: handle symlink model import failure - fallback to legacy model run 2024-11-19 22:39:35 +07:00
Louis
0f05910f63
chore: bump cortex.cpp 1.0.3-rc4 2024-11-19 19:48:57 +07:00
Louis
bd850fb357
chore: reduce destroy attempts 2024-11-19 13:27:49 +07:00
Louis
363008d37f
fix: model import edge cases 2024-11-19 11:44:19 +07:00
Louis
7a084b6589
Merge pull request #4040 from janhq/fix/rerender-component
chore: refactor rerender components
2024-11-18 20:55:22 +07:00
Louis
e9fd7f4554
fix: models load 2024-11-18 18:22:09 +07:00
Louis
f75dc662ee chore: reduce app launch time 2024-11-18 15:58:15 +07:00
Louis
5243e4a095 fix: correct cortex repo url 2024-11-18 14:22:10 +07:00
Louis
6e9c34baf7
chore: cortex.cpp version bump 2024-11-16 16:23:06 +07:00
Louis
6f066357ed
chore: bump new engine version 0.1.39 and get rid of dangling process 2024-11-16 16:06:37 +07:00
Louis
a38715f18a fix: should queue health check and default engine set before starting model 2024-11-14 23:51:51 +07:00
Louis
a15d92dbdc
feat: integrates cortex.cpp engine variants 2024-11-14 21:09:44 +07:00
Louis
8053ff9eb8
fix: attempt to establish the WebSocket connection on a connection issue 2024-11-13 15:14:14 +07:00
Louis
1bd58e1327 fix: shift model events subscription to cortex 2024-11-12 15:46:14 +07:00
Louis
cd4f72dd31
chore: add qwen2.5-coder 14B and 32B models 2024-11-12 13:46:13 +07:00
Louis
f3802bff04
chore: bump to latest release 2024-11-08 17:01:29 +07:00
Louis
0847b32e87
fix: an edge case when start a model with relative model path 2024-11-07 14:29:27 +07:00
Louis
40019892b8
chore: correct name of bin subfolders to move dll properly 2024-11-07 13:01:34 +07:00
Louis
ced44973b8 chore: queue server start and model load 2024-11-07 12:06:46 +07:00
Louis
e41bcffcef
fix: export PATH env to engine destination folder to have additional dlls scoped 2024-11-07 10:10:05 +07:00
Louis
56e35df84d
chore: clean dangling process on exit and relaunch 2024-11-06 13:34:11 +07:00
Louis
46d5faf59f
chore: new cortex-cpp binary - model import option and model size 2024-11-04 20:36:04 +07:00
Louis
a986c6de2d
chore: decide model name on pull and import 2024-11-04 15:37:20 +07:00
Louis
1ab02b706f
fix: model import symlink 2024-11-04 15:37:19 +07:00
Louis
61f72e6775
chore: bump cortex-cpp v1.0.2-rc1 2024-11-04 15:37:19 +07:00
Louis
a466bbca38
chore: update legacy tensorrt-llm download and run 2024-11-04 15:37:19 +07:00
Louis
2c11caf87e
chore: shared cuda dependencies 2024-11-04 15:37:18 +07:00
Louis
3643c8866e
fix: correct model settings on startup and strip down irrelevant model parameters 2024-11-04 15:37:18 +07:00
Louis
5f075c8554
fix: prebundle cudart and cublas 2024-11-04 15:37:18 +07:00
Louis
dc87f37a9b
fix: package cortex.cpp engines and cuda on windows 2024-11-04 15:37:17 +07:00
Louis
a0e2f16a3b
chore: binary naming convention - following llama.cpp release 2024-11-04 15:37:17 +07:00
Louis
03333cc4c2
fix: onboarding should cover cortex models - debounce reduce model reload - rename cortex binary name 2024-11-04 15:37:17 +07:00
Louis
40957f7686
fix: model reload state - reduce model unload events emit 2024-11-04 15:37:15 +07:00
Louis
03e15fb70f
feat: sync model hub and download progress from cortex.cpp 2024-11-04 15:37:14 +07:00
Louis
4080dc4b65
feat: model and cortex extensions update 2024-11-04 15:37:12 +07:00