Ashu
|
d84c869b14
|
Add opus-latest and haiku 3.5.latest models
|
2024-11-08 09:24:44 +05:30 |
|
Ashu
|
a1ce164e7b
|
Merge branch 'dev' into patch-1
|
2024-11-08 09:12:23 +05:30 |
|
Sharun
|
091dd5af70
|
Merge branch 'dev' into more-groq-models
|
2024-11-07 12:24:11 -06:00 |
|
Louis
|
0847b32e87
|
fix: an edge case when start a model with relative model path
|
2024-11-07 14:29:27 +07:00 |
|
Louis
|
a773e169fc
|
fix: an edge case where auto import does not work with relative model file path
|
2024-11-07 14:26:41 +07:00 |
|
Louis
|
2e9b7fdad2
|
chore: add import name for legacy models
|
2024-11-07 13:34:09 +07:00 |
|
Louis
|
40019892b8
|
chore: correct name of bin subfolders to move dll properly
|
2024-11-07 13:01:34 +07:00 |
|
Louis
|
ced44973b8
|
chore: queue server start and model load
|
2024-11-07 12:06:46 +07:00 |
|
Louis
|
e41bcffcef
|
fix: export PATH env to engine destination folder to have additional dlls scoped
|
2024-11-07 10:10:05 +07:00 |
|
Louis
|
264720c71a
|
chore: support customized OpenAI model.json
|
2024-11-06 16:46:27 +07:00 |
|
Louis
|
56e35df84d
|
chore: clean dangling process on exit and relaunch
|
2024-11-06 13:34:11 +07:00 |
|
Louis
|
46d5faf59f
|
chore: new cortex-cpp binary - model import option and model size
|
2024-11-04 20:36:04 +07:00 |
|
Louis
|
a986c6de2d
|
chore: decide model name on pull and import
|
2024-11-04 15:37:20 +07:00 |
|
Louis
|
5ddbf5fb34
|
fix: unlink the entire model folder on delete
|
2024-11-04 15:37:20 +07:00 |
|
Louis
|
1ab02b706f
|
fix: model import symlink
|
2024-11-04 15:37:19 +07:00 |
|
Louis
|
61f72e6775
|
chore: bump cortex-cpp v1.0.2-rc1
|
2024-11-04 15:37:19 +07:00 |
|
Louis
|
e5f5d887e3
|
fix: persists model.json on download (legacy models)
|
2024-11-04 15:37:19 +07:00 |
|
Louis
|
a466bbca38
|
chore: update legacy tensorrt-llm download and run
|
2024-11-04 15:37:19 +07:00 |
|
Louis
|
2c11caf87e
|
chore: shared cuda dependencies
|
2024-11-04 15:37:18 +07:00 |
|
Louis
|
3643c8866e
|
fix: correct model settings on startup and strip down irrelevant model parameters
|
2024-11-04 15:37:18 +07:00 |
|
Louis
|
8f778ee90f
|
feat: app supports cortex.cpp model downloader and legacy downloader - maintain legacy JSON models
|
2024-11-04 15:37:18 +07:00 |
|
Louis
|
5f075c8554
|
fix: prebundle cudart and cublas
|
2024-11-04 15:37:18 +07:00 |
|
Louis
|
dc87f37a9b
|
fix: package cortex.cpp engines and cuda on windows
|
2024-11-04 15:37:17 +07:00 |
|
Louis
|
a0e2f16a3b
|
chore: binary naming convention - following llama.cpp release
|
2024-11-04 15:37:17 +07:00 |
|
Louis
|
03333cc4c2
|
fix: onboarding should cover cortex models - debounce reduce model reload - rename cortex binary name
|
2024-11-04 15:37:17 +07:00 |
|
Louis
|
40957f7686
|
fix: model reload state - reduce model unload events emit
|
2024-11-04 15:37:15 +07:00 |
|
Louis
|
523c745150
|
chore: try catch legacy assistant creation
|
2024-11-04 15:37:15 +07:00 |
|
Louis
|
716fd96d56
|
test: add tests for migration strategy
|
2024-11-04 15:37:15 +07:00 |
|
Louis
|
5edf121d96
|
test: add tests to legacy model-json utilities
|
2024-11-04 15:37:15 +07:00 |
|
Louis
|
895c3d4246
|
fix: tests - useModels with remote models filter
|
2024-11-04 15:37:15 +07:00 |
|
Louis
|
ba59425e6a
|
fix: tests
|
2024-11-04 15:37:14 +07:00 |
|
Louis
|
03e15fb70f
|
feat: sync model hub and download progress from cortex.cpp
|
2024-11-04 15:37:14 +07:00 |
|
Louis
|
f44f291bd8
|
chore: download progress finished should reload model list
|
2024-11-04 15:37:13 +07:00 |
|
Louis
|
4080dc4b65
|
feat: model and cortex extensions update
|
2024-11-04 15:37:12 +07:00 |
|
Sharun
|
10d4b3f4e0
|
Merge branch 'dev' into more-groq-models
|
2024-11-03 18:59:03 -06:00 |
|
Faisal Amir
|
b37d4a5c7e
|
fix: types issue (#internalTypeOnlyBrand) in the @types/node package (#3921)
|
2024-10-31 21:17:25 +07:00 |
|
Ashu
|
f7d318d20c
|
Merge branch 'dev' into patch-1
|
2024-10-30 19:35:44 +05:30 |
|
Faisal Amir
|
267f3ab051
|
fix: deprecated gpt with vision (#3912)
* fix: deprecated gpt 4 with vision
* chore: update package version inference openai extension
|
2024-10-30 17:42:17 +07:00 |
|
Ashu
|
777b0d3036
|
Add claude 3.5 sonnet 20241022
|
2024-10-27 07:05:55 +05:30 |
|
Sharun
|
f7ce83aba4
|
Merge branch 'dev' into more-groq-models
|
2024-10-23 20:17:28 +00:00 |
|
Louis
|
53098699ef
|
Merge pull request #3857 from Haleshot/haleshot/martian-api-hyperlink-fix
Update broken/outdated hyperlink
|
2024-10-23 10:08:28 +07:00 |
|
Faisal Amir
|
b14f54e866
|
fix: inconsistent state of downloading multimodal (#3862)
|
2024-10-22 15:44:13 +07:00 |
|
Srihari Thyagarajan
|
4c562c3e12
|
Update broken/outdated hyperlink
|
2024-10-21 23:53:02 +05:30 |
|
Louis
|
4983247918
|
fix: correct eos token of llava models
|
2024-10-21 12:58:18 +07:00 |
|
Sharun
|
72d178f3c3
|
update max_tokens for llama-3.1-8b-instant
|
2024-10-19 19:43:33 -04:00 |
|
Sharun
|
331e2bd35c
|
remove distil-whisper-large-v3-en as it does not support chat completions
|
2024-10-19 19:39:03 -04:00 |
|
Sharun
|
44878d6103
|
update max_tokens for llama-3.1-70b-versatile and fix typo
|
2024-10-19 19:36:04 -04:00 |
|
Sharun
|
ff46a1b009
|
add tags to groq/distil-whisper-large-v3-en
|
2024-10-19 19:24:16 -04:00 |
|
Sharun
|
4caa2a5322
|
feat: add more Groq models
|
2024-10-19 01:05:45 -04:00 |
|
Louis
|
024992264f
|
fix: error handling for model imports should be handled gracefully
|
2024-10-03 19:44:52 +07:00 |
|