499 Commits

Author SHA1 Message Date
Ashu
d84c869b14
Add opus-latest and haiku 3.5.latest models 2024-11-08 09:24:44 +05:30
Ashu
a1ce164e7b
Merge branch 'dev' into patch-1 2024-11-08 09:12:23 +05:30
Sharun
091dd5af70
Merge branch 'dev' into more-groq-models 2024-11-07 12:24:11 -06:00
Louis
0847b32e87
fix: an edge case when start a model with relative model path 2024-11-07 14:29:27 +07:00
Louis
a773e169fc
fix: an edge case where auto import does not work with relative model file path 2024-11-07 14:26:41 +07:00
Louis
2e9b7fdad2
chore: add import name for legacy models 2024-11-07 13:34:09 +07:00
Louis
40019892b8
chore: correct name of bin subfolders to move dll properly 2024-11-07 13:01:34 +07:00
Louis
ced44973b8
chore: queue server start and model load 2024-11-07 12:06:46 +07:00
Louis
e41bcffcef
fix: export PATH env to engine destination folder to have additional dlls scoped 2024-11-07 10:10:05 +07:00
Louis
264720c71a
chore: support customized OpenAI model.json 2024-11-06 16:46:27 +07:00
Louis
56e35df84d
chore: clean dangling process on exit and relaunch 2024-11-06 13:34:11 +07:00
Louis
46d5faf59f
chore: new cortex-cpp binary - model import option and model size 2024-11-04 20:36:04 +07:00
Louis
a986c6de2d
chore: decide model name on pull and import 2024-11-04 15:37:20 +07:00
Louis
5ddbf5fb34
fix: unlink the entire model folder on delete 2024-11-04 15:37:20 +07:00
Louis
1ab02b706f
fix: model import symlink 2024-11-04 15:37:19 +07:00
Louis
61f72e6775
chore: bump cortex-cpp v1.0.2-rc1 2024-11-04 15:37:19 +07:00
Louis
e5f5d887e3
fix: persists model.json on download (legacy models) 2024-11-04 15:37:19 +07:00
Louis
a466bbca38
chore: update legacy tensorrt-llm download and run 2024-11-04 15:37:19 +07:00
Louis
2c11caf87e
chore: shared cuda dependencies 2024-11-04 15:37:18 +07:00
Louis
3643c8866e
fix: correct model settings on startup and strip down irrelevant model parameters 2024-11-04 15:37:18 +07:00
Louis
8f778ee90f
feat: app supports cortex.cpp model downloader and legacy downloader - maintain legacy JSON models 2024-11-04 15:37:18 +07:00
Louis
5f075c8554
fix: prebundle cudart and cublas 2024-11-04 15:37:18 +07:00
Louis
dc87f37a9b
fix: package cortex.cpp engines and cuda on windows 2024-11-04 15:37:17 +07:00
Louis
a0e2f16a3b
chore: binary naming convention - following llama.cpp release 2024-11-04 15:37:17 +07:00
Louis
03333cc4c2
fix: onboarding should cover cortex models - debounce reduce model reload - rename cortex binary name 2024-11-04 15:37:17 +07:00
Louis
40957f7686
fix: model reload state - reduce model unload events emit 2024-11-04 15:37:15 +07:00
Louis
523c745150
chore: try catch legacy assistant creation 2024-11-04 15:37:15 +07:00
Louis
716fd96d56
test: add tests for migration strategy 2024-11-04 15:37:15 +07:00
Louis
5edf121d96
test: add tests to legacy model-json utilities 2024-11-04 15:37:15 +07:00
Louis
895c3d4246
fix: tests - useModels with remote models filter 2024-11-04 15:37:15 +07:00
Louis
ba59425e6a
fix: tests 2024-11-04 15:37:14 +07:00
Louis
03e15fb70f
feat: sync model hub and download progress from cortex.cpp 2024-11-04 15:37:14 +07:00
Louis
f44f291bd8
chore: download progress finished should reload model list 2024-11-04 15:37:13 +07:00
Louis
4080dc4b65
feat: model and cortex extensions update 2024-11-04 15:37:12 +07:00
Sharun
10d4b3f4e0
Merge branch 'dev' into more-groq-models 2024-11-03 18:59:03 -06:00
Faisal Amir
b37d4a5c7e
fix: types issue (#internalTypeOnlyBrand) in the @types/node package (#3921) 2024-10-31 21:17:25 +07:00
Ashu
f7d318d20c
Merge branch 'dev' into patch-1 2024-10-30 19:35:44 +05:30
Faisal Amir
267f3ab051
fix: deprecated gpt with vision (#3912)
* fix: deprecated gpt 4 with vision

* chore: update package version inference openai extension
2024-10-30 17:42:17 +07:00
Ashu
777b0d3036
Add claude 3.5 sonnet 20241022 2024-10-27 07:05:55 +05:30
Sharun
f7ce83aba4
Merge branch 'dev' into more-groq-models 2024-10-23 20:17:28 +00:00
Louis
53098699ef
Merge pull request #3857 from Haleshot/haleshot/martian-api-hyperlink-fix
Update broken/outdated hyperlink
2024-10-23 10:08:28 +07:00
Faisal Amir
b14f54e866
fix: inconsistent state of downloading multimodal (#3862) 2024-10-22 15:44:13 +07:00
Srihari Thyagarajan
4c562c3e12
Update broken/outdated hyperlink 2024-10-21 23:53:02 +05:30
Louis
4983247918
fix: correct eos token of llava models 2024-10-21 12:58:18 +07:00
Sharun
72d178f3c3
update max_tokens for llama-3.1-8b-instant 2024-10-19 19:43:33 -04:00
Sharun
331e2bd35c
remove distil-whisper-large-v3-en as it does not support chat completions 2024-10-19 19:39:03 -04:00
Sharun
44878d6103
update max_tokens for llama-3.1-70b-versatile and fix typo 2024-10-19 19:36:04 -04:00
Sharun
ff46a1b009
add tags to groq/distil-whisper-large-v3-en 2024-10-19 19:24:16 -04:00
Sharun
4caa2a5322
feat: add more Groq models 2024-10-19 01:05:45 -04:00
Louis
024992264f
fix: error handling for model imports should be handled gracefully 2024-10-03 19:44:52 +07:00