Akarshan Biswas
da23673a44
feat: Add API key generation for Llama.cpp
...
This commit introduces API key generation for the Llama.cpp extension. The API key is now generated on the server side using HMAC-SHA256 and a secret key to ensure security and uniqueness. The frontend now passes the model ID and API secret to the server to generate the key. This addresses the requirement for secure model access and authorization.
2025-07-02 12:27:12 +07:00
Akarshan Biswas
d6edb1e944
If checking for proper ctx_len settings after refactoring
2025-07-02 12:27:12 +07:00
Thien Tran
39bb3f34d6
patch failing calls to cortex
2025-07-02 12:27:12 +07:00
Akarshan Biswas
31971e7821
(WIP)randomly generate api-key hash each session
2025-07-02 12:27:12 +07:00
Akarshan Biswas
1dd762f0cf
remove parseGGUFFileName function as it is not used
2025-07-02 12:27:12 +07:00
Akarshan Biswas
7481fae0df
remove ununsed imports and remove n_ctx key from loadOptions
2025-07-02 12:27:11 +07:00
Akarshan Biswas
77d861f56f
Fixup: change key to ctx_size to align with upstream and remove duplicate key
2025-07-02 12:27:11 +07:00
Thien Tran
d5c07acdb5
feat: add LlamacppConfig for llama.cpp extension to improve settings ( #5121 )
...
* add engine settings
* update load options
* rename variable
2025-07-02 12:27:11 +07:00
Thien Tran
9bb4deeb78
update model config (import and list)
2025-07-02 12:27:11 +07:00
Thien Tran
5803fcdb99
add read_yaml. use buffered reader/writer
2025-07-02 12:27:11 +07:00
Thien Tran
d01cbe44ae
use PathBuf to check exists()
2025-07-02 12:27:11 +07:00
Thien Tran
77f6770333
update fileStat()
2025-07-02 12:27:10 +07:00
Akarshan Biswas
742e731e96
Add --reasoning_budget option
2025-07-02 12:27:10 +07:00
Akarshan Biswas
fe457a5368
slight modelbasepath refactoring
2025-07-02 12:27:10 +07:00
Akarshan Biswas
c5a0ee7f6e
refactor unload and implement a destructor to clean up sessions
2025-07-02 12:27:10 +07:00
Thien Tran
cd36b423b6
add basic model list
2025-07-02 12:27:10 +07:00
Thien Tran
d523166b61
implement delete
2025-07-02 12:27:09 +07:00
Akarshan Biswas
587ed3c83c
refactor OAI request payload type to support image and audio
2025-07-02 12:27:09 +07:00
Thien Tran
ded9ae733a
feat: Model import (download + local import) for llama.cpp extension ( #5087 )
...
* add pull and abortPull
* add model import (download only)
* write model.yaml. support local model import
* remove cortex-related command
* add TODO
* remove cortex-related command
2025-07-02 12:27:09 +07:00
Akarshan Biswas
a7a2dcc8d8
refactor load/unload again; move types to core and refactor AIEngine abstract class
2025-07-02 12:27:09 +07:00
Akarshan Biswas
ee2cb9e625
remove override from localOAIEngine and OAIEngine
2025-07-02 12:27:09 +07:00
Akarshan Biswas
0e9a8a27e5
fixup from refactoring
2025-07-02 12:27:08 +07:00
Akarshan Biswas
bbbf4779df
refactor load/unload
2025-07-02 12:27:08 +07:00
Akarshan Biswas
b4670b5526
remove cortex engine dirs
2025-07-02 12:27:08 +07:00
Akarshan Biswas
47881db696
remove cortex from tauri.conf.json
2025-07-02 12:27:08 +07:00
Akarshan Biswas
021f8ae80f
Fixup: llama-server load
2025-07-02 12:27:08 +07:00
Akarshan Biswas
ed6f86d4b1
Change scripts to download only llama.cpp engine
2025-07-02 12:27:07 +07:00
Akarshan Biswas
a8abc9f9aa
Resolved conflicts by keeping HEAD changes
2025-07-02 12:27:07 +07:00
Akarshan Biswas
19274f7e69
update settings
2025-07-02 12:26:39 +07:00
Thien Tran
3f082372fd
add llamacpp-extension. can list some models
2025-07-02 12:26:39 +07:00
Thien Tran
15f0b11c0d
make it compile
2025-07-02 12:26:38 +07:00
Akarshan Biswas
0551b0bfd2
Fix import
2025-07-02 12:26:38 +07:00
Akarshan Biswas
f5b5596306
add thiserror to Cargo.toml
2025-07-02 12:26:38 +07:00
Akarshan Biswas
5c9e8dce76
Add spaces before EOF
2025-07-02 12:26:38 +07:00
Akarshan Biswas
9016fbff68
feat: inference-llamacpp-extension: backend implementation
2025-07-02 12:26:37 +07:00
Louis
c288e75407
Merge pull request #5671 from menloresearch/dev
...
Sync dev into Release/v0.7.0
2025-07-02 11:33:01 +07:00
Louis
a3fd6fcd3c
Merge pull request #5670 from menloresearch/release/v0.6.6
...
Sync Release/v0.6.6 into dev
2025-07-02 10:58:33 +07:00
Louis
dc4e592de9
Merge branch 'dev' into release/v0.6.6
2025-07-02 10:43:12 +07:00
Louis
ccffe4ced5
Merge pull request #5669 from menloresearch/release/v0.6.4
...
Sync Release/v0.6.4 into dev
2025-07-02 10:42:07 +07:00
Faisal Amir
2a0597ff01
Merge branch 'dev' into release/v0.6.6
2025-07-02 10:26:32 +07:00
Bui Quang Huy
defc6b92d2
Merge pull request #5661 from menloresearch/template/yml-to-md
...
🔧 config: all yml to md for issue template
2025-07-01 23:47:33 +08:00
LazyYuuki
a8a1de9ad0
🔧 config: all yml to md for issue template
2025-07-01 23:45:44 +08:00
Bui Quang Huy
5690c43633
Merge pull request #5658 from menloresearch/template/fix-bug-md
...
🔧 config: fix bug template
2025-07-01 23:36:13 +08:00
LazyYuuki
5dc9073108
🔧 config: fix bug template
2025-07-01 23:33:30 +08:00
Bui Quang Huy
99df080bd6
Merge pull request #5657 from menloresearch/template/yml-to-md
...
🔧 config: from yml to md for template
2025-07-01 23:26:21 +08:00
LazyYuuki
8e837fc73d
🔧 config: from yml to md for template
2025-07-01 23:25:08 +08:00
Louis
71cf69bb16
Merge pull request #5606 from menloresearch/dependabot/npm_and_yarn/multi-f00634d4ce
...
chore(deps-dev): bump jest and @types/jest
2025-07-01 20:02:59 +07:00
Louis
94b25ec6e8
Merge pull request #5650 from menloresearch/fix/top_k-model-setting-validation
...
fix: top_k validation
v0.6.4
2025-07-01 17:31:03 +07:00
Louis
0b88d93e18
fix: top_k validation
2025-07-01 16:40:22 +07:00
Bui Quang Huy
7dd95101c2
Merge pull request #5647 from menloresearch/template/update-config
...
Update config.yml
2025-07-01 15:04:51 +08:00