Nicholai/jan - jan - Gitea: Git with a cup of tea

Nicholai/jan

Commit Graph

Author	SHA1	Message	Date
Akarshan	9afeb5e514	feat: Add offload_mmproj option and validation	2025-08-19 19:51:29 +07:00
Louis	13a1969150	feat: MCP - State update	2025-08-15 10:02:06 +07:00
Dinh Long Nguyen	e1c8d98bf2	Backend Architecture Refactoring (#6094) (#6162)	2025-08-15 08:59:01 +07:00

Akarshan

9afeb5e514

feat: Add offload_mmproj option and validation

This commit introduces a new configuration option offload_mmproj to the llamacpp extension.

The offload_mmproj setting allows users to control whether the multimodal projector model is offloaded to the GPU. By default, it's offloaded for better performance. If set to false, the projector model will remain on the CPU, which can be useful in low GPU memory scenarios, though image processing might take longer.

Additionally, this commit adds validate_mmproj_path to ensure the provided --mmproj path is valid and accessible, preventing issues during model loading.

This change also refactors some invoke calls for improved readability.

2025-08-19 19:51:29 +07:00
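
The behaviour described in this commit can be illustrated with a minimal TypeScript sketch. It is not the extension's actual code: the `LlamacppSettings` shape, the `buildMmprojArgs` and `validateMmprojPath` helpers, and the injected `isReadableFile` check are all hypothetical, and the `--no-mmproj-offload` flag is assumed to be the llama.cpp switch that keeps the projector on the CPU. Only `offload_mmproj`, `--mmproj`, and the idea of validating the path before loading come from the commit message.

```typescript
// Hypothetical settings shape; only `offload_mmproj` is named in the commit message.
interface LlamacppSettings {
  // true (the default): offload the multimodal projector to the GPU.
  // false: keep it on the CPU, trading image-processing speed for GPU memory.
  offload_mmproj: boolean;
}

// Injected file check so the sketch stays independent of any filesystem API.
type FileCheck = (path: string) => boolean;

// Hypothetical stand-in for the commit's validate_mmproj_path:
// reject empty or inaccessible paths before the model is loaded.
function validateMmprojPath(path: string, isReadableFile: FileCheck): string {
  const trimmed = path.trim();
  if (trimmed.length === 0) {
    throw new Error("mmproj path is empty");
  }
  if (!isReadableFile(trimmed)) {
    throw new Error(`mmproj path is not an accessible file: ${trimmed}`);
  }
  return trimmed;
}

// Build the projector-related command-line arguments for the server process.
function buildMmprojArgs(
  settings: LlamacppSettings,
  mmprojPath: string | undefined,
  isReadableFile: FileCheck
): string[] {
  if (mmprojPath === undefined) {
    return []; // text-only model: nothing to add
  }
  const args = ["--mmproj", validateMmprojPath(mmprojPath, isReadableFile)];
  if (!settings.offload_mmproj) {
    // Assumed flag name; verify against the llama.cpp build in use.
    args.push("--no-mmproj-offload");
  }
  return args;
}
```

Passing the file check in as a parameter keeps the argument-building logic easy to test; a real implementation would presumably query the filesystem directly.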

Louis

13a1969150

feat: MCP - State update

2025-08-15 10:02:06 +07:00

Dinh Long Nguyen

e1c8d98bf2

Backend Architecture Refactoring (#6094) (#6162)

* add llamacpp plugin

* Refactor llamacpp plugin

* add utils plugin

* remove utils folder

* add hardware implementation

* add utils folder + move utils function

* organize cargo files

* refactor utils src

* refactor util

* apply fmt

* fmt

* Update gguf + reformat

* add permission for gguf commands

* fix cargo test windows

* revert yarn lock

* remove cargo.lock for hardware plugin

* ignore cargo.lock file

* Fix hardware invoke + refactor hardware + refactor tests, constants

* use api wrapper in extension to invoke hardware call + api wrapper build integration

* add newline at EOF (per Akarshan)

* add vi mock for getSystemInfo

2025-08-15 08:59:01 +07:00

3 Commits