Changed MMAP to mmap
parent 0048fe022f
commit 54aae456c4
@@ -150,7 +150,7 @@ For detailed hardware compatibility, please visit our guide for [Mac](/docs/desk
 | **Flash Attention** | - Optimizes attention computation<br></br>- Reduces memory usage<br></br>- Recommended for most cases | Enabled |
 | **Caching** | - Enable to store recent prompts and responses<br></br>- Improves response time for repeated prompts | Enabled |
 | **KV Cache Type** | - KV cache implementation type; controls memory usage and precision trade-off<br></br>- Options:<br></br>• f16 (most stable)<br></br>• q8_0 (balanced)<br></br>• q4_0 (lowest memory) | f16 |
-| **MMAP** | - Enables memory-mapped model loading<br></br>- Reduces memory usage<br></br>- Recommended for large models | Enabled |
+| **mmap** | - Enables memory-mapped model loading<br></br>- Reduces memory usage<br></br>- Recommended for large models | Enabled |
 
 ## Best Practices