Thread - Nostr Hypermedia

Kieran kieran@snort.social 3 weeks ago

Im running latest release b8705 but its not listed as a kv cache type yet

↑ Parent

Replies (1)

Kieran kieran@snort.social 3 weeks ago

ggml : add CPU TurboQuant KV cache types (TBQ3_0 / TBQ4_0) by elusznik · Pull Request #21089 · ggml-org/llama.cpp

Summary This PR adds CPU-only TurboQuant KV-cache support for two new cache types: tbq3_0 tbq4_0 The scope is intentionally narrow for the first ...

1 replies ↓

↑