rtx5060ti-gemma4-26b-kv-cache-bug
Batchfile
★ 0
updated 7d ago
Notes on a prefill hang in llama.cpp (CUDA) when running Gemma 4 26B on an RTX 5060 Ti with a mixed q8/q4 KV cache