gitmyhub

rtx5060ti-gemma4-26b-kv-cache-bug

Batchfile ★ 0 updated 7d ago

RTX 5060 Ti + Gemma 4 26B llama.cpp CUDA mixed KV cache q8/q4 prefill hang notes

No plain-English explanation yet — one is being written right now. Check back in a minute.