llama.cpp-glm52
C++
★ 0
updated 4h ago
llama.cpp fork: GLM-5.2 CPU∥GPU MoE expert-split decode speedup