gpu_poor
★ 0
updated 2y ago
⑂ fork
Calculate tokens/s and GPU memory requirements for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization.