tiny_qa_benchmark_pp
Python
★ 16
updated 1mo ago
Tiny QA Benchmark++ a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultra-fast LLM smoke-testing & regression-catching.
No plain-English explanation yet — one is being written right now. Check back in a minute.