gitmyhub

UserDriven-LLMEval

Jupyter Notebook ★ 2 updated 11mo ago

[GEM @ ACL 2025] Towards Comprehensive Evaluation of Open-Source Language Models: A Multi-Dimensional, User-Driven Approach

No plain-English explanation yet — one is being written right now. Check back in a minute.