gitmyhub

h2o-LLM-eval

Jupyter Notebook ★ 52 updated 1y ago ▣ archived

Large-language Model Evaluation framework with Elo Leaderboard and A-B testing

No plain-English explanation yet — one is being written right now. Check back in a minute.