h2o-LLM-eval
Jupyter Notebook
★ 52
updated 1y ago
▣ archived
Large-language Model Evaluation framework with Elo Leaderboard and A-B testing
No plain-English explanation yet — one is being written right now. Check back in a minute.