CodeElo
Python
★ 75
updated 1y ago
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings