AQA-Bench
Python
★ 4
updated 2y ago
Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability
No plain-English explanation yet — one is being written right now. Check back in a minute.