AQA-Bench

Python ★ 4 updated 2y ago

Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability
