gitmyhub

mastermind

Python ★ 10 updated 1mo ago

MastermindEval, for evaluating the reasoning capabilities of LLMs. Presented at the ICLR 2025 Workshop on Reasoning and Planning of LLMs.
