gitmyhub

EDINET-Bench

Python ★ 35 updated 3mo ago

[ICLR 2026] Evaluating the performance of LLMs on Japanese challenging financial tasks.

No plain-English explanation yet — one is being written right now. Check back in a minute.