gitmyhub

C3-Benchmark

Python ★ 39 updated 3mo ago

C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking

No plain-English explanation yet — one is being written right now. Check back in a minute.