gitmyhub

personal-work-benchmark

JavaScript ★ 3 updated 2d ago

Personal real-work benchmark pipeline for model and coding-agent evaluation

No plain-English explanation yet — one is being written right now. Check back in a minute.