personal-work-benchmark
JavaScript
★ 3
updated 2d ago
Personal real-work benchmark pipeline for model and coding-agent evaluation