gitmyhub

agent-eval

TypeScript ★ 0 updated 2d ago

Tools to evaluate agent performance on Primer tasks
