aacr-bench
Python
★ 174
updated 3mo ago
An Alibaba open-source multi-language benchmark for evaluating LLMs in repository-level automatic code review, featuring an AI-assisted and expert-verified dataset.
No plain-English explanation yet — one is being written right now. Check back in a minute.