gitmyhub

SWELancer-Benchmark

★ 1.4k updated 11mo ago ▣ archived

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

No plain-English explanation yet — one is being written right now. Check back in a minute.