SWE-rebench: Lessons from Evaluating Coding Agents — Ibragim Badertdinov, Nebius

Channel: AI Engineer
3K views • 2 weeks ago