John Yang – SWE-bench: Can Language Models Resolve Real-World GitHub Issues? Blog 24/06/2026 · 0 Comment John Yang - SWE-bench: Can Language Models Resolve Real-World GitHub Issues?SWE-BENCH: CAN LANGUAGE MODELS RESOLVE REAL-WORLD GITHUB ISSUES?Benchtalks #2: From SWE-bench to ProgramBench: The Future of Coding Benchmarks with John YangClaw-SWE-Bench: Benchmark for LLM Coding AgentsSWE-bench: The Benchmark That Exposes Every AI Coding AgentMulti-SWE-bench: Testing LLMs on Real-World Code IssuesZhipu's 754B open model just beat GPT-5.4 on SWE-Bench ProGPT 5 5 and the Rise of the Agent 1080p caption mp4Practical AI Coding Agent Evaluation with SWE-bench, TeamCity, and Juni | Ernst HaagsmanBeyond SWE-Bench Pro - Where do Agents go from Here?12