Terminal-Bench 2.0: Benchmarking AI Agents on Hard, Realistic CLI Tasks

Channel: PaperLens
528 views • 23/02/2026