May 22, 2026
Toward Agent Benchmarks That Reflect Human Work: AI agents may not be getting better at the full range of economically valuable labor
AI agents seem to be increasingly capable of performing economically valuable tasks, but current benchmarks measure this capability only narrowly.