A new benchmark measures how well AI agents can automate economically valuable chores. Human-level AI is still some ways off.

