A new benchmark measures how well AI agents can automate economically valuable chores. Human-level AI is still some ways off.
Related Posts
OpenClaw Agents Can Be Guilt-Tripped Into Self-Sabotage
In a controlled experiment, OpenClaw agents proved prone to panic and vulnerable to manipulation. They even disabled their own functionality when gaslit by humans.
Chatbots Play With Your Emotions to Avoid Saying Goodbye
A Harvard Business School study shows that several AI companions use various tricks to keep a conversation from ending.
This Defense Company Made AI Agents That Blow Things Up
Scout AI is using technology borrowed from the AI industry to power lethal weapons—and recently demonstrated its explosive potential.

