Researchers studying how tools like ChatGPT affect their users propose a new kind of benchmark that measures a model’s emotional and social impact.

