Source record

microsoft.com

1 published story in AI Today use this source record.

Stories

High evidence

Canadian

Policy / public sector

ResearchHigh evidence
SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests
Microsoft Research introduced SocialReasoning-Bench, a benchmark for testing whether AI agents act in users' best interests. It measures both outcomes and process, adding a concrete evaluation signal for agentic AI systems as they move into higher-stakes workflows.
Issue 11 Original source (opens in new tab)May 11, 2026

SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests