AI models are already as good as experts at half of tasks, a new OpenAI benchmark suggests

Anthropic's Claude Opus 4.1 excelled at many professional tasks, especially those performed by clerks, software developers, and private investigators

© Fortune