Claude 4 Opus Shatters Scientific Reasoning Benchmarks
Anthropic's Claude 4 Opus achieves record scores across multiple scientific reasoning benchmarks, outperforming GPT-4o a…
7 articles about 'Claude 4 Opus'
Anthropic's Claude 4 Opus achieves record scores across multiple scientific reasoning benchmarks, outperforming GPT-4o a…
Anthropic's Claude 4 Opus sets new state-of-the-art scores on GPQA and other graduate-level reasoning benchmarks, outpac…
Anthropic's Claude 4 Opus achieves unprecedented scores on graduate-level science benchmarks, surpassing GPT-4o and Gemi…
Anthropic's Claude 4 Opus scores 92.4% on SWE-bench, outperforming OpenAI's GPT-5 by 7 points in software engineering ta…
Anthropic's Claude 4 Opus achieves state-of-the-art results in complex legal reasoning tasks, outperforming GPT-4o and G…
Anthropic unveils Claude 4 Opus, its most powerful model yet, featuring a 500K token context window and major reasoning …
Anthropic reportedly begins internal testing of Claude 4 Opus this summer, signaling a major leap in AI model capabiliti…