mathematical reasoning - AI News

Gemini Ultra 2 Matches Humans on Grad Math Exams

2026-05-07 llm

Google DeepMind's Gemini Ultra 2 achieves human-level scores on graduate-level mathematics exams, marking a major milest…

2026-05-06 llm

Elon Musk's xAI releases Grok 3.5, which outperforms OpenAI's GPT-5 across major mathematical reasoning benchmarks.

2026-05-05 llm

Anthropic's Claude 4 sets new records on major mathematical reasoning benchmarks, outperforming GPT-4o and Gemini Ultra.

2026-05-05 research

Google DeepMind's AlphaProof system scores 28 out of 42 points at the 2024 International Mathematical Olympiad, narrowly…

2026-05-04 research

MathNet introduces 30,000 competition-level math problems to rigorously test AI mathematical reasoning, raising the bar …

2026-04-27 research

A recent arXiv paper proposes the 'Math Takes Two' testing framework, which examines whether language models possess gen…