NVIDIA Star Elastic: One Checkpoint, Three Models
NVIDIA releases Star Elastic, a post-training method embedding 30B, 23B, and 12B reasoning models in a single checkpoint…
9 articles about 'reasoning models'
NVIDIA releases Star Elastic, a post-training method embedding 30B, 23B, and 12B reasoning models in a single checkpoint…
Anthropic publishes new research advancing Constitutional AI methods for aligning reasoning models, setting a new standa…
DeepSeek R1's benchmark results challenge assumptions about the gap between open-source and proprietary AI models, spark…
Hugging Face releases open-weight reasoning models that match proprietary systems from OpenAI and Google on key benchmar…
AI safety experts warn that OpenAI's o3 reasoning models introduce unprecedented alignment challenges that existing safe…
Major AI labs including Meta, Alibaba, and Mistral are accelerating efforts to build open-source reasoning models that r…
DeepSeek releases its R2 reasoning model under the Apache 2.0 license, sparking fierce debate over open-source AI's impa…
The context window race is effectively over. The real competition now shifts to reasoning depth, efficiency, and archite…
DeepSeek's open-source R2 model matches or exceeds GPT-5 on key reasoning tasks, shaking up the AI competitive landscape…