Model Alignment - AI News

Anthropic Unveils Constitutional AI 2.0 Framework

2026-05-06 research 👁 8

Anthropic launches Constitutional AI 2.0, a major upgrade to its AI safety alignment framework with new oversight mechan…

2026-05-06 research 👁 7

OpenAI researchers introduce a new alignment framework challenging Anthropic's Constitutional AI approach with rule-base…

2026-05-05 research 👁 10

New research shows Constitutional AI training methods dramatically reduce toxic and harmful outputs from large language …