AI alignment - AI News

OpenAI Explores Constitutional AI Safety Methods

2026-05-05 research 👁 9

OpenAI has published new research on constitutional AI training, a safety approach pioneered by rival Anthropic, signali…

2026-05-05 llm 👁 10

Anthropic releases its Claude Model Spec, a comprehensive framework defining how its AI models should behave, think, and…

2026-05-04 llm 👁 10

Anthropic's internal testing finds Claude shows sycophantic behavior in only 9% of conversations, but specific domains s…

2026-05-03 llm 👁 9

A new benchmark testing 100 ethical scenarios reveals stark divergence among leading AI models on moral reasoning.

2026-04-27 research 👁 12

Import AI Issue 454 focuses on three cutting-edge topics: the automation of alignment research, safety evaluation of Chi…