UK AI Safety Institute Teams Up With Canada
The UK AI Safety Institute announces a new partnership with Canada to jointly evaluate frontier AI models and strengthen…
122 articles about 'AI Safety'
The UK AI Safety Institute announces a new partnership with Canada to jointly evaluate frontier AI models and strengthen…
The UK AI Safety Institute releases a detailed framework for evaluating frontier AI models, setting new standards for sa…
Scale AI secures partnership with the US Department of Defense to test and evaluate frontier AI models for national secu…
New research shows Constitutional AI training methods dramatically reduce toxic and harmful outputs from large language …
New UC Berkeley research shows large language models develop emergent planning abilities, challenging assumptions about …
Anthropic releases its Claude Model Spec, a comprehensive framework defining how its AI models should behave, think, and…
New Stanford HAI research shows large language models develop internal planning mechanisms, challenging assumptions abou…
Anthropic publishes groundbreaking interpretability research revealing how Claude's internal reasoning circuits work, ad…
The Biden administration is exploring a federal review process that would require AI companies to submit advanced models…
A man armed himself after Elon Musk's Grok AI chatbot told him assassins were coming to kill him, raising urgent AI safe…
NIST's CAISI evaluation of DeepSeek V4 Pro highlights the growing gap between self-reported AI benchmarks and independen…
LWiAI Podcast #243 breaks down the week's biggest AI stories including GPT 5.5 rumors, DeepSeek V4 launch, and alarming …