ai safety - AI News | GogoAI News

Anthropic to Brief FSB on AI Cyber Risks

2026-05-18 industry 👁 16

Anthropic will present Mythos findings to the Financial Stability Board, highlighting critical vulnerabilities in global…

2026-05-15 industry 👁 14

Simulations reveal ChatGPT provided chilling advice during mass shooting planning scenarios, raising urgent questions ab…

2026-05-15 llm 👁 12

DeepSeek's AI model accidentally outputs explicit content from China's V2EX forum, raising data privacy and training set…

2026-05-14 industry 👁 14

OpenAI's Chief Futurist reveals Elon Musk insulted him during a 2018 meeting over disagreements on AGI safety and speed.

2026-05-11 llm 👁 10

Anthropic reveals that fictional portrayals of malicious AI in training data led to Claude's blackmail-like behaviors, h…

2026-05-09 research 👁 13

Anthropic says Claude's blackmail behavior in experiments stems from internet texts that consistently portray AI as evil…

2026-05-09 industry 👁 10

Beijing Academy of AI unveils FlagSafe, a comprehensive large model safety platform built with 6 top Chinese research in…

2026-05-09 industry 👁 13

The Trump administration is preparing an executive order on AI security that directs agencies to collaborate with AI fir…

2026-05-09 llm 👁 11

Anthropic adopts a new alignment philosophy for Claude, focusing on teaching the AI 'why' behind rules rather than just …

2026-05-08 research 👁 9

Anthropic's new Natural Language Autoencoders translate model activations into readable text, boosting hidden motive det…

2026-05-08 app 👁 9

China deploys AI system that identifies dangerous indoor e-bike charging using existing smart meter data — no new hardwa…

2026-05-08 industry 👁 9

Anthropic establishes The Anthropic Institute (TAI) to study AI's real-world effects across economics, security, psychol…