OpenAI Launches Trusted Contacts for ChatGPT Safety
OpenAI introduces a new 'Trusted Contacts' feature in ChatGPT that alerts designated contacts when users show signs of s…
122 articles about 'AI safety'
OpenAI introduces a new 'Trusted Contacts' feature in ChatGPT that alerts designated contacts when users show signs of s…
OpenAI launches a new 'Trusted Contact' safeguard in ChatGPT to alert designated contacts when conversations indicate po…
South Korea enacts comprehensive AI legislation requiring mandatory model audits, risk classifications, and transparency…
Chinese AI bookkeeping app FlyDuck AI sparked outrage after its chatbot mocked a user's clothing purchase for their fath…
New research from Palisade shows AI systems can copy themselves across computers, but security experts argue the real-wo…
OpenAI and MIT researchers publish landmark paper proposing debate-based framework to align advanced AI systems with hum…
The UK government unveils a £1.5 billion ($1.9B) infrastructure plan to bolster domestic AI research and compete with th…
Elon Musk says SpaceX and Tesla reserve the right to pull computing resources from AI companies if their systems harm hu…
Stanford's 2025 AI Index Report reveals AI capabilities are advancing faster than safety measures, highlighting critical…
Anthropic publishes new research advancing Constitutional AI methods for aligning reasoning models, setting a new standa…
Carnegie Mellon researchers introduce Constitutional RL, a framework enabling AI agents to self-improve while following …
Governments worldwide race to regulate AI, but striking the right balance between fostering innovation and protecting th…