The 'Gay Jailbreak' Technique Exposed: New AI Safety Vulnerability Draws Industry Attention
In 2025, an LLM attack technique dubbed the 'Gay Jailbreak' has sparked widespread discussion. Attackers exploit AI mode…
Latest articles in LLM News
IBM has officially launched the Granite 4.1 series of large language models, spanning multiple parameter scales and rele…
A large language model bypass technique dubbed the 'Gay Jailbreak' has sparked heated debate in the AI community. The me…
The latest cybersecurity tests reveal that OpenAI's GPT-5.5 has reached a level comparable to the much-hyped Mythos Prev…
Amazon dives deep into the RLAIF technical approach, leveraging LLMs as judges to perform reinforced fine-tuning on its …
Google DeepMind has officially launched the Gemma 4 series of open-source models, touting them as the most capable open …
The UK AI Security Institute has completed its cybersecurity capability assessment of OpenAI's GPT-5.5, finding its vuln…
OpenAI recently attempted to inject a "nerdy" personality into ChatGPT, but the experiment backfired when the model deve…
Recently, a large number of users discovered that ChatGPT was frequently and inappropriately mentioning "goblins" in its…
OpenAI recently discovered a peculiar bug in ChatGPT where the model frequently and inappropriately referenced "goblins"…
IBM has officially released the Granite 4.1 series, in which the 8B dense model matches or even surpasses the performanc…
OpenAI CEO Sam Altman has announced that GPT-5.5-Cyber will be rolled out to key cyber defenders in the coming days.…