DeepSeek Gains Vision — Then Deletes Its Own Paper
DeepSeek quietly published a technical paper revealing its first multimodal vision capabilities, only to remove the docu…
Latest articles in LLM News
DeepSeek quietly published a technical paper revealing its first multimodal vision capabilities, only to remove the docu…
AI safety experts warn that OpenAI's o3 reasoning models introduce unprecedented alignment challenges that existing safe…
AI2 releases OLMo 3, a fully open large language model designed to advance transparent AI research and rival proprietary…
NVIDIA releases a powerful open-source vision-language model achieving benchmark scores competitive with OpenAI's GPT-4o…
Hugging Face debuts an open leaderboard for evaluating agentic AI systems, bringing transparency to one of AI's fastest-…
Google releases Gemma 3, its latest family of open-source AI models designed to run directly on consumer hardware withou…
Mistral AI launches Codestral 2.0, its most powerful code generation model yet, targeting enterprise developers with imp…
Cohere opens fine-tuning API for its Command R Plus model, targeting enterprise RAG workloads with customizable large la…
Meta releases Llama 4 Maverick as a fully open-weight foundation model, intensifying the open-source AI race against clo…
Google expands Gemini API with native 2 million token context window, dwarfing competitors and redefining what developer…
Users report Claude frequently suggests stopping mid-task, raising questions about AI alignment, context window manageme…
Users report Claude AI frequently suggesting breaks, deferring tasks, and recommending pauses mid-conversation, sparking…