Gemma 3 Sets New Bar for Open Weight AI Models
Google DeepMind releases Gemma 3, delivering frontier-class performance in an open weight model that runs on a single GP…
90 articles about 'Large Language Models'
Google DeepMind releases Gemma 3, delivering frontier-class performance in an open weight model that runs on a single GP…
OpenAI's o3-mini reasoning model achieves gold medal-level performance on International Math Olympiad problems, signalin…
OpenAI unveils GPT-5 Turbo featuring native multimodal reasoning across text, image, audio, and video inputs.
Google DeepMind unveils Gemini 2.5 Ultra, its most powerful AI model yet, featuring a 1-million-token context window and…
New research and theoretical analysis suggest hallucination is a fundamental, mathematical limitation of large language …
DeepSeek's latest V4 model closes the gap with top Western AI systems, raising questions about the future of the AI race…
The art and science of communicating with transformer-based AI models is reshaping how developers and users interact wit…
DeepSeek's latest V4 model series arrives 15 months after R1, but benchmark comparisons suggest it still trails top US m…
Large language models are capable of far more than chat-based Q&A. This article outlines seven unconventional LLM applic…
A new arXiv paper explores whether fundamental reasoning modes such as deduction, induction, and abduction can be decoup…
A new study reveals that the geometric relationships of semantic features in the hidden states of large language models …
A new study proposes a 'human-in-the-loop' benchmarking framework that systematically evaluates the performance of multi…