efficient ai - AI News

ZAYA1-8B Matches DeepSeek-R1 on Math With Just 760M Active Params

2026-05-07 llm 👁 9

A new 8B MoE model called ZAYA1-8B achieves DeepSeek-R1-level math performance while activating only 760M of its 8B para…

2026-05-07 research 👁 8

UC Berkeley researchers unveil a new Transformer architecture that cuts compute costs by up to 60% while maintaining ben…

2026-05-07 research 👁 9

Vietnam's VinAI Research publishes cutting-edge work on making Vision Transformers faster and lighter for real-world dep…

2026-05-07 research 👁 9

UC Berkeley researchers unveil a novel attention mechanism that dramatically reduces memory consumption in Transformer m…

2026-05-06 llm 👁 10

Microsoft Research releases Phi-5, a small language model that rivals GPT-4 performance while running on consumer hardwa…