🏷️ sparse models

2 articles about 'sparse models'

Microsoft MoE Architecture Slashes Inference Costs 70%

2026-05-07 research 👁 11

Microsoft Research unveils a sparse Mixture-of-Experts architecture that reduces AI inference costs by 70% while maintai…

2026-05-05 research 👁 10

A new study reveals Mixture-of-Experts models activate only a fraction of parameters during inference, slashing compute …