🏷️ transformer-efficiency

1 article about 'transformer-efficiency'

Google Proposes Mixture-of-Depths to Cut Transformer Costs

2026-05-05 research

Google DeepMind researchers introduce a Mixture-of-Depths architecture that dynamically allocates compute per token, cutti…