MIT Cracks Energy-Efficient Transformer Design
MIT researchers unveil a new transformer architecture that cuts energy consumption by up to 70% while maintaining compet…
1 article about 'adaptive sparse attention'