AutoCompress: Efficient Transformer Compression Through Critical Layer Isolation
A research team has proposed the AutoCompress method, discovering that Layer 0 in small Transformers carries over 60 tim…
1 article about 'Neural Tangent Kernel'