SubQ: Sub-Quadratic LLM Handles 12M-Token Context
SubQ introduces a sub-quadratic architecture enabling LLMs to process up to 12 million tokens, shattering previous conte…
1 articles about 'sub-quadratic attention'
SubQ introduces a sub-quadratic architecture enabling LLMs to process up to 12 million tokens, shattering previous conte…