Memory Optimization - AI News

δ-mem: Cutting LLM Memory Costs by 90%

2026-05-17 research

New δ-mem framework slashes GPU memory usage for LLMs by 90%, enabling efficient online inference on consumer hardware.

2026-05-07 research

MIT researchers introduce a sparse attention mechanism that slashes Transformer memory usage by 80% while preserving mod…

2026-05-07 research

UC Berkeley researchers unveil a novel attention mechanism that dramatically reduces memory consumption in Transformer m…

2026-05-05 research

Seoul National University team develops novel memory optimization techniques enabling large AI model training on consume…