AI, Vol. 6, Pages 85: CacheFormer: High-Attention-Based Segment Caching

AI doi: 10.3390/ai6040085

Authors:
Sushant Singh
Ausif Mahmood

Efficiently handling long contexts in transformer-based language models with low perplexity is an active area of research. Numerous recent approaches, such as Linformer, Longformer, Performer, and structured state-space models (SSMs), have not fully resolved this problem. These models all strive to reduce the quadratic time complexity of the attention mechanism while minimizing the loss in quality caused by compressing the long context. We take inspiration from the cache and virtual-memory principle in computers, where on a cache miss not only the needed data but also the adjacent data are fetched from memory, and apply this idea to long contexts by dividing them into small segments. In our design, when a segment receives high attention at the compressed level, we retrieve that segment and its nearby segments in uncompressed form. Our enhancements for handling long contexts aggregate four attention mechanisms: short sliding-window attention, long compressed-segment attention, dynamic retrieval of the top-k high-attention segments in uncompressed form, and overlapping segments in the long segment attention to avoid segment fragmentation. The resulting architecture outperforms existing state-of-the-art (SOTA) architectures, with an average perplexity improvement of 8.5% at similar model sizes.
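
The cache-inspired retrieval step can be pictured with a short sketch. The PyTorch fragment below is not the paper's implementation; it only illustrates the idea of scoring compressed segments, taking the top-k by segment-level attention, and fetching those segments together with their immediate neighbours in uncompressed form, the way a cache miss pulls in a whole cache line. The segment length, the value of k, and mean pooling as the segment compressor are illustrative assumptions.

```python
# Minimal sketch (not the authors' code) of cache-style top-k segment retrieval.
import torch
import torch.nn.functional as F


def retrieve_top_k_segments(query, context, seg_len=64, k=4):
    """
    query   : (d,)   pooled query for the current short window
    context : (T, d) token embeddings of the long context, T % seg_len == 0
    returns : (m * seg_len, d) uncompressed tokens of the selected segments
    """
    T, d = context.shape
    n_seg = T // seg_len
    segments = context.view(n_seg, seg_len, d)

    # 1. Compress each segment (mean pooling used here as a stand-in).
    compressed = segments.mean(dim=1)                      # (n_seg, d)

    # 2. Segment-level attention scores at the compressed level.
    scores = compressed @ query / d ** 0.5                 # (n_seg,)
    attn = F.softmax(scores, dim=-1)

    # 3. Pick the top-k highest-attention segments ...
    top_idx = attn.topk(min(k, n_seg)).indices

    # 4. ... and, like a cache line fetch, also take their neighbours.
    with_neighbours = torch.cat([top_idx - 1, top_idx, top_idx + 1])
    with_neighbours = with_neighbours.clamp(0, n_seg - 1).unique()

    # 5. Return the selected segments uncompressed for full-resolution attention.
    return segments[with_neighbours].reshape(-1, d)


if __name__ == "__main__":
    torch.manual_seed(0)
    q = torch.randn(128)            # pooled query vector
    ctx = torch.randn(4096, 128)    # long context: 64 segments of 64 tokens
    selected = retrieve_top_k_segments(q, ctx)
    print(selected.shape)           # at most 12 segments, e.g. (768, 128)
```

In the full model, the tokens returned by such a retrieval step would feed only one of the four aggregated attention paths, alongside the short sliding-window attention, the long compressed-segment attention, and the overlapping-segment attention described above.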



Source: www.mdpi.com