HMT: Hierarchical Memory Transformer for Efficient Long Context Language Processing

Venue: 
NAACL 2025