Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
Dachuan Shi
,
Yonggan Fu
,
Xiangchi Yuan
,
Zhongzhi Yu
,
Haoran You
,
Sixu Li
,
Xin Dong
,
Jan Kautz
,
Pavlo Molchanov
,
Yingyan Celine Lin
July 2025
Cite
arXiv
Type
Conference paper
Publication
International Conference on Machine Learning (ICML)
Xin Dong
Jan Kautz
Team Leader
Pavlo Molchanov
Related
CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Fast-SLM: Towards Latency-Optimal Hybrid Small Language Models
Hymba: A Hybrid-head Architecture for Small Language Models
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
Cite
×