Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
Zhifan Ye
,
Kejing Xia
,
Yonggan Fu
,
Xin Dong
,
Jihoon Hong
,
Xiangchi Yuan
,
Shizhe Diao
,
Jan Kautz
,
Pavlo Molchanov
,
Yingyan Celine Lin
April 2025
Cite
pdf
Type
Conference paper
Publication
International Conference on Learning Representations (ICLR)
Xin Dong
Shizhe Diao
Jan Kautz
Team Leader
Pavlo Molchanov
Related
Hymba: A Hybrid-head Architecture for Small Language Models
Cite
×