Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
LlamaFlex: Many-in-One LLMs via Generalized Pruning and Weight Sharing
Ruisi Cai
,
Saurav Muralidharan
,
Hongxu (Danny) Yin
,
Zhangyang Wang
,
Jan Kautz
,
Pavlo Molchanov
April 2025
Cite
pdf
Type
Conference paper
Publication
International Conference on Learning Representations (ICLR)
Saurav Muralidharan
Hongxu (Danny) Yin
Jan Kautz
Team Leader
Pavlo Molchanov
Related
Flextron: Many-in-One Flexible Large Language Model
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Cite
×