Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Compact Language Models via Pruning and Knowledge Distillation
Saurav Muralidharan
,
Sharath Turuvekere Sreenivas
,
Raviraj Joshi
,
Marcin Chochowski
,
Mostofa Patwary
,
Mohammad Shoeybi
,
Bryan Catanzaro
,
Jan Kautz
,
Pavlo Molchanov
December 2024
Cite
arXiv
Type
Conference paper
Publication
Advances in Neural Information Processing Systems (NeurIPS)
Saurav Muralidharan
Jan Kautz
Team Leader
Pavlo Molchanov
Related
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
RLP: Reinforcement Learning Pre-training
Cite
×