Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
Ali Taghibakhshi
,
Sharath Turuvekere Sreenivas
,
Saurav Muralidharan
,
Marcin Chochowski
,
Yashaswi Karnati
,
Raviraj Bhuminand Joshi
,
Ameya Sunil Mahabaleshwarkar
,
Zijia Chen
,
Yoshi Suhara
,
Oluwatobale Olabiyi
,
Daniel Korzekwa
,
Mostofa Patwary
,
Mohammad Shoeybi
,
Jan Kautz
,
Bryan Catanzaro
December 2025
Cite
arXiv
Type
Conference paper
Publication
Advances in Neural Information Processing Systems (NeurIPS)
Saurav Muralidharan
Jan Kautz
Team Leader
Related
Compact Language Models via Pruning and Knowledge Distillation
CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Hymba: A Hybrid-head Architecture for Small Language Models
RLP: Reinforcement Learning Pre-training
Cite
×