 
 # LLM Pruning and Distillation in Practice: The Minitron Approach


 ## Authors



Sharath Turuvekere Sreenivas

[Saurav Muralidharan](/index.php/person/saurav-muralidharan)

Raviraj Joshi

Marcin Chochowski

Mostofa Patwary (NVIDIA)

Mohammad Shoeybi (NVIDIA)

Bryan Catanzaro (NVIDIA)

[Jan Kautz](/index.php/person/jan-kautz)

[Pavlo Molchanov](/index.php/person/pavlo-molchanov)

 

 

 ## Publication Date



Wednesday, August 21, 2024

 

 ## Research Area



[Artificial Intelligence and Machine Learning](/index.php/research-area/machine-learning-artificial-intelligence)

 

 

 ## Uploaded Files



[LLM Pruning and Distillation in Practice: The Minitron Approach](https://d1qx31qr3h6wln.cloudfront.net/publications/minitron_tech_report_3_0.pdf "Open file in new window") (2.26 MB)