  Niladrish Chatterjee  

 




  

 Niladrish Chatterjee is a Senior Research Scientist in the Architecture Research Group. His research focuses on realizing energy-efficient, high-performance memory and processor architectures that will power future supercomputers and artificially intelligent machines.



He received a PhD in Computer Engineering from the University of Utah in 2013, and a B.E. in Computer Science from Jadavpur University in 2007.



   Research Area(s)

[High Performance Computing](/index.php/research-area/high-performance-computing)

[Artificial Intelligence and Machine Learning](/index.php/research-area/machine-learning-artificial-intelligence)

 

 

  

 Main Field of Interest

[Computer Architecture](/index.php/research-area/computer-architecture)

 

  

 Google Scholar

[https://scholar.google.com/citations?user=NuCyyPgAAAAJ&hl=en](https://scholar.google.com/citations?user=NuCyyPgAAAAJ&hl=en)

 

  

 

 

 



 ### Publications

 

### 2022 

[Saving PAM4 Bus Energy with SMOREs: Sparse Multi-level Opportunistic Restricted Encodings](/index.php/publication/2022-04_saving-pam4-bus-energy-smores-sparse-multi-level-opportunistic-restricted)

[Mike O'Connor](/index.php/person/mike-o-connor), [Donghyuk Lee](/index.php/person/donghyuk-lee), [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Michael B. Sullivan](/index.php/person/mike-sullivan), [Steve Keckler](/index.php/person/stephen-keckler)



[International Symposium on High-Performance Computer Architecture (HPCA)](https://ieeexplore.ieee.org/document/9773229)









### 2021 

[GPU Domain Specialization via Composable On-Package Architecture](/publication/2021-12_gpu-domain-specialization-composable-package-architecture)

[Yaosheng Fu](/person/yaosheng-fu), Evgeny Bolotin, [Niladrish Chatterjee](/person/niladrish-chatterjee), [David Nellans](/person/david-nellans), [Steve Keckler](/person/stephen-keckler)



[ACM Transactions on Architecture and Code Optimization (TACO)](https://dl.acm.org/doi/full/10.1145/3484505)









[GPU Domain Specialization via Composable On-Package Architecture](/index.php/publication/2021-04_gpu-domain-specialization-composable-package-architecture)

[Yaosheng Fu](/index.php/person/yaosheng-fu), Evgeny Bolotin, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [David Nellans](/index.php/person/david-nellans), [Steve Keckler](/index.php/person/stephen-keckler)



[arXiv](https://arxiv.org/abs/2104.02188)









[Learning Sparse Matrix Row Permutations for Efficient SpMM on GPU Architectures](/index.php/publication/2021-03_learning-sparse-matrix-row-permutations-efficient-spmm-gpu-architectures)

Atefeh Mehrabi, [Donghyuk Lee](/index.php/person/donghyuk-lee), [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), Daniel J. Sorin, Benjamin C. Lee, [Mike O'Connor](/index.php/person/mike-o-connor)



[International Symposium on Performance Analysis of Systems and Software (ISPASS)](https://ieeexplore.ieee.org/document/9408181)









[Need for Speed: Experiences Building a Trustworthy System-Level GPU Simulator](/publication/2021-02_need-speed-experiences-building-trustworthy-system-level-gpu-simulator)

Oreste Villa, [Daniel Lustig](/person/daniel-lustig), [Zi Yan](/person/zi-yan), Evgeny Bolotin, [Yaosheng Fu](/person/yaosheng-fu), [Niladrish Chatterjee](/person/niladrish-chatterjee), [Ted Jiang](/person/ted-jiang), [David Nellans](/person/david-nellans)



[International Symposium on High Performance Computer Architecture (HPCA)](https://doi.org/10.1109/HPCA51647.2021.00077)









### 2019 

[Near-Memory Data Transformation for Efficient Sparse Matrix Multi-Vector Multiplication](/index.php/publication/2019-11_near-memory-data-transformation-efficient-sparse-matrix-multi-vector)

Daichi Fujiki, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Donghyuk Lee](/index.php/person/donghyuk-lee), [Mike O'Connor](/index.php/person/mike-o-connor)



[International Conference for High-Performance Computing, Networking, Storage, a…](https://dl.acm.org/doi/10.1145/3295500.3356154)









[DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis](/index.php/publication/2019-04_delta-gpu-performance-model-deep-learning-applications-depth-memory-system)

Sangkug Lym, [Donghyuk Lee](/index.php/person/donghyuk-lee), [Mike O'Connor](/index.php/person/mike-o-connor), [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), Mattan Erez



[arXiv](https://arxiv.org/abs/1904.01691)









[DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis](/index.php/publication/2019-03_delta-gpu-performance-model-deep-learning-applications-depth-memory-system)

Sangkug Lym, [Donghyuk Lee](/index.php/person/donghyuk-lee), [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Mike O'Connor](/index.php/person/mike-o-connor), Mattan Erez



[International Symposium on Performance Analysis of Systems and Software (ISPASS)](https://ieeexplore.ieee.org/document/8695646)









### 2018 

[What Your DRAM Power Models Are Not Telling You: Lessons from a Detailed Experimental Study](/publication/2018-07_what-your-dram-power-models-are-not-telling-you-lessons-detailed-experimental)

Saugata Ghose, Abdullah Giray Yağlıkçı, Raghav Gupta, [Donghyuk Lee](/person/donghyuk-lee), Kais Kudrolli, William X. Liu, Hasan Hassan, Kevin K. Chang, [Niladrish Chatterjee](/person/niladrish-chatterjee), Aditya Agrawal, [Mike O'Connor](/person/mike-o-connor), Onur Mutlu



[arXiv](https://arxiv.org/abs/1807.05102)









[What Your DRAM Power Models Aren’t Telling You: Lessons from a Detailed Experimental Study](/index.php/publication/2018-06_what-your-dram-power-models-aren-t-telling-you-lessons-detailed-experimental)

Saugata Ghose, Abdullah Giray Yağlıkçı, Raghav Gupta, [Donghyuk Lee](/index.php/person/donghyuk-lee), Kais Kudrolli, William X. Liu, Hasan Hassan, Kevin Chang, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), Aditya Agrawal, [Mike O'Connor](/index.php/person/mike-o-connor), Onur Mutlu



[ACM International Conference on Measurement and Analysis of Computer Systems (S…](https://dl.acm.org/doi/abs/10.1145/3224419)









[Voltron: Understanding and Exploiting the Voltage-Latency-Reliability Trade-Offs in Modern DRAM Chips to Improve Energy Efficiency](/index.php/publication/2018-05_voltron-understanding-and-exploiting-voltage-latency-reliability-trade-offs)

Kevin K. Chang, Abdullah Giray Yağlıkçı, Saugata Ghose, Aditya Agrawal, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), Abhijith Kashyap, [Donghyuk Lee](/index.php/person/donghyuk-lee), [Mike O'Connor](/index.php/person/mike-o-connor), Hasan Hassan, Onur Mutlu



[arXiv](https://arxiv.org/abs/1805.03175)









[Reducing Data Transfer Energy by Exploiting Similarity within a Data Transaction](/index.php/publication/2018-02_reducing-data-transfer-energy-exploiting-similarity-within-data-transaction)

[Donghyuk Lee](/index.php/person/donghyuk-lee), [Mike O'Connor](/index.php/person/mike-o-connor), [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee)



[International Symposium on High Performance Computer Architecture (HPCA)](https://ieeexplore.ieee.org/document/8326997)



Best Paper nominee





[Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks](/publication/2018-02_compressing-dma-engine-leveraging-activation-sparsity-training-deep-neural)

Minsoo Rhu, [Mike O'Connor](/person/mike-o-connor), [Niladrish Chatterjee](/person/niladrish-chatterjee), Jeff Pool, Youngeun Kwon, [Steve Keckler](/person/stephen-keckler)



[International Symposium on High Performance Computer Architecture (HPCA)](https://ieeexplore.ieee.org/document/8327000)









### 2017 

[Toward Standardized Near-Data Processing with Unrestricted Data Placement for GPUs](/index.php/publication/2017-11_toward-standardized-near-data-processing-unrestricted-data-placement-gpus)

Gwangsun Kim, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Mike O'Connor](/index.php/person/mike-o-connor), Kevin Hsieh



[International Conference for High-Performance Computing, Networking, Storage, a…](https://dl.acm.org/citation.cfm?id=3126965)









[Fine-Grained DRAM: Energy-Efficient DRAM for Extreme Bandwidth Systems](/publication/2017-10_fine-grained-dram-energy-efficient-dram-extreme-bandwidth-systems)

[Mike O'Connor](/person/mike-o-connor), [Niladrish Chatterjee](/person/niladrish-chatterjee), [Donghyuk Lee](/person/donghyuk-lee), [John Wilson](/person/john-wilson), Aditya Agrawal, [Steve Keckler](/person/stephen-keckler), [William Dally](/person/william-dally)



[International Symposium on Microarchitecture (MICRO)](https://dl.acm.org/citation.cfm?id=3124545)









[Understanding Reduced-Voltage Operation in Modern DRAM Devices: Experimental Characterization, Analysis, and Mechanisms](/index.php/publication/2017-06_understanding-reduced-voltage-operation-modern-dram-devices-experimental)

Kevin Chang, Abdullah Giray Yağlıkçı, Saugata Ghose, Aditya Agrawal, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), Abhijith Kashyap, [Donghyuk Lee](/index.php/person/donghyuk-lee), [Mike O'Connor](/index.php/person/mike-o-connor), Hasan Hassan, Onur Mutlu



[ACM Conference on Measurement and Analysis of Computer Systems (SIGMETRICS 2017)](http://dl.acm.org/citation.cfm?id=3078590)









[Understanding Reduced-Voltage Operation in Modern DRAM Chips: Characterization, Analysis, and Mechanisms](/publication/2017-05_understanding-reduced-voltage-operation-modern-dram-chips-characterization)

Kevin K. Chang, Abdullah Giray Yağlıkçı, Saugata Ghose, Aditya Agrawal, [Niladrish Chatterjee](/person/niladrish-chatterjee), Abhijith Kashyap, [Donghyuk Lee](/person/donghyuk-lee), [Mike O'Connor](/person/mike-o-connor), Hasan Hassan, Onur Mutlu



[arXiv](https://arxiv.org/abs/1705.10292)









[Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks](/publication/2017-05_compressing-dma-engine-leveraging-activation-sparsity-training-deep-neural)

Minsoo Rhu, [Mike O'Connor](/person/mike-o-connor), [Niladrish Chatterjee](/person/niladrish-chatterjee), Jeff Pool, [Stephen W. Keckler](/person/stephen-keckler)



[arXiv](https://arxiv.org/abs/1705.01626)









[Architecting an Energy-Efficient DRAM System for GPUs](/index.php/publication/2017-02_architecting-energy-efficient-dram-system-gpus)

[Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Mike O'Connor](/index.php/person/mike-o-connor), [Donghyuk Lee](/index.php/person/donghyuk-lee), Daniel Johnson, Minsoo Rhu, [Steve Keckler](/index.php/person/stephen-keckler), [William Dally](/index.php/person/william-dally)



[International Symposium on High Performance Computer Architecture (HPCA)](http://ieeexplore.ieee.org/document/7920815/)









### 2016 

[CLARA: Circular Linked-List Auto- and Self-Refresh Architecture](/index.php/publication/2016-10_clara-circular-linked-list-auto-and-self-refresh-architecture)

Aditya Agrawal, [Mike O'Connor](/index.php/person/mike-o-connor), Evgeny Bolotin, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Joel Emer](/index.php/person/joel-emer), [Steve Keckler](/index.php/person/stephen-keckler)



[International Symposium on Memory Systems (MEMSYS'16)](https://dl.acm.org/doi/10.1145/2989081.2989084)









[Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems](/index.php/publication/2016-06_transparent-offloading-and-mapping-tom-enabling-programmer-transparent-near)

Kevin Hsieh, Eiman Ebrahimi, Gwangsun Kim, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Mike O'Connor](/index.php/person/mike-o-connor), Nandita Vijaykumar, Onur Mutlu, [Steve Keckler](/index.php/person/stephen-keckler)



[International Symposium on Computer Architecture (ISCA)](http://ieeexplore.ieee.org/document/7551394/)









### 2015 

[Anatomy of GPU Memory System for Multi-Application Execution](/index.php/publication/2015-10_anatomy-gpu-memory-system-multi-application-execution)

Adwait Jog, Onur Kayiran, Tuba Kesten, Ashutosh Pattnaik, Evgeny Bolotin, [Niladrish Chatterjee](/index.php/person/niladrish-chatterjee), [Steve Keckler](/index.php/person/stephen-keckler), Mahmut T. Kandemir, Chita R. Das



[International Symposium on Memory Systems (MEMSYS)](http://dl.acm.org/citation.cfm?id=2818979)