Publications
Our publications provide insight into some of our leading-edge research.
24 results found (Computer Architecture, 2022)
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression
Jiaqi Gu, Ben Keller, Jean Kossaifi, Anima Anandkumar, Brucek Khailany, David Z. Pan
NeurIPS
Spotlight Paper
LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update
Jiawei Zhao, Steve Dai, Rangharajan Venkatesan, Brian Zimmer, Mustafa Ali, Ming-Yu Liu, Brucek Khailany, William Dally, Anima Anandkumar
Towards Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications
Bo Fang, Siva Hari, Timothy Tsai, Xinyi Li, Ganesh Gopalakrishnan, Ignacio Laguna, Kevin Barker, Ang Li
The Implications of Page Size Management on Graph Analytics
Aninda Manocha, Zi Yan, Esin Tureci, Juan Luis Aragón, David Nellans, Margaret Martonosi
Demystifying Map Space Exploration for NPUs
Sheng-Chun Kao, Angshuman Parashar, Po-An Tsai, Tushar Krishna
Sparseloop: An Analytical Approach to Sparse Tensor Accelerator Modeling
Yannan Nellie Wu, Po-An Tsai, Angshuman Parashar, Vivienne Sze, Joel Emer
Distinguished Artifact award
SEC-BADAEC: An Efficient ECC With No Vacancy for Strong Memory Protection
Yuseok Song, Sangjae Park, Michael B. Sullivan, Jungrae Kim
Self Adaptive Reconfigurable Arrays (SARA): Learning Flexible GEMM Accelerator Configuration and Mapping-space using ML
Ananda Samajdar, Eric Qin, Michael Pellauer, Tushar Krishna
Zhuyi: Perception Processing Rate Estimation for Safety in Autonomous Vehicles
Yu-Shun Hsiao, Siva Hari, Michał Filipiuk, Timothy Tsai, Michael B. Sullivan, Vijay Janapa Reddi, Vasu Singh, Steve Keckler
Exploiting Temporal Data Diversity for Detecting Safety-critical Faults in AV Compute Systems
Saurabh Jha, Shengkun Cui, Timothy Tsai, Siva Hari, Michael B. Sullivan, Zbigniew T. Kalbarczyk, Steve Keckler, Ravishankar K. Iyer
Ruby: Improving Hardware Efficiency for Tensor Algebra Accelerators Through Imperfect Factorization
Mark Horeni, Pooria Taheri, Po-An Tsai, Angshuman Parashar, Joel Emer, Siddharth Joshi
Mixed-Proxy Extensions for the NVIDIA PTX Memory Consistency Model
Daniel Lustig, Simon Cooksey, Olivier Giroux
IEEE Micro Top Picks in Computer Architecture (Honorable Mention)
SIMD^2: A Generalized Matrix Instruction Set for Accelerating Tensor Computation beyond GEMM
Yunan Zhang, Po-An Tsai, Hung-Wei Tseng
A Formalism of DNN Accelerator Flexibility
Sheng-Chun Kao, Hyoukjun Kwon, Michael Pellauer, Angshuman Parashar, Tushar Krishna
Learning A Continuous and Reconstructible Latent Space for Hardware Accelerator Design
Qijing Jenny Huang, Charles Hong, John Wawrzynek, Mahesh Subedar, Yakun Sophia Shao
Zhuyi: Perception Processing Rate Estimation for Safety in Autonomous Vehicles
Yu-Shun Hsiao, Siva Hari, Michał Filipiuk, Timothy Tsai, Michael B. Sullivan, Vijay Janapa Reddi, Vasu Singh, Steve Keckler
Saving PAM4 Bus Energy with SMOREs: Sparse Multi-level Opportunistic Restricted Encodings
Mike O'Connor, Donghyuk Lee, Niladrish Chatterjee, Michael B. Sullivan, Steve Keckler
Improving Locality of Irregular Updates with Hardware Assisted Propagation Blocking
Vignesh Balaji, Brandon Lucia
Best Paper nominee
Characterizing and Mitigating Soft Errors in GPU DRAM
Michael B. Sullivan, Nirmal R. Saxena, Mike O'Connor, Donghyuk Lee, Paul Racunas, Saurabh Hukerikar, Timothy Tsai, Siva Kumar Sastry Hari, Stephen W. Keckler
DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators
Sheng-Chun Kao, Michael Pellauer, Angshuman Parashar, Tushar Krishna
Marvel: A Data-Centric Approach for Mapping Deep Learning Operators on Spatial Accelerators
Prasanth Chatarasi, Hyoukjun Kwon, Angshuman Parashar, Michael Pellauer, Tushar Krishna, Vivek Sarkar
DAGguise: Mitigating Memory Timing Side Channels
Peter W. Deutsch, Yuheng Yang, Thomas Bourgeat, Jules Drean, Joel Emer, Mengjia Yan
Accelerators
Steve Keckler, Dejan Milojicic
GPU Subwarp Interleaving
Sana Damani, Mark Stephenson, Ram Rangan, Daniel Johnson, Rishkul Kulkarni, Steve Keckler