Artificial Intelligence Computing Leadership from NVIDIA
Publications
Our publications provide insight into some of our leading-edge research.
Publication Year
2023 (8)
2022 (24)
2021 (31)
2020 (22)
2019 (27)
2018 (25)
2017 (23)
2016 (25)
2015 (20)
2014 (7)
2013 (2)
2012 (2)
2011 (4)
2009 (1)
2007 (1)
Research Areas
Computer Architecture (222)
Artificial Intelligence and Machine Learning (40)
Resilience and Safety (34)
High Performance Computing (30)
Programming Languages, Systems and Tools (21)
Circuits and VLSI Design (16)
Networking (5)
Autonomous Vehicles (4)
Computer Graphics (3)
Real-Time Rendering (2)
Robotics (2)
Computer Vision (1)
Generative AI (1)
Events
IROS (1)
NeurIPS (1)
PLDI (1)
222 results found
Computer Architecture
2023
Unity ECC: Unified Memory Protection Against Bit and Chip Errors
Dongwhee Kim, Jaeyoon Lee, Wonyeong Jung, Michael B. Sullivan, Jungrae Kim
VaPr: Variable-Precision Tensors to Accelerate Robot Motion Planning
Yu-Shun Hsiao, Siva Hari, Balakumar Sundaralingam, Jason Yik, Thierry Tambe, Charbel Sakr, Steve Keckler, Vijay Janapa Reddi
IROS
Efficient Transformer Inference with Statically Structured Sparse Attention
Steve Dai, Hasan Genc, Rangharajan Venkatesan, Brucek Khailany
Implicit Memory Tagging: No-Overhead Memory Safety Using Alias-Free Tagged ECC
Michael B. Sullivan, Mohamed Tarek Ibn Ziad, Aamer Jaleel, Stephen W. Keckler
cuCatch: A Debugging Tool for Efficiently Catching Memory Safety Violations in CUDA Applications
Mohamed Tarek Ibn Ziad, Sana Damani, Aamer Jaleel, Stephen W. Keckler, Mark Stephenson
PLDI
CuRobo: Parallelized Collision-Free Robot Motion Generation
Balakumar Sundaralingam, Siva Hari, Adam Fishman, Caelan Garrett, Karl Van Wyk, Valts Blukis, Alexander Millane, Helen Oleynikova, Ankur Handa, Fabio Ramos, Nathan Ratliff, Dieter Fox
Parsimony: Enabling SIMD/Vector Programming in Standard Compiler Flows
Vijay Kandiah, Daniel Lustig, Oreste Villa, David Nellans, Nikos Hardavellas
A 95.6-TOPS/W Deep Learning Inference Accelerator With Per-Vector Scaled 4-bit Quantization in 5 nm
Ben Keller, Rangharajan Venkatesan, Steve Dai, Stephen Tell, Brian Zimmer, Charbel Sakr, William Dally, Tom Gray, Brucek Khailany
2022
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression
Jiaqi Gu, Ben Keller, Jean Kossaifi, Anima Anandkumar, Brucek Khailany, David Z. Pan
NeurIPS
Spotlight Paper
LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update
Jiawei Zhao, Steve Dai, Rangharajan Venkatesan, Brian Zimmer, Mustafa Ali, Ming-Yu Liu, Brucek Khailany, William Dally, Anima Anandkumar
Towards Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications
Bo Fang, Siva Hari, Timothy Tsai, Xinyi Li, Ganesh Gopalakrishnan, Ignacio Laguna, Kevin Barker, Ang Li
The Implications of Page Size Management on Graph Analytics
Aninda Manocha, Zi Yan, Esin Tureci, Juan Luis Aragón, David Nellans, Margaret Martonosi
Demystifying Map Space Exploration for NPUs
Sheng-Chun Kao, Angshuman Parashar, Po-An Tsai, Tushar Krishna
Sparseloop: An Analytical Approach to Sparse Tensor Accelerator Modeling
Yannan Nellie Wu, Po-An Tsai, Angshuman Parashar, Vivienne Sze, Joel Emer
Distinguished Artifact award
SEC-BADAEC: An Efficient ECC With No Vacancy for Strong Memory Protection
Yuseok Song, Sangjae Park, Michael B. Sullivan, Jungrae Kim
Self Adaptive Reconfigurable Arrays (SARA): Learning Flexible GEMM Accelerator Configuration and Mapping-space using ML
Ananda Samajdar, Eric Qin, Michael Pellauer, Tushar Krishna
Zhuyi: Perception Processing Rate Estimation for Safety in Autonomous Vehicles
Yu-Shun Hsiao, Siva Hari, Michał Filipiuk, Timothy Tsai, Michael B. Sullivan, Vijay Janapa Reddi, Vasu Singh, Steve Keckler
Exploiting Temporal Data Diversity for Detecting Safety-critical Faults in AV Compute Systems
Saurabh Jha, Shengkun Cui, Timothy Tsai, Siva Hari, Michael B. Sullivan, Zbigniew T. Kalbarczyk, Steve Keckler, Ravishankar K. Iyer
Ruby: Improving Hardware Efficiency for Tensor Algebra Accelerators Through Imperfect Factorization
Mark Horeni, Pooria Taheri, Po-An Tsai, Angshuman Parashar, Joel Emer, Siddharth Joshi
Mixed-Proxy Extensions for the NVIDIA PTX Memory Consistency Model
Daniel Lustig, Simon Cooksey, Olivier Giroux
IEEE Micro Top Picks in Computer Architecture (Honorable Mention)
SIMD^2: A Generalized Matrix Instruction Set for Accelerating Tensor Computation beyond GEMM
Yunan Zhang, Po-An Tsai, Hung-Wei Tseng
A Formalism of DNN Accelerator Flexibility
Sheng-Chun Kao, Hyoukjun Kwon, Michael Pellauer, Angshuman Parashar, Tushar Krishna
Learning A Continuous and Reconstructible Latent Space for Hardware Accelerator Design
Qijing Jenny Huang, Charles Hong, John Wawrzynek, Mahesh Subedar, Yakun Sophia Shao
Saving PAM4 Bus Energy with SMOREs: Sparse Multi-level Opportunistic Restricted Encodings
Mike O'Connor, Donghyuk Lee, Niladrish Chatterjee, Michael B. Sullivan, Steve Keckler
Improving Locality of Irregular Updates with Hardware Assisted Propagation Blocking
Vignesh Balaji, Brandon Lucia
Best Paper nominee
Characterizing and Mitigating Soft Errors in GPU DRAM
Michael B. Sullivan, Nirmal R. Saxena, Mike O'Connor, Donghyuk Lee, Paul Racunas, Saurabh Hukerikar, Timothy Tsai, Siva Kumar Sastry Hari, Stephen W. Keckler
DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators
Sheng-Chun Kao, Michael Pellauer, Angshuman Parashar, Tushar Krishna
Marvel: A Data-Centric Approach for Mapping Deep Learning Operators on Spatial Accelerators
Prasanth Chatarasi, Hyoukjun Kwon, Angshuman Parashar, Michael Pellauer, Tushar Krishna, Vivek Sarkar
DAGguise: Mitigating Memory Timing Side Channels
Peter W. Deutsch, Yuheng Yang, Thomas Bourgeat, Jules Drean, Joel Emer, Mengjia Yan
Accelerators
Steve Keckler, Dejan Milojicic
GPU Subwarp Interleaving
Sana Damani, Mark Stephenson, Ram Rangan, Daniel Johnson, Rishkul Kulkarni, Steve Keckler