Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2023
(2)
2022
(6)
2021
(10)
2020
(8)
2019
(6)
2018
(3)
2017
(2)
2016
(3)
Facet Publication Year
Research Areas
Artificial Intelligence and Machine Learning
(10)
Computer Architecture
(10)
Circuits and VLSI Design
(3)
Resilience and Safety
(2)
High Performance Computing
(1)
Programming Languages, Systems and Tools
(1)
Events
No Results Available
10 results found
Artificial Intelligence and Machine Learning
Computer Architecture
Clear all
2021
Artificial Intelligence and Machine Learning
Computer Architecture
2021
Softermax: Hardware/Software Co-Design of an Efficient Softmax for Transformers
Jacob R. Stevens,
Rangharajan Venkatesan
,
Steve Dai
,
Brucek Khailany
, Anand Raghunathan
Optimizing Selective Protection for CNN Resilience
Abdulrahman Mahmoud,
Siva Hari
, Christopher W. Fletcher, Sarita V. Adve,
Charbel Sakr
, Naresh Shanbhag,
Pavlo Molchanov
,
Michael B. Sullivan
, Timothy Tsai,
Steve Keckler
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators
Geonhwa Jeong, Gokcen Kestor, Prasanth Chatarasi,
Angshuman Parashar
,
Po-An Tsai
, Sivasankaran Rajamanickam, Roberto Gioiosa, Tushar Krishna
Simba: scaling deep-learning inference with chiplet-based architecture
Yakun Sophia Shao,
Jason Clemons
,
Rangharajan Venkatesan
,
Brian Zimmer
,
Matt Fojtik
,
Ted Jiang
,
Ben Keller
, Alicia Klinefelter,
Nathaniel Pinckney
, Priyanka Raina,
Stephen Tell
,
Yanqing Zhang
,
William Dally
,
Joel Emer
,
Tom Gray
,
Brucek Khailany
,
Steve Keckler
ACM Research Highlight
GAMMA: Exploiting Gustavson’s Algorithm to Accelerate Sparse Matrix Multiplication
Guowei Zhang, Nithya Attaluri,
Joel Emer
, Daniel Sanchez
VS-QUANT: Per-Vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
Steve Dai
,
Rangharajan Venkatesan
,
Haoxing (Mark) Ren
,
Brian Zimmer
,
William Dally
,
Brucek Khailany
Learning Sparse Matrix Row Permutations for Efficient SpMM on GPU Architectures
Atefeh Mehrabi,
Donghyuk Lee
,
Niladrish Chatterjee
, Danial J. Sorin, Benjamin C. Lee,
Mike O'Connor
Making Convolutions Resilient via Algorithm-Based Error Detection Techniques
Siva Hari
,
Michael B. Sullivan
, Timothy Tsai,
Steve Keckler
Heterogeneous Dataflow Accelerators for Multi-DNN Workloads
Hyoukjun Kwon, Liangzhen Lai,
Michael Pellauer
, Tushar Krishna, Yu-Hsin Chen, Vikas Chandra
Flexion: A Quantitative Metric for Flexibility in DNN Accelerators
Hyoukjun Kwon,
Michael Pellauer
,
Angshuman Parashar
, Tushar Krishna