Artificial Intelligence Computing Leadership from NVIDIA
Publications
Our publications provide insight into some of our leading-edge research.
3 results found
Active filters: 2021, Circuits and VLSI Design, Computer Architecture
2021
Softermax: Hardware/Software Co-Design of an Efficient Softmax for Transformers
Jacob R. Stevens, Rangharajan Venkatesan, Steve Dai, Brucek Khailany, Anand Raghunathan
Simba: scaling deep-learning inference with chiplet-based architecture
Yakun Sophia Shao, Jason Clemons, Rangharajan Venkatesan, Brian Zimmer, Matt Fojtik, Ted Jiang, Ben Keller, Alicia Klinefelter, Nathaniel Pinckney, Priyanka Raina, Stephen Tell, Yanqing Zhang, William Dally, Joel Emer, Tom Gray, Brucek Khailany, Steve Keckler
ACM Research Highlight
VS-QUANT: Per-Vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
Steve Dai, Rangharajan Venkatesan, Haoxing (Mark) Ren, Brian Zimmer, William Dally, Brucek Khailany