Artificial Intelligence Computing Leadership from NVIDIA
Publications
Our publications provide insight into some of our leading-edge research.
10 results found
Programming Languages, Systems and Tools
2020
Locality-Centric Data and Threadblock Management for Massive GPUs
Mahmoud Khairy, Vadim Nikiforov, David Nellans, Timothy G. Rogers
A Programmable Approach to Neural Network Compression
Vinu Joseph, Ganesh L. Gopalakrishnan, Saurav Muralidharan, Michael Garland, Animesh Garg
Zeroploit: Exploiting Zero Valued Operands in Interactive Gaming Applications
Ram Rangan, Mark Stephenson, Aditya Ukarande, Shyam Murthy, Virat Agarwal, Marc Blackstein
There’s Plenty of Room at the Top: What Will Drive Computer Performance after Moore’s Law?
Charles E. Leiserson, Neil C. Thompson, Joel Emer, Bradley C. Kuszmaul, Butler W. Lampson, Daniel Sanchez, Tao B. Schardl
Speculative Reconvergence for Improved SIMT Efficiency
Sana Damani, Daniel Johnson, Mark Stephenson, Eddie Yan, Olivier Giroux, Michael McKeown, Steve Keckler
2019
Legate NumPy: Accelerated and Distributed Array Computing
Michael Bauer, Michael Garland
NVBit: A Dynamic Binary Instrumentation Framework for NVIDIA GPUs
Oreste Villa, Mark Stephenson, David Nellans, Steve Keckler
Task Bench: A Parameterized Benchmark for Evaluating Parallel Runtime Performance
Elliott Slaughter, Wei Wu, Yuankun Fu, Legend Brandenburg, Nicolai Garcia, Wilhem Kautz, Emily Marx, Kaleb S. Morris, Qinglei Cao, George Bosilca, Seema Mirchandaney, Wonchan Lee, Sean Treichler, Patrick McCormick, Alex Aiken
Timeloop: A Systematic Approach to DNN Accelerator Evaluation
Angshuman Parashar, Priyanka Raina, Yakun Sophia Shao, Yu-Hsin Chen, Victor A. Ying, Anurag Mukkara, Rangharajan Venkatesan, Brucek Khailany, Steve Keckler, Joel Emer
Throughput-oriented GPU memory allocation
Isaac Gelado, Michael Garland