Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(4)
2023
(4)
2022
(4)
2021
(6)
2020
(5)
2019
(5)
2018
(6)
2017
(6)
2016
(1)
2015
(2)
2014
(2)
2013
(2)
2012
(1)
2011
(2)
2010
(1)
2008
(2)
Facet Publication Year
Research Areas
Programming Languages, Systems and Tools
(11)
Computer Architecture
(5)
High Performance Computing
(5)
Artificial Intelligence and Machine Learning
(1)
Computer Graphics
(1)
Networking
(1)
Events
No Results Available
11 results found
Programming Languages, Systems and Tools
Clear all
2020
2017
Programming Languages, Systems and Tools
2020
Locality-Centric Data and Threadblock Management for Massive GPUs
Mahmoud Khairy, Vadim Nikiforov,
David Nellans
, Timothy G. Rogers
A Programmable Approach to Neural Network Compression
Vinu Joseph, Ganesh L. Gopalakrishnan,
Saurav Muralidharan
,
Michael Garland
, Animesh Garg
Zeroploit: Exploiting Zero Valued Operands in Interactive Gaming Applications
Ram Rangan,
Mark Stephenson
, Aditya Ukarande, Shyam Murthy, Virat Agarwal, Marc Blackstein
There’s Plenty of Room at the Top: What Will Drive Computer Performance after Moore’s Law?
Charles E. Leiserson, Neil C. Thompson,
Joel Emer
, Bradley C. Kuszmaul, Butler W. Lampson, Daniel Sanchez , Tao B. Schardl
Speculative Reconvergence for Improved SIMT Efficiency
Sana Damani, Daniel Johnson,
Mark Stephenson
, Eddie Yan, Olivier Giroux, Michael McKeown,
Steve Keckler
2017
Integrating External Resources with a Task-Based Programming Model
Zhihao Jia, Sean Treichler, Galen Shipman,
Michael Bauer
, Noah Watkins, Carlos Maltzahn, Patrick McCormick, Alex Aiken
A Novel Shard-Based Approach for Asynchronous Many-Task Models for In Situ Analysis
Philippe P. Pébaÿ, Giulio Borghesi, Hemanth Kolla, Janine C. Bennett, Sean Treichler
Control Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical Regions
Elliott Slaughter, Wonchan Lee, Sean Treichler, Wen Zhang,
Michael Bauer
, Galen Shipman, Patrick McCormick, Alex Aiken
Relaxations for High-Performance Message Passing on Massively Parallel SIMT Processors
Benjamin Klenk, Holger Fröning,
Hans Eberle
,
Larry Dennison
Best Paper Award
Automated Synthesis of Comprehensive Memory Model Litmus Test Suites
Daniel Lustig
, Andrew Wright, Alexandros Papakonstantinou, Olivier Giroux
TriCheck: Memory Model Verification at the Trisection of Software, Hardware, and ISA
Caroline Trippel, Yatin A. Manerkar,
Daniel Lustig
,
Michael Pellauer
, Margaret Martonosi
IEEE Micro Top Picks in Computer Architecture