Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(48)
2024
(109)
2023
(164)
2022
(165)
2021
(153)
2020
(135)
2019
(123)
2018
(107)
2017
(85)
2016
(57)
2015
(52)
2014
(23)
2013
(26)
2012
(20)
2011
(29)
2010
(19)
2009
(9)
2008
(14)
2007
(10)
2006
(1)
2005
(3)
2003
(1)
2001
(1)
Facet Publication Year
Research Areas
Computer Vision
(44)
Artificial Intelligence and Machine Learning
(38)
Computer Graphics
(31)
Computer Architecture
(29)
High Performance Computing
(18)
Robotics
(15)
Programming Languages, Systems and Tools
(8)
Real-Time Rendering
(8)
Resilience and Safety
(8)
Algorithms and Numerical Methods
(7)
Circuits and VLSI Design
(6)
Computational Photography and Imaging
(6)
Human Computer Interaction
(6)
VR, AR and Display Technology
(6)
Generative AI
(3)
Applied Perception
(2)
Autonomous Vehicles
(2)
Networking
(2)
Climate Simulation
(1)
Medical
(1)
Events
CORL
(1)
CVPR
(1)
ECCV
(1)
ICRA
(1)
136 results found
Clear all
2018
2011
2018
A Switching Linear Regulator Based on a Fast-Self-Clocked Comparator with Very Low Probability of Meta-stability and a Parallel Analog Ripple Control Module
Sudhir Kudva
,
Sanquan Song
, John Poulton,
John Wilson
, Wenxu Zhao,
Tom Gray
DUO: Exposing On-chip Redundancy to Rank-Level ECC for High Reliability
Seong-Lyong Gong, Jungrae Kim,
Michael B. Sullivan
, Howard David, Mattan Erez
Synchronous Multi-GPU Deep Learning with Low-Precision Communication: An Experimental Study
Demjan Grubic, Leo Tam, Dan Alistarh, Ce Zhang
Riemannian Motion Policies
Nathan Ratliff, Jan Issac, Daniel Kappler,
Stan Birchfield
, Dieter Fox
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu,
Mike O'Connor
,
Niladrish Chatterjee
, Jeff Pool, Youngeun Kwon,
Steve Keckler
Reducing Data Transfer Energy by Exploiting Similarity within a Data Transaction
Donghyuk Lee
,
Mike O'Connor
,
Niladrish Chatterjee
Best Paper nominee
Stitch-X: An Accelerator Architecture for Exploiting Unstructured Sparsity in Deep Neural Networks
Ching-En Lee, Yakun Sophia Shao, Jie-Fang Zhang,
Angshuman Parashar
,
Joel Emer
,
Steve Keckler
, Zhengya Zhang
MeltdownPrime and SpectrePrime: Automatically-Synthesized Attacks Exploiting Invalidation-Based Coherence Protocols
Caroline Trippel,
Daniel Lustig
, Margaret Martonosi
A 1.17pJ/b 25Gb/s/pin Ground-Referenced Single Ended Serial Link for Off- and On-Package Communication in 16nm CMOS Using a Process- and Temperature-Adaptive Voltage Regulator
John Wilson
,
Walker Turner
, John Poulton,
Brian Zimmer
,
Xi Chen
,
Sanquan Song
,
Stephen Tell
,
Nikola Nedovic
, Wenxu Zhao, Sunil Sudhakaran,
Tom Gray
,
William Dally
Learning Binary Residual Representations for Domain-specific Video Streaming
Yi-Hsuan Tsai,
Ming-Yu Liu
, Deqing Sun, Ming-Hsuan Yang,
Jan Kautz
Learning Adaptive Parameter Tuning for Image Processing
Jingming Dong,
Iuri Frosio
,
Jan Kautz
2011
Allocation-oriented Algorithm Design with Application to GPU Computing, Ph.D. Dissertation
Duane Merrill
A Compile-Time Managed Multi-Level Register File Hierarchy
Mark Gebhart,
Steve Keckler
,
William Dally
CudaDMA: Optimizing GPU Memory Bandwidth via Warp Specialization
Michael Bauer, Henry Cook,
Brucek Khailany
Improved Dual-Space Bounds for Simultaneous Motion and Defocus Blur
Samuli Laine
,
Tero Karras
Gaussian Process Regression Flow for Analysis of Motion Trajectories
Kihwan Kim, Dongryeol Lee, Irfan Essa
Thrust: A Productivity-Oriented Library for CUDA
Nathan Bell,
Jared Hoberock
Processing Device Arrays with C++ Metaprogramming
Jonathan Cohen
A Hybrid Method for Solving Tridiagonal Systems on the GPU
Yao Zhang, Jonathan Cohen, Andrew A. Davidson, John Owens
Efficient Triangle Coverage Tests for Stochastic Rasterization
Samuli Laine
,
Tero Karras
GPUs and the Future of Parallel Computing
Steve Keckler
,
William Dally
,
Brucek Khailany
,
Michael Garland
, David Glasco
Interactive Indirect Illumination Using Voxel Cone Tracing
Cyril Crassin
, Fabrice Neyret, Miguel Sainz, Simon Green, Elmar Eisemann
VoxelPipe: A Programmable Pipeline for 3D Voxelization
Jacopo Pantaleoni
Simpler and Faster HLBVH with Work Queues
Kirill Garanzha, Jacopo Pantaleoni, David McAllister
High-Performance Software Rasterization on GPUs
Samuli Laine
,
Tero Karras
High Performance and Scalable GPU Graph Traversal
Duane Merrill,
Michael Garland
, Andrew Grimshaw
Stochastic Transparency
Eric Enderton, Erik Sintorn, Peter Shirley,
David Luebke
Temporal Light Field Reconstruction for Rendering Distribution Effects
Jaakko Lehtinen
,
Timo Aila
, Jiawen Chen,
Samuli Laine
, Frédo Durand
Clipless Dual-Space Bounds for Faster Stochastic Rasterization
Samuli Laine
,
Timo Aila
,
Tero Karras
,
Jaakko Lehtinen
The Alchemy Screen-space Ambient Obscurance Algorithm
Morgan McGuire, Brian Osman, Michael Bukowski, Padraic Hennessy
Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods
Nathan Bell, Steven Dalton, Luke Olson
The Workflow Scale: Why 5x Faster Might Not Be Enough
Eric Enderton, Daniel Wexler
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Page
3
Current page
4
Page
5
Next page
Next ›
Last page
Last »