Publications
Our publications provide insight into some of our leading-edge research.
10 results found
Programming Languages, Systems and Tools
2025
Task-Based Tensor Computations on Modern GPUs
Rohan Yadav, Michael Garland, Alex Aiken, Michael Bauer
PLDI
Adaptive Algebraic Reuse of Reordering in Cholesky Factorizations with Dynamic Sparsity Patterns
Behrooz Zarebavani, Danny Kaufman, David Levin, Maryam Mehri Dehnavi
SIGGRAPH
Composing Distributed Computations Through Task and Kernel Fusion
Rohan Yadav, Shiv Sundrum, Wonchan Lee, Michael Garland, Michael Bauer, Alex Aiken, Fredrik Kjolstad
Automatic Tracing in Task-Based Runtime Systems
Rohan Yadav, Michael Bauer, David Broman, Michael Garland, Alex Aiken, Fredrik Kjolstad
2021
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators
Geonhwa Jeong, Gokcen Kestor, Prasanth Chatarasi, Angshuman Parashar, Po-An Tsai, Sivasankaran Rajamanickam, Roberto Gioiosa, Tushar Krishna
Cooperative Profile Guided Optimization
Mark Stephenson, Ram Rangan, Steve Keckler
Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers
Harini Muthukrishnan, David Nellans, Daniel Lustig, Jeffrey Fessler, Thomas Wenisch
PGZ: Automatic Zero-Value Code Specialization
Mark Stephenson, Ram Rangan
Scaling Implicit Parallelism via Dynamic Control Replication
Michael Bauer, Wonchan Lee, Elliott Slaughter, Zhihao Jia, Mario Di Renzo, Manolis Papadakis, Galen Shipman, Patrick McCormick, Michael Garland, Alex Aiken
Hardware Abstractions for Targeting EDDO Architectures with the Polyhedral Model
Angshuman Parashar, Prasanth Chatarasi, Po-An Tsai