Publications
Our publications provide insight into some of our leading-edge research.
9 results found
Programming Languages, Systems and Tools
2023
Legate Sparse: Distributed Sparse Computing in Python
Rohan Yadav, Wonchan Lee, Melih Elibol, Taylor Patti, Manolis Papadakis, Michael Garland, Alex Aiken, Fredrik Kjolstad, Michael Bauer
cuCatch: A Debugging Tool for Efficiently Catching Memory Safety Violations in CUDA Applications
Mohamed Tarek Ibn Ziad, Sana Damani, Aamer Jaleel, Stephen W. Keckler, Mark Stephenson
PLDI
Visibility Algorithms for Dynamic Dependence Analysis and Distributed Coherence
Michael Bauer, Elliott Slaughter, Sean Treichler, Wonchan Lee, Michael Garland, Alex Aiken
Parsimony: Enabling SIMD/Vector Programming in Standard Compiler Flows
Vijay Kandiah, Daniel Lustig, Oreste Villa, David Nellans, Nikos Hardavellas
2019
Legate NumPy: Accelerated and Distributed Array Computing
Michael Bauer, Michael Garland
NVBit: A Dynamic Binary Instrumentation Framework for NVIDIA GPUs
Oreste Villa, Mark Stephenson, David Nellans, Steve Keckler
Task Bench: A Parameterized Benchmark for Evaluating Parallel Runtime Performance
Elliott Slaughter, Wei Wu, Yuankun Fu, Legend Brandenburg, Nicolai Garcia, Wilhem Kautz, Emily Marx, Kaleb S. Morris, Qinglei Cao, George Bosilca, Seema Mirchandaney, Wonchan Lee, Sean Treichler, Patrick McCormick, Alex Aiken
Timeloop: A Systematic Approach to DNN Accelerator Evaluation
Angshuman Parashar, Priyanka Raina, Yakun Sophia Shao, Yu-Hsin Chen, Victor A. Ying, Anurag Mukkara, Rangharajan Venkatesan, Brucek Khailany, Steve Keckler, Joel Emer
Throughput-oriented GPU memory allocation
Isaac Gelado, Michael Garland