Publications | Research

15 results found
High Performance Computing

Clear all

2020

Accelerating Reinforcement Learning through GPU Atari Emulation

Iuri Frosio, Steven Dalton

Locality-Centric Data and Threadblock Management for Massive GPUs

Mahmoud Khairy, Vadim Nikiforov, David Nellans, Timothy G. Rogers

EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal In GPUs

Seung Won Min, Vikram Sharma Mailthody, Zaid Qureshi, Jinjun Xiong, Eiman Ebrahimi, Wen-mei Hwu

Buddy Compression: Enabling Larger Memory for Deep Learning and HPC Workloads on GPUs

Esha Chouske, Michael B. Sullivan, Mike O'Connor, Mattan Erez, Jeff Pool, David Nellans, Steve Keckler

An In-Network Architecture for Accelerating Shared-Memory Multiprocessor Collectives

Benjamin Klenk, Ted Jiang, Greg Thorson, Larry Dennison

NWChem: Past, Present, and Future

Edoardo Aprà, Many others, Oreste Villa, Many others

2016

Tensor Contractions with Extended BLAS Kernels on CPU and GPU

Yang Shi, U. N. Niranjan, Animashree Anandkumar, Cris Cecka

vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design.

Minsoo Rhu, Natalia Gimelshein, Jason Clemons, Arslan Zulfiqar, Steve Keckler

Approxilyzer: Towards A Systematic Framework for Instruction-Level Approximate Computing and its Application to Hardware Resiliency

Radha Venkatagiri, Abdulrahman Mahmoud, Siva Hari, Sarita Adve

All-Inclusive ECC: Thorough End-to-End Protection for Reliable Computer Memory

Jungrae Kim, Michael B. Sullivan, Sangkug Lym, Mattan Erez

S-Step and Communication-Avoiding Iterative Methods

Maxim Naumov

Selective GPU Caches to Eliminate CPU-GPU HW Cache Coherence

Neha Agarwal, David Nellans, Eiman Ebrahimi, Thomas F. Wenisch, John Danskin, Steve Keckler

Towards High Performance Paged Memory for GPUs

Tianhao Zheng, David Nellans, Arslan Zulfiqar, Mark Stephenson, Steve Keckler

A Case for Toggle-Aware Compression for GPU Systems

Gennady Pekhimenko, Evgeny Bolotin, Nandita Vijaykumar, Onur Mutlu, Todd C. Mowry, Steve Keckler

Parallel Spectral Graph Partitioning

Maxim Naumov, Timothy Moon