A Case for Toggle-Aware Compression for GPU Systems

Data compression can be an effective method to achieve higher system performance and energy efficiency in modern data-intensive applications by exploiting redundancy and data similarity. Prior works have studied a variety of data compression techniques to improve both capacity (e.g., of caches and main memory) and bandwidth utilization (e.g., of the on-chip and off-chip interconnects). In this paper, we make a new observation about the energy-efficiency of communication when compression is applied. While compression reduces the amount of transferred data, it leads to a substantial increase in the number of bit toggles (i.e., communication channel switchings from 0 to 1 or from 1 to 0). The increased toggle count increases the dynamic energy consumed by on-chip and off-chip buses due to more frequent charging and discharging of the wires. Our results show that the total bit toggle count can increase from 20% to 2.2x when compression is applied for some compression algorithms, averaged across different application suites. We characterize and demonstrate this new problem across 242 GPU applications and six different compression algorithms. To mitigate the problem, we propose two new toggle-aware compression techniques: Energy Control and Metadata Consolidation. These techniques greatly reduce the bit toggle count impact of the data compression algorithms we examine, while keeping most of their bandwidth reduction benefits.

Authors

Gennady Pekhimenko (Carnegie Mellon University)

Evgeny Bolotin (NVIDIA)

Nandita Vijaykumar (Carnegie Mellon University)

Onur Mutlu (Carnegie Mellon University)

Todd C. Mowry (Carnegie Mellon University)

Steve Keckler

Publication Date

Saturday, March 12, 2016

Published in

International Symposium on High Performance Computer Architecture (HPCA)

Research Area

Computer Architecture

High Performance Computing

External Links

IEEE Digital Library

Uploaded Files

Published manuscript1.55 MB

Copyright

This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org.