Improving Locality of Irregular Updates with Hardware Assisted Propagation Blocking

Many application domains perform irregular memory updates. Irregular accesses lead to inefficient use of conventional cache hierarchies. To make better use of the cache, we focus on Propagation Blocking (PB), a software-based cache locality optimization initially designed for graph processing applications. We make two contributions in this work. First, we show that PB generalizes beyond graph processing applications to any application with unordered parallelism and irregular memory updates. Second, we identify the inefficiencies of a PB execution on conventional multicore processors and propose architecture support to further improve the performance gains from PB. Our proposed architecture, COBRA, optimizes the PB execution of a range of applications with irregular memory updates, offering speedups of up to 3.78x compared to PB (1.74x on average).

Authors

Vignesh Balaji

Brandon Lucia (Carnegie Mellon University)

Publication Date

Saturday, April 2, 2022

Published in

International Symposium on High Performance Computer Architecture (HPCA)

Research Area

Computer Architecture

External Links

IEEE Digital Library

Uploaded Files

Published Manuscript1.03 MB

Award

Best Paper nominee

Copyright

This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org.