Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators

To meet the extreme compute demands for deep learning across commercial and scientific applications, dataflow accelerators are becoming increasingly popular. While these “domain-specific” accelerators are not fully programmable like CPUs and GPUs, they retain varying levels of flexibility with respect to data orchestration, i.e., dataflow and tiling optimizations to enhance efficiency. There are several challenges when designing new algorithms and mapping approaches to execute the algorithms for a target problem on new hardware. Previous works have addressed these challenges individually. To address this challenge as a whole, in this work, we present a HW-SW co-design ecosystem for spatial accelerators called Union1 within the popular MLIR compiler infrastructure. Our framework allows exploring different algorithms and their mappings on several accelerator cost models. Union also includes a plug-and-play library of accelerator cost models and mappers which can easily be extended. The algorithms and accelerator cost models are connected via a novel mapping abstraction that captures the map space of spatial accelerators which can be systematically pruned based on constraints from the hardware, workload, and mapper. We demonstrate the value of Union for the community with several case studies which examine offloading different tensor operations (CONV/GEMM/Tensor Contraction) on diverse accelerator architectures using different mapping schemes.

Authors

Geonhwa Jeong (Georgia Institute of Technology)

Gokcen Kestor (Pacific Northwest National Laboratory)

Prasanth Chatarasi (IBM Research)

Angshuman Parashar

Po-An Tsai

Sivasankaran Rajamanickam (Sandia National Laboratories)

Roberto Gioiosa (Pacific Northwest National Laboratory)

Tushar Krishna (Georgia Institute of Technology)

Publication Date

Sunday, September 26, 2021

Published in

Parallel Architectures and Compilation Techniques (PACT)

Research Area

Artificial Intelligence and Machine Learning

Computer Architecture

Programming Languages, Systems and Tools

External Links

IEEE Digital Library

Uploaded Files

Published manuscript548.46 KB

Copyright

This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org.