Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-based Runtimes

Many recent programming systems for both supercomputing and data center workloads generate task graphs to express computations that run on parallel and distributed machines. Due to the overhead associated with constructing these graphs the dependence analysis that generates them is often statically computed and memoized, and the resulting graph executed repeatedly at runtime. However, many applications require a dynamic dependence analysis due to data dependent behavior, but there are new challenges in capturing and re-executing task graphs at runtime. In this work, we introduce dynamic tracing, a technique to capture a dynamic dependence analysis of a trace that generates a task graph, and replay it. We show that an implementation of dynamic tracing improves strong scaling by an average of 4.9× and up to 7.0× on a suite of already optimized benchmarks.

Authors

Wonchan Lee (Stanford University)

Elliott Slaughter (SLAC National Accelerator Laboratory)

Michael Bauer

Sean Treichler (NVIDIA)

Todd Warszawski (Stanford University)

Michael Garland

Alex Aiken (Stanford University)

Publication Date

Sunday, November 11, 2018

Published in

International Conference for High Performance Computing and Communications (SC'…

Research Area

High Performance Computing

Programming Languages, Systems and Tools

External Links

ACM Digital Library

Uploaded Files

Published manuscript439.06 KB

Copyright

Copyright by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept, ACM Inc., fax +1 (212) 869-0481, or permissions@acm.org. The definitive version of this paper can be found at ACM's Digital Library http://www.acm.org/dl/.