1. [Publications](/index.php/publications)
2. GPU Subwarp Interleaving
 
 # GPU Subwarp Interleaving

  ![Publication image](/sites/default/files/styles/wide/public/default_images/default.jpeg?itok=qUFsuJCP "Publication image")

 Raytracing applications have naturally high thread divergence, low warp occupancy and are limited by memory latency. In this paper, we present an architectural enhancement called Subwarp Interleaving that exploits thread divergence to hide pipeline stalls in divergent sections of low warp occupancy workloads. Subwarp Interleaving allows for fine-grained interleaved execution of diverged paths within a warp with the goal of increasing hardware utilization and reducing warp latency. However, notwithstanding the promise shown by early microbenchmark studies and an average performance upside of 6.3% (up to 20%) on a simulator across a suite of raytracing application traces, the Subwarp Interleaving design feature has shortcomings that preclude its near-term implementation. This paper introduces the reader to the challenges of raytracing and discusses a novel microarchitectural approach that, on paper, addresses many of the challenges. A thorough analysis of the idea on a production simulator reveals that the high-level motivating statistics are optimistic, and second-order effects, along with other architectural sharp edges, limit the idea’s potential. We identify Subwarp Interleaving’s primary limiters for an NVIDIA Turing-like architecture, and we outline the conditions under which the approach could be more effective.



 ## Authors



Sana Damani (Georgia Institute of Technology)

[Mark Stephenson](/index.php/person/mark-stephenson)

Ram Rangan (NVIDIA)

Daniel Johnson (NVIDIA)

Rishkul Kulkarni (NVIDIA)

[Steve Keckler](/index.php/person/stephen-keckler)

 

 

 ## Publication Date



Thursday, January 13, 2022

 

 ## Published in



[International Symposium on High-Performance Computer Architecture (HPCA)](https://ieeexplore.ieee.org/document/9773183)

 

 ## Research Area



[Computer Architecture](/index.php/research-area/computer-architecture)

[Computer Graphics](/index.php/research-area/computer-graphics)

[Real-Time Rendering](/index.php/research-area/real-time-rendering)

 

 

 ## External Links



[IEEE Digital Library](https://ieeexplore.ieee.org/document/9773183)

 

 

 ## Uploaded Files



[Published manuscript](https://d1qx31qr3h6wln.cloudfront.net/publications/HPCA_2022_SubwarpInterleaving.pdf "Open file in new window")2.19 MB