  Michael Bauer  

 



  ![](/sites/default/files/person/mike-bauer.jpg)

  

 Michael Bauer joined NVIDIA Research in October of 2014 after finishing his PhD in computer science at Stanford University. As part of his thesis, he developed the Legion programming system for high performance supercomputing codes. Legion is currently deployed in several production applications running on the top supercomputers in the world. Michael was also the primary author of both the CudaDMA library and Singe DSL compiler, both of which were early examples of warp-specialized GPU programming.



   Research Area(s)

[High Performance Computing](/index.php/research-area/high-performance-computing)

[Programming Languages, Systems and Tools](/index.php/research-area/programming-languages-systems)

 

 

  

 

 

 



 ### Publications

 

### 2025 

[Task-Based Tensor Computations on Modern GPUs](/index.php/publication/2025-06_task-based-tensor-computations-modern-gpus)

Rohan Yadav, [Michael Garland](/index.php/person/michael-garland), Alex Aiken, [Michael Bauer](/index.php/person/mike-bauer)



[PLDI](https://pldi25.sigplan.org/)









[Composing Distributed Computations Through Task and Kernel Fusion](/index.php/publication/2025-03_composing-distributed-computations-through-task-and-kernel-fusion)

Rohan Yadav, Shiv Sundrum, Wonchan Lee, [Michael Garland](/index.php/person/michael-garland), [Michael Bauer](/index.php/person/mike-bauer), Alex Aiken, Fredrik Kjolstad



[ASPLOS](https://www.asplos-conference.org/asplos2025/)









[Automatic Tracing in Task-Based Runtime Systems](/index.php/publication/2025-03_automatic-tracing-task-based-runtime-systems)

Rohan Yadav, [Michael Bauer](/index.php/person/mike-bauer), David Broman, [Michael Garland](/index.php/person/michael-garland), Alex Aiken, Fredrik Kjolstad



[ASPLOS](https://www.asplos-conference.org/asplos2025/)









### 2023 

[Legate Sparse: Distributed Sparse Computing in Python](/publication/2023-11_legate-sparse-distributed-sparse-computing-python)

Rohan Yadav, Wonchan Lee, [Melih Elibol](/person/melih-elibol), [Taylor Patti](/person/taylor-patti), Manolis Papadakis, [Michael Garland](/person/michael-garland), Alex Aiken, Fredrik Kjolstad, [Michael Bauer](/person/mike-bauer)



[Supercomputing](https://sc23.supercomputing.org/presentation/?id=pap119&sess=sess172)









[Visibility Algorithms for Dynamic Dependence Analysis and Distributed Coherence](/index.php/publication/2023-02_visibility-algorithms-dynamic-dependence-analysis-and-distributed-coherence)

[Michael Bauer](/index.php/person/mike-bauer), Elliott Slaughter, Sean Treichler, Wonchan Lee, [Michael Garland](/index.php/person/michael-garland), Alex Aiken



[PPoPP](https://conf.researchr.org/home/ppopp-2023)









### 2021 

[Scaling Implicit Parallelism via Dynamic Control Replication](/index.php/publication/2021-02_scaling-implicit-parallelism-dynamic-control-replication)

[Michael Bauer](/index.php/person/mike-bauer), Wonchan Lee, Elliott Slaughter, Zhihao Jia, Mario Di Renzo, Manolis Papadakis, Galen Shipman, Patrick McCormick, [Michael Garland](/index.php/person/michael-garland), Alex Aiken



[Principles and Practices of Parallel Programming (PPoPP)](https://ppopp21.sigplan.org/)









### 2019 

[Legate NumPy: Accelerated and Distributed Array Computing](/publication/2019-11_legate-numpy-accelerated-and-distributed-array-computing)

[Michael Bauer](/person/mike-bauer), [Michael Garland](/person/michael-garland)



[The International Conference for High Performance Computing, Networking, Storag…](https://sc19.supercomputing.org/presentation/?id=pap271&sess=sess163)









### 2018 

[Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-based Runtimes](/index.php/publication/2018-11_dynamic-tracing-memoization-task-graphs-dynamic-task-based-runtimes)

Wonchan Lee, Elliott Slaughter, [Michael Bauer](/index.php/person/mike-bauer), Sean Treichler, Todd Warszawski, [Michael Garland](/index.php/person/michael-garland), Alex Aiken



[International Conference for High Performance Computing and Communications (SC'…](https://dl.acm.org/doi/10.5555/3291656.3291702)









### 2017 

[Integrating External Resources with a Task-Based Programming Model](/index.php/publication/2017-12_integrating-external-resources-task-based-programming-model)

Zhihao Jia, Sean Treichler, Galen Shipman, [Michael Bauer](/index.php/person/mike-bauer), Noah Watkins, Carlos Maltzahn, Patrick McCormick, Alex Aiken



[International Conference on High Performance Computing (HiPC)](https://ieeexplore.ieee.org/document/8287762)









[Control Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical Regions](/publication/2017-11_control-replication-compiling-implicit-parallelism-efficient-spmd-logical)

Elliott Slaughter, Wonchan Lee, Sean Treichler, Wen Zhang, [Michael Bauer](/person/mike-bauer), Galen Shipman, Patrick McCormick, Alex Aiken



[ International Conference for High Performance Computing and Communications (SC…](https://dl.acm.org/doi/10.1145/3126908.3126949)









### 2015 

[Verification of Producer-Consumer Synchronization in GPU Programs](/index.php/publication/2015-06_verification-producer-consumer-synchronization-gpu-programs)

[Michael Bauer](/index.php/person/mike-bauer), Rahul Sharma, Alex Aiken