Umar Iqbal

I am a Senior Research Scientist at NVIDIA Research and part of the Machine Learning and Perception group headed by Jan Kautz. Prior to that, I completed my Ph.D. in Computer Science (2014-2018) from the University of Bonn, Germany, under the supervision of Prof. Juergen Gall.

Arash Vahdat

Arash Vahdat is a Research Director, leading the fundamental generative AI research (GenAIR) team at NVIDIA Research. Before joining NVIDIA, he was a research scientist at D-Wave Systems, working on generative learning and its applications in label-efficient training. Before D-Wave, Arash was a research faculty member at Simon Fraser University (SFU), where he led deep learning-based video analysis research and taught master courses on machine learning for big data.

Zi Yan

My research interests focus on Computer Architecture and Operating Systems, especially on Virtual Memory. Virtual memory is a nice middle layer providing good programmability and performance, but now needs to catch up with new Heterogenous Memory Systems.

Jeff Smith

Jeff Smith joined NVIDIA in 2016, working on computer vision and deep learning tools for autonomous aerial vehicles.  He joined NVIDIA Research in 2018 where he develops tools and systems for deep learning in the fields of computer perception and robotics.

Metaoptimization on a Distributed System for Deep Reinforcement Learning

Training intelligent agents through reinforcement learning is a notoriously unstable procedure. Massive parallelization on GPUs and distributed systems has been exploited to generate a large amount of training experiences and consequently reduce instabilities, but the success of training remains strongly influenced by the choice of the hyperparameters.

A Fusion Approach for Multi-Frame Optical Flow Estimation

To date, top-performing optical flow estimation methods only take pairs of consecutive frames into account. While elegant and appealing, the idea of using more than two frames has not yet produced state-of-the-art results. We present a simple, yet effective fusion approach for multi-frame optical flow that benefits from longer-term temporal cues. Our method first warps the optical flow from previous frames to the current, thereby yielding multiple plausible estimates. It then fuses the complementary information carried by these estimates into a new optical flow field.

A Ray-Box Intersection Algorithm and Efficient Dynamic Voxel Rendering

We introduce a novel and efficient method for rendering large models composed of individually-oriented voxels. The core of this method is a new algorithm for computing the intersection point and normal of a 3D ray with an arbitrarily-oriented 3D box, which also has non-rendering applications in GPU physics, such as ray casting and particle collision detection. We measured throughput improvements of 2× to 10× for the intersection operation versus previous ray-box intersection algorithms on GPUs.

Routability-Driven Macro Placement with Embedded CNN-Based Prediction Model

With the dramatic shrink of feature size and the advance of semiconductor technology nodes, numerous and complicated design rules need to be followed, and a chip design can only be taped-out after passing design rule check (DRC). The high design complexity seriously deteriorates design routability, which can be measured by the number of DRC violations after the detailed routing stage. In addition, a modern large-scaled design typically consists of many huge macros due to the wide use of intellectual properties (IPs).

RouteNet: Routability Prediction for Mixed-size Designs using Convolutional Neural Network

Early routability prediction helps designers and tools perform preventive measures so that design rule violations can be avoided in a proactive manner. However, it is a huge challenge to have a predictor that is both accurate and fast. In this work, we study how to leverage convolutional neural network to address this challenge. The proposed method, called RouteNet, can either evaluate the overall routability of cell placement solutions without global routing or predict the locations of DRC (Design Rule Checking) hotspots.