| Research

I am a distinguished research scientist at NVResearch. My current research interest is efficient Large Language Model (LLM) and Vision Language model (VLM). Before joining NVidia, I was a TLM at Google Deepmind where I worked on reinforcement learning, imitation learning on embodied AI. I co-led "SayCan", "RT-1", "RT-2", "RT-X" etc. that have been featured extensively by media, such as New York Times, Washington Post, Forbes, Reuters, TechCrunch, The WIRED, etc.

Read more about Yao Lu (Jason)

Wen-Hao Liu

Wen-Hao Liu, received his Ph.D. degree in Computer Science from National Chiao Tung University, Taiwan in 2013. His research interests include routing, placement, clock synthesis, logic synthesis, and 3D-IC in electronic design automation (EDA) field. Wen-Hao has published more than 40 papers and 15 patents in this field, and he has served on the technical program committee of DAC, ICCAD, ISPD, and ASPDAC. Currently, Wen-Hao works at Nvidia Research as a Principal Research Scientist to explore the solutions for advanced VLSI-related challenges.

Read more about Wen-Hao Liu

Juan Gomez Luna

Read more about Juan Gomez Luna

Graph Neural Networks for Enhanced Decoding of Quantum LDPC Codes

In this work, we propose a fully differentiable iterative decoder for quantum low-density parity-check (LDPC) codes. The proposed algorithm is composed of classical belief propagation (BP) decoding stages and intermediate graph neural network (GNN) layers. Both component decoders are defined over the same sparse decoding graph enabling a seamless integration and scalability to large codes.

Read more about Graph Neural Networks for Enhanced Decoding of Quantum LDPC Codes

A Neural Receiver for 5G NR Multi-user MIMO

We introduce a neural network (NN)-based multiuser multiple-input multiple-output (MU-MIMO) receiver with 5G New Radio (5G NR) physical uplink shared channel (PUSCH) compatibility. The NN architecture is based on convolution layers to exploit the time and frequency correlation of the channel and a graph neural network (GNN) to handle multiple users. The proposed architecture adapts to an arbitrary number of sub-carriers and supports a varying number of multiple-input multiple-output (MIMO) layers and users without the need for any retraining.

Read more about A Neural Receiver for 5G NR Multi-user MIMO

Efficient Transformer Inference with Statically Structured Sparse Attention

Self-attention matrices of Transformers are often highly sparse because the relevant context of each token is typically limited to just a few other tokens in the sequence. To reduce the computational burden of self-attention on Transformer inference, we propose static, structured, sparse attention masks that split attention matrices into dense regions, skipping computations outside these regions while reducing computations inside these regions.

Read more about Efficient Transformer Inference with Statically Structured Sparse Attention

Controlling graph dynamics with reinforcement learning and graph neural networks

We consider the problem of controlling a partially-observed dynamic process on a graph by a limited number of interventions. This problem naturally arises in contexts such as scheduling virus tests to curb an epidemic; targeted marketing in order to promote a product; and manually inspecting posts to detect fake news spreading on social networks. We formulate this setup as a sequential decision problem over a temporal graph process.

Read more about Controlling graph dynamics with reinforcement learning and graph neural networks

Train Hard, Fight Easy: Robust Meta Reinforcement Learning

A major challenge of reinforcement learning (RL) in real-world applications is the variation between environments, tasks or clients. Meta-RL (MRL) addresses this issue by learning a meta-policy that adapts to new tasks. Standard MRL methods optimize the average return over tasks, but often suffer from poor results in tasks of high risk or difficulty. This limits system reliability whenever test tasks are not known in advance. In this work, we propose a robust MRL objective with a controlled robustness level.

Read more about Train Hard, Fight Easy: Robust Meta Reinforcement Learning

From local structures to size generalization in graph neural networks

Graph neural networks (GNNs) can process graphs of different sizes, but their ability to generalize across sizes, specifically from small to large graphs, is still not well understood. In this paper, we identify an important type of data where generalization from small to large graphs is challenging: graph distributions for which the local structure depends on the graph size. This effect occurs in multiple important graph learning domains, including social and biological networks.

Read more about From local structures to size generalization in graph neural networks

Subscribe to