Imad El Hanafi

Interested in distributed systems for large scale training and inference.

https://imadelh.gitlab.io 

Eugene d'Eon

Eugene d'Eon is a Principal Research Scientist at NVIDIA, where he specializes in real-time rendering technologies. His career includes significant contributions during his initial tenure at NVIDIA, where he published influential works on real-time skin rendering. Subsequently, at Weta Digital, he advanced research in the appearance modeling of tissue, hair, and rough surfaces.

CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models

In the task of reference-based image inpainting, an additional reference image is provided to restore a damaged target image to its original state. The advancement of diffusion models, particularly Stable Diffusion, allows for simple formulations in this task. However, existing diffusion-based methods often lack explicit constraints on the correlation between the reference and damaged images, resulting in lower faithfulness to the reference images in the inpainting results.

Spatio-Temporal Context Prompting for Zero-Shot Action Detection

Spatio-temporal action detection encompasses the tasks of localizing and classifying individual actions within a video. Recent works aim to enhance this process by incorporating interaction modeling, which captures the relationship between people and their surrounding context. However, these approaches have primarily focused on fully-supervised learning, and the current limitation lies in the lack of generalization capability to recognize unseen action categories. In this paper, we aim to adapt the pretrained image-language models to detect unseen actions.

DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent

In the advanced technology nodes, the integrated design rule checker (DRC) is often utilized in place and route tools for fast optimization loops for power-performance-area. Implementing integrated DRC checkers to meet the standard of commercial DRC tools demands extensive human expertise to interpret foundry specifications, analyze layouts, and debug code iteratively. However, this labor-intensive process, requiring to be repeated by every update of technology nodes, prolongs the turnaround time of designing circuits.

Ximing Lu

Ximing Lu is a research staff of Large Language Model (LLM) Research at NVIDIA and a Ph.D. candidate at the University of Washington, advised by Professor Yejin Choi. She previously earned her B.S. degree in Computer Science at University of Washington. Her research interest centers around data synthesis, model architecture, science of LLMs, commonsense reasoning, knowledge acquisition, and multimodality. She is a co-recipient of the Best Paper Award at NAACL 2022 and the Outstanding Paper Award at EMNLP 2023.