Data-Driven AI for Robotics (DAIR) @ NVIDIA Research

Welcome to the homepage of NVIDIA’s Data-Driven AI for Robotics (DAIR) group, led by Umar Iqbal. We are part of the Learning and Perception Research (LPR) organization within NVIDIA Research.

Our group investigates how robots can learn directly from human data, such as videos, motion capture, and large-scale demonstrations, to acquire skills that generalize across tasks, embodiments, and environments. We work at the intersection of computer vision, machine learning, and robotics, developing models that understand, reconstruct, and imitate human behaviors.

Our research contributes to NVIDIA’s broader vision of foundation models for robotics, combining advances in human motion modeling, human–object and human–scene interaction modeling, physics-based simulation, and embodied intelligence to enable scalable robot learning. Ultimately, we aim to bridge the gap between human understanding and robotic intelligence, advancing the goal of robots that learn by watching humans.

News

March 2026
We released a whole new ecosystem for Human(oid) Motion, including SOMA, Kimodo, GEM, SOMA-Retargeter, and the BONES-SEED dataset.
December 2025
We released SONIC, a state-of-the-art generalist humanoid controller.
July 2025
Four papers accepted to ICCV 2025 including GENMO, GeoMan, AdaHuman, and HumanOLAT.
February 2025
SimAvatar accepted to CVPR 2025!

Members


Umar Iqbal

Team Leader


Jiefeng Li

Computer Vision, Machine Learning, Generative AI


Ye Yuan

3D Vision, Embodied AI, Reinforcement Learning


Xueting Li

Computer Vision, 3D Vision


Yufei Ye

3D Vision, Robotics, Human Motion


Jinhyung (David) Park

Embodied Intelligence, 3D Scene Understanding, Human Modeling


Tianyi Xie

Computer Graphics, Generative AI, 3D Vision

Publications

GEM: A GENeralist Model for Human MOtion