  Shalini De Mello  

 



  ![](/sites/default/files/person/Shalini-De-Mello-6117_1920x1080_small2%20copy.jpg)

  

 Shalini De Mello is a Director of Research, New Experiences, leading NVIDIA's[ AI-Mediated Reality and Interaction (AMRI) Research](https://research.nvidia.com/labs/amri/) group.

Previously, she was a Distinguished Research Scientist in the [Learning and Perception Research](https://research.nvidia.com/labs/lpr/) group at NVIDIA. Her research interests are in AI, computer vision and human-computer interaction. She has co-authored scores of peer-reviewed publications and patents, serves as an area chair and is a frequent keynote presenter at all top-tier AI conferences. Her inventions have contributed to several NVIDIA AI products, including [DriveIX](https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.com%2Fv3%2F__https%3A%2Fwww.nvidia.com%2Fen-us%2Fself-driving-cars%2Fdrive-ix%2F__%3B!!DZ3fjg!6aSHrb26l4qhlR5zhxudifFBi5bANgQnmkMOBcbz2zTchYZ9OcQrBUwKZmOvQAIxdPnyF9yGh8YHnfqp5w%24&data=05%7C01%7Cshalinig%40nvidia.com%7Cc546756f3fc74fc15b9408dbcf388259%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638331614148347616%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=DzJrv8fXPTWsnsu9XipR16vgm6FpaIDTfr2ChTzptK8%3D&reserved=0), [Maxine](https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.com%2Fv3%2F__https%3A%2Fdeveloper.nvidia.com%2Fmaxine__%3B!!DZ3fjg!6aSHrb26l4qhlR5zhxudifFBi5bANgQnmkMOBcbz2zTchYZ9OcQrBUwKZmOvQAIxdPnyF9yGh8aBhLYiZw%24&data=05%7C01%7Cshalinig%40nvidia.com%7Cc546756f3fc74fc15b9408dbcf388259%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638331614148347616%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=j76j6O7F4dEAV6nmlGyk85XfzIRu7PIJE1OAXwAmA4M%3D&reserved=0) and [TAO Toolkit](https://developer.nvidia.com/tao-toolkit). She received her Doctoral and Master’s degrees in Electrical and Computer Engineering from the University of Texas at Austin. For details see her [Curriculum Vitae](https://drive.google.com/file/d/1cvrw0A4Ob4T6ld4FQQGsPQv9JihLGp6p/view?usp=share_link).

Shalini's [AMRI research group](https://research.nvidia.com/labs/amri/) spans algorithms, theory and applications of AI to human-computer interaction. A particular interest is in understanding ***human social interactions*** and modeling ***dynamic 4D neural worlds*** to build ***interactive physical AIs***.



   Research Area(s)

[Artificial Intelligence and Machine Learning ](/research-area/machine-learning-artificial-intelligence)

[Computer Graphics](/research-area/computer-graphics)

[Computer Vision](/research-area/computer-vision)

[Generative AI](/research-area/generative-ai)

[Human Computer Interaction](/research-area/human-computer-interaction)

[VR, AR and Display Technology](/research-area/virtual-augmented-reality)

 

 

  

 Main Field of Interest

[Computer Vision](/research-area/computer-vision)

 

  

 Google Scholar

[https://scholar.google.com/citations?hl=en&amp;user=xQM4BlMAAAAJ&amp;cstart=20&amp;view\_op=…](https://scholar.google.com/citations?hl=en&user=xQM4BlMAAAAJ&cstart=20&view_op=list_works&gmla=AJsN-F6zkesXD4mmXVx7PyL-YlEtF-ASlISyJajX59JAebwUNomtT29jnA5h8V1eGkhaXwp2zi4Aiiw6c0FOkRa-h7wrzR0ntw)

 

  

 

 

 



 ### Publications

 

### 2026 

[Alpha-Vision: A Real-Time Always-on Vision Processor with 787µs Face Detection Latency in &lt;5mW](/publication/2026-02_alpha-vision-real-time-always-vision-processor-787ms-face-detection-latency)

[Ben Keller](/person/ben-keller), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), [Steve Dai](/person/steve-dai), [Jason Clemons](/person/jason-clemons), [Matt Fojtik](/person/matt-fojtik), [Muya Chang](/person/muya-chang), Thierry Tambe, [Nathaniel Pinckney](/person/nathaniel-pinckney), [Stephen Tell](/person/stephen-tell), [Qijing Jenny Huang](/person/qijing-jenny-huang), [Shalini De Mello](/person/shalini-de-mello), [Brucek Khailany](/person/brucek-khailany)



[ISSCC 2026](https://www.isscc.org/)









### 2025 

[Play4D: Accelerated and Interactive Free-viewpoint Video Streaming for Virtual Reality and Light Field Displays](/index.php/publication/2025-12_play4d-accelerated-and-interactive-free-viewpoint-video-streaming-virtual)

[Jonghyun Kim](/index.php/person/jonghyun-kim), [Michael Stengel](/index.php/person/michael-stengel), [Amrita Mazumdar](/index.php/person/amrita-mazumdar), [Tianye Li](/index.php/person/tianye-li), [Cheng Sun](/index.php/person/cheng-sun), [David Luebke](/index.php/person/david-luebke), [Shalini De Mello](/index.php/person/shalini-de-mello)



[ACM SIGGRAPH Emerging Technologies 2025](https://sa2025.conference-schedule.org/presentation/?id=emt_131&sess=sess182)









[Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation](/index.php/publication/2025-11_seeing-what-matters-generalizable-ai-generated-video-detection-forensic)

Riccardo Corvi, Davide Cozzolino, [Ekta Prashnani](/index.php/person/ekta-prashnani), [Shalini De Mello](/index.php/person/shalini-de-mello), [Koki Nagano](/index.php/person/koki-nagano), Luisa Verdoliva



[Advances in Neural Information Processing Systems (NeurIPS) 2025](https://neurips.cc/virtual/2025/loc/san-diego/poster/117010)









[Real-time 3D Visualization of Radiance Fields on Light Field Displays](/index.php/publication/2025-08_real-time-3d-visualization-radiance-fields-light-field-displays)

[Jonghyun Kim](/index.php/person/jonghyun-kim), [Cheng Sun](/index.php/person/cheng-sun), [Michael Stengel](/index.php/person/michael-stengel), Matthew Chan, Andrew Russell, Jaehyun Jung, Wil Braithewaite, [Shalini De Mello](/index.php/person/shalini-de-mello), [David Luebke](/index.php/person/david-luebke)



[ArXiv](https://arxiv.org/abs/2508.18540)









[GAIA: Generative Animatable Interactive Avatars with Expression-conditioned Gaussians](/publication/2025-08_gaia-generative-animatable-interactive-avatars-expression-conditioned-gaussians)

Zhengming Yu, [Tianye Li](/person/tianye-li), Jingxiang Sun, [Omer Shapira](/person/omer-shapira), [Seonwook Park](/person/seonwook-park), [Michael Stengel](/person/michael-stengel), Matthew Chan, Xin Li, Wenping Wang, [Koki Nagano](/person/koki-nagano), [Shalini De Mello](/person/shalini-de-mello)



[ACM SIGGRAPH 2025](https://dl.acm.org/doi/10.1145/3721238.3730737)









[Coherent 3D Portrait Video Reconstruction via Triplane Fusion](/publication/2025-06_coherent-3d-portrait-video-reconstruction-triplane-fusion)

Shengze Wang, [Xueting Li](/person/xueting-li), [Chao Liu](/person/chao-liu), Matthew Chan, [Michael Stengel](/person/michael-stengel), Henry Fuchs, [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Wang_Coherent_3D_Portrait_Video_Reconstruction_via_Triplane_Fusion_CVPR_2025_paper.pdf)









[SimAvatar: Simulation-Ready Clothed Gaussian Avatars from Text](/publication/2025-06_simavatar-simulation-ready-clothed-gaussian-avatars-text)

[Xueting Li](/person/xueting-li), [Ye Yuan](/person/ye-yuan), [Shalini De Mello](/person/shalini-de-mello), Gilles Daviet, Jonathan Leaf, Miles Macklin, [Jan Kautz](/person/jan-kautz), [Umar Iqbal](/person/umar-iqbal)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Li_SimAvatar_Simulation-Ready_Avatars_with_Layered_Hair_and_Clothing_CVPR_2025_paper.pdf)









[BLADE: Single-view Body Mesh Estimation through Accurate Depth Estimation](/index.php/publication/2025-06_blade-single-view-body-mesh-estimation-through-accurate-depth-estimation)

Shengze Wang, [Jiefeng Li](/index.php/person/jiefeng-li), [Tianye Li](/index.php/person/tianye-li), [Ye Yuan](/index.php/person/ye-yuan), Henry Fuchs, [Koki Nagano](/index.php/person/koki-nagano), [Shalini De Mello](/index.php/person/shalini-de-mello), [Michael Stengel](/index.php/person/michael-stengel)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Wang_BLADE_Single-view_Body_Mesh_Estimation_through_Accurate_Depth_Estimation_CVPR_2025_paper.pdf)









[AI 3D Selfie: Real-Time Single-Image 3D Face Reconstruction for Light-Field Displays](/publication/2025-05_ai-3d-selfie-real-time-single-image-3d-face-reconstruction-light-field-displays)

[Jonghyun Kim](/person/jonghyun-kim), [Michael Stengel](/person/michael-stengel), Matthew Chan, [Koki Nagano](/person/koki-nagano), [Shalini De Mello](/person/shalini-de-mello), [David Luebke](/person/david-luebke)



[The Society of Information Display (SID) 2025](https://sid.onlinelibrary.wiley.com/doi/abs/10.1002/sdtp.18377)









### 2024 

[QUEEN: QUantized Efficient ENcoding for Streaming Free-viewpoint Videos](/index.php/publication/2024-12_queen-quantized-efficient-encoding-streaming-free-viewpoint-videos)

Sharath Girish, [Tianye Li](/index.php/person/tianye-li), [Amrita Mazumdar](/index.php/person/amrita-mazumdar), Abhinav Shrivastava, [David Luebke](/index.php/person/david-luebke), [Shalini De Mello](/index.php/person/shalini-de-mello)



[Advances in Neural Information Processing Systems (NeurIPS) 2024](https://proceedings.neurips.cc/paper_files/paper/2024/hash/4c9477b9e2c7ec0ad3f4f15077aaf85a-Abstract-Conference.html)









[CosAE: Learnable Fourier Series for Image Restoration](/publication/2024-12_cosae-learnable-fourier-series-image-restoration)

[Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz)



[Advances in Neural Information Processing Systems (NeurIPS) 2024](https://proceedings.neurips.cc/paper_files/paper/2024/file/13e8be77982beb73d7ed0bbf122f9f3c-Paper-Conference.pdf)









[Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos](/publication/2024-09_avatar-fingerprinting-authorized-use-synthetic-talking-head-videos)

[Ekta Prashnani](/person/ekta-prashnani), [Koki Nagano](/person/koki-nagano), [Shalini De Mello](/person/shalini-de-mello), [David Luebke](/person/david-luebke), Orazio Gallo



[European Conference on Computer Vision (ECCV) 2024](https://eccv.ecva.net)









[What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs](/publication/2024-06_what-you-see-what-you-gan-rendering-every-pixel-high-fidelity-geometry-3d-gans)

[Alexander Trevithick](/person/alexander-trevithick), Matthew Chan, Towaki Takikawa, [Umar Iqbal](/person/umar-iqbal), [Shalini De Mello](/person/shalini-de-mello), Manmohan Chandraker, Ravi Ramamoorthi, [Koki Nagano](/person/koki-nagano)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Trevithick_What_You_See_is_What_You_GAN_Rendering_Every_Pixel_CVPR_2024_paper.pdf)









[GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning](/publication/2024-06_gavatar-animatable-3d-gaussian-avatars-implicit-mesh-learning)

[Ye Yuan](/person/ye-yuan), [Xueting Li](/person/xueting-li), Yangyi Huang, [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano), [Jan Kautz](/person/jan-kautz), [Umar Iqbal](/person/umar-iqbal)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Yuan_GAvatar_Animatable_3D_Gaussian_Avatars_with_Implicit_Mesh_Learning_CVPR_2024_paper.pdf)



Highlight





[Dream-in-4D: A Unified Approach for Text- and Image-guided 4D Scene Generation](/publication/2024-06_dream-4d-unified-approach-text-and-image-guided-4d-scene-generation)

Yufeng Zheng, [Xueting Li](/person/xueting-li), [Koki Nagano](/person/koki-nagano), [Sifei Liu](/person/sifei-liu), Otmar Hilliges, [Shalini De Mello](/person/shalini-de-mello)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Zheng_A_Unified_Approach_for_Text-_and_Image-guided_4D_Scene_Generation_CVPR_2024_paper.pdf)









[RegionGPT: Towards Region Understanding Vision Language Model](/publication/2024-06_regiongpt-towards-region-understanding-vision-language-model)

Qiushan Guo, [Shalini De Mello](/person/shalini-de-mello), [Hongxu Danny Yin](/person/danny-yin), [Wonmin Byeon](/person/wonmin-byeon), Ka Chun Cheung, Yizhou Yu, Ping Luo, [Sifei Liu](/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Guo_RegionGPT_Towards_Region_Understanding_Vision_Language_Model_CVPR_2024_paper.pdf)









[3D Reconstruction with Generalizable Neural Fields using Scene Priors](/publication/2024-05_3d-reconstruction-generalizable-neural-fields-using-scene-priors)

Yang Fu, [Shalini De Mello](/person/shalini-de-mello), [Xueting Li](/person/xueting-li), Amey Kulkarni, [Jan Kautz](/person/jan-kautz), Xiaolong Wang, [Sifei Liu](/person/sifei-liu)



[International Conference on Learning Representations (ICLR) 2024](https://proceedings.iclr.cc/paper_files/paper/2024/hash/0bd32794b26cfc99214b89313764da8e-Abstract-Conference.html)









### 2023 

[Convolutional State Space Models for Long-Range Spatiotemporal Modeling](/publication/2023-12_convolutional-state-space-models-long-range-spatiotemporal-modeling)

Jimmy T. H. Smith, [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz), Scott Linderman, [Wonmin Byeon](/person/wonmin-byeon)



[Advances in Neural Information Processing Systems (NeurIPS) 2023](https://nips.cc/)









[Generalizable One-shot 3D Neural Head Avatar](/publication/2023-12_generalizable-one-shot-3d-neural-head-avatar)

[Xueting Li](/person/xueting-li), [Shalini De Mello](/person/shalini-de-mello), [Sifei Liu](/person/sifei-liu), [Koki Nagano](/person/koki-nagano), [Umar Iqbal](/person/umar-iqbal), [Jan Kautz](/person/jan-kautz)



[Advances in Neural Information Processing Systems (NeurIPS) 2023](https://neurips.cc/)









[Generative Novel View Synthesis with 3D-Aware Diffusion Models](/publication/2023-10_generative-novel-view-synthesis-3d-aware-diffusion-models)

Eric R. Chan, [Koki Nagano](/person/koki-nagano), Matthew Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy, [Miika Aittala](/person/miika-aittala), [Shalini De Mello](/person/shalini-de-mello), [Tero Karras](/person/tero-karras), Gordon Wetzstein



[International Conference on Computer Vision (ICCV) 2023](https://iccv2023.thecvf.com/)



Oral





[AI-Mediated 3D Video Conferencing](/index.php/publication/2023-08_ai-mediated-3d-video-conferencing)

[Michael Stengel](/index.php/person/michael-stengel), [Koki Nagano](/index.php/person/koki-nagano), [Chao Liu](/index.php/person/chao-liu), Matthew Chan, Alex Trevithick, [Shalini De Mello](/index.php/person/shalini-de-mello), [Jonghyun Kim](/index.php/person/jonghyun-kim), [David Luebke](/index.php/person/david-luebke), [Amrita Mazumdar](/index.php/person/amrita-mazumdar), Shengze Wang, Mayoore Jaiswal



[ACM SIGGRAPH Emerging Technologies 2023](https://dl.acm.org/doi/abs/10.1145/3588037.3595385)









[Affordance Diffusion: Synthesizing Hand-Object Interactions](/publication/2023-06_affordance-diffusion-synthesizing-hand-object-interactions)

Yufei Ye, [Xueting Li](/person/xueting-li), Abhinav Gupta, [Shalini De Mello](/person/shalini-de-mello), [Stan Birchfield](/person/stan-birchfield), Jiaming Song, Shubham Tulsiani, [Sifei Liu](/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)









[Zero-shot Pose Transfer for Unrigged Stylized 3D Characters](/publication/2023-06_zero-shot-pose-transfer-unrigged-stylized-3d-characters)

Jiashun Wang, [Xueting Li](/person/xueting-li), [Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), Orazio Gallo, Xiaolong Wang, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://openaccess.thecvf.com/content/CVPR2023/papers/Wang_Zero-Shot_Pose_Transfer_for_Unrigged_Stylized_3D_Characters_CVPR_2023_paper.pdf)









[GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields](/publication/2023-06_gazenerf-3d-aware-gaze-redirection-neural-radiance-fields)

Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li, [Shalini De Mello](/person/shalini-de-mello), Hyung Jin Chang, Xucong Zhang, Otmar Hilliges



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)









[Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models](/publication/2023-06_open-vocabulary-panoptic-segmentation-text-image-diffusion-models)

Jiarui Xu, [Sifei Liu](/person/sifei-liu), [Arash Vahdat](/person/arash-vahdat), [Wonmin Byeon](/person/wonmin-byeon), Xiaolong Wang, [Shalini De Mello](/person/shalini-de-mello)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)



Hightlight top 10%





[GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation](/publication/2023-05_gpvit-high-resolution-non-hierarchical-vision-transformer-group-propagation)

Chenhongyi Yang, Jiarui Xu, [Shalini De Mello](/person/shalini-de-mello), Elliot J. Crowley, Xiaolong Wang



[International Conference on Learning Representations (ICLR) 2023](https://iclr.cc/virtual/2023/poster/11986)



Notable top 25%, Oral





### 2022 

[CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs](/publication/2022-07_coordgan-self-supervised-dense-correspondences-emerge-gans)

Jiteng Mu, [Shalini De Mello](/person/shalini-de-mello), [Zhiding Yu](/person/zhiding-yu), Nuno Vasconcelos, Xiaolong Wang, [Sifei Liu](/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[GroupViT: Semantic Segmentation Emerges from Text Supervision](/publication/2022-06_groupvit-semantic-segmentation-emerges-text-supervision)

Jiarui Xu, [Shalini De Mello](/person/shalini-de-mello), [Sifei Liu](/person/sifei-liu), [Wonmin Byeon](/person/wonmin-byeon), [Thomas Breuel](/person/thomas-breuel), [Jan Kautz](/person/jan-kautz), Xiaolong Wang



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[Efficient Geometry-aware 3D Generative Adversarial Networks](/publication/2022-06_efficient-geometry-aware-3d-generative-adversarial-networks)

Eric R. Chan, Connor Z. Lin, Matthew A. Chan, [Koki Nagano](/person/koki-nagano), Boxiao Pan, [Shalini De Mello](/person/shalini-de-mello), Orazio Gallo, Leonidas Guibas, [Jonathan Tremblay](/person/jonathan-tremblay), Sameh Khamis, [Tero Karras](/person/tero-karras), Gordon Wetzstein



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)



Oral





[FreeSOLO: Learning to Segment Objects without Annotations](/publication/2022-06_freesolo-learning-segment-objects-without-annotations)

Xinlong Wang, [Zhiding Yu](/person/zhiding-yu), [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz), Anima Anandkumar, Chunhua Shen, Jose M. Alvarez



[ IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[Learning Continuous Environment Fields via Implicit Functions](/publication/2022-04_learning-continuous-environment-fields-implicit-functions)

[Xueting Li](/person/xueting-li), [Shalini De Mello](/person/shalini-de-mello), Xiaolong Wang, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz), [Sifei Liu](/person/sifei-liu)



[International Conference on Learning Representations (ICLR), 2022](https://iclr.cc/Conferences/2022)









[Learning Contrastive Representation for Semantic Correspondence](/publication/2022-03_learning-contrastive-representation-semantic-correspondence)

Taihong Xiao, [Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), [Zhiding Yu](/person/zhiding-yu), [Jan Kautz](/person/jan-kautz), Ming-Hsuan Yang



[International Journal of Computer Vision (IJCV) 2022](https://link.springer.com/article/10.1007/s11263-022-01602-y)









### 2021 

[Self-Supervised Object Detection via Generative Image Synthesis](/publication/2021-10_self-supervised-object-detection-generative-image-synthesis)

Siva Karthik Mustikovela, [Shalini De Mello](/person/shalini-de-mello), Aayush Prakash, [Umar Iqbal](/person/umar-iqbal), [Sifei Liu](/person/sifei-liu), Thu Nguyen-Phuoc, Carsten Rother, [Jan Kautz](/person/jan-kautz)



[International Conference on Computer Vision (ICCV) 2021](https://iccv2021.thecvf.com/)









[Weakly-Supervised Physically Unconstrained Gaze Estimation](/publication/2021-06_weakly-supervised-physically-unconstrained-gaze-estimation)

Rakshit Kothari, [Shalini De Mello](/person/shalini-de-mello), [Umar Iqbal](/person/umar-iqbal), [Wonmin Byeon](/person/wonmin-byeon), Seonwook Park, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021](http://cvpr2021.thecvf.com/)



Oral





[Learning to Track Instances without Video Annotations](/publication/2021-06_learning-track-instances-without-video-annotations)

Yang Fu, [Sifei Liu](/person/sifei-liu), [Umar Iqbal](/person/umar-iqbal), [Shalini De Mello](/person/shalini-de-mello), Humphrey Shi, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021](http://cvpr2021.thecvf.com/)



Oral





[Contrastive Syn-to-Real Generalization](/publication/2021-05_contrastive-syn-real-generalization)

Wuyang Chen, [Zhiding Yu](/person/zhiding-yu), [Shalini De Mello](/person/shalini-de-mello), [Sifei Liu](/person/sifei-liu), Jose M. Alvarez, Zhangyang Wang, Anima Anandkumar



[International Conference on Learning Representations (ICLR) 2021](https://openreview.net/group?id=ICLR.cc/2021/Conference)









### 2020 

[Online Adaptation for Consistent Mesh Reconstruction in the Wild](/publication/2020-12_online-adaptation-consistent-mesh-reconstruction-wild)

Xueting Li, [Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), Kihwan Kim, Xiaolong Wang, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[Neural Information Processing Systems (NeurIPS) 2020](https://nips.cc/virtual/2020/public/poster_aba3b6fd5d186d28e06ff97135cade7f.html)









[Self-Learning Transformations for Improving Gaze and Head Redirection](/publication/2020-12_self-learning-transformations-improving-gaze-and-head-redirection)

Yufeng Zheng, Seonwook Park, Xucong Zhang, [Shalini De Mello](/person/shalini-de-mello), Otmar Hilliges



[ Neural Information Processing Systems (NeurIPS) 2020](https://proceedings.neurips.cc/paper/2020)









[Self-supervised Single-view 3D Reconstruction via Semantic Consistency](/publication/2020-08_self-supervised-single-view-3d-reconstruction-semantic-consistency)

Xueting Li, [Sifei Liu](/person/sifei-liu), Kihwan Kim, [Shalini De Mello](/person/shalini-de-mello), Varun Jampani, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[European Conference on Computer Vision (ECCV) 2020](https://eccv2020.eu/)









[Self-Supervised Viewpoint Learning From Image Collections ](/publication/2020-06_self-supervised-viewpoint-learning-image-collections)

Siva Karthik Mustikovela, Varun Jampani, [Shalini De Mello](/person/shalini-de-mello), [Sifei Liu](/person/sifei-liu), [Umar Iqbal](/person/umar-iqbal), Carsten Rother, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









### 2019 

[Joint-task Self-supervised Learning for Temporal Correspondence ](/publication/2019-12_joint-task-self-supervised-learning-temporal-correspondence)

Xueting Li, [Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), Xiaolong Wang, [Jan Kautz](/person/jan-kautz), Ming-Hsuan Yang



[Neural Information Processing Systems (NeurIPS) 2019](https://sites.google.com/view/uvc2019)









[Content-Consistent Generation of Realistic Eyes with Style ](/publication/2019-11_content-consistent-generation-realistic-eyes-style)

Marcel Bühler , Seonwook Park, [Shalini De Mello](/person/shalini-de-mello), Xucong, Otmar Hilliges



International Conference on Computer Vision Workshop (ICCVW) 2019



Winner (1st place) Synthetic Eye Generation Challenge





[Few-Shot Adaptive Gaze Estimation ](/publication/2019-10_few-shot-adaptive-gaze-estimation)

Seonwook Park, [Shalini De Mello](/person/shalini-de-mello), [Pavlo Molchanov](/person/pavlo-molchanov), [Umar Iqbal](/person/umar-iqbal), Otmar Hilliges, [Jan Kautz](/person/jan-kautz)



[International Conference on Computer Vision (ICCV) 2019](http://iccv2019.thecvf.com/program/overview)



Oral





[Learning Propagation for Arbitrarily-Structured Data](/publication/2019-09_learning-propagation-arbitrarily-structured-data)

[Sifei Liu](/person/sifei-liu), Xueting Li, Varun Jampani, [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz)



[International Conference on Computer Vision (ICCV) 2019](http://openaccess.thecvf.com/content_ICCV_2019/html/Liu_Learning_Propagation_for_Arbitrarily-Structured_Data_ICCV_2019_paper.htm)









[Few-Shot Viewpoint Estimation](/publication/2019-09_few-shot-viewpoint-estimation)

Hung-Yu Tseng, [Shalini De Mello](/person/shalini-de-mello), [Jonathan Tremblay](/person/jonathan-tremblay), [Sifei Liu](/person/sifei-liu), [Stan Birchfield](/person/stan-birchfield), Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[British Machine Vision Conference (BMVC) 2019](https://bmvc2019.org/programme/)









[NVGaze: An Anatomically-Informed Dataset for Low-Latency, Near-Eye Gaze Estimation](/index.php/publication/2019-05_nvgaze-anatomically-informed-dataset-low-latency-near-eye-gaze-estimation)

[Joohwan Kim](/index.php/person/joohwan-kim), [Michael Stengel](/index.php/person/michael-stengel), Alexander Majercik, [Shalini De Mello](/index.php/person/shalini-de-mello), David Dunn, [Samuli Laine](/index.php/person/samuli-laine), Morgan McGuire, [David Luebke](/index.php/person/david-luebke)



ACM Conference on Human-Computer-Interaction (CHI) 2019









### 2018 

[Switchable Temporal Propagation Network ](/index.php/publication/2018-09_switchable-temporal-propagation-network)

[Sifei Liu](/index.php/person/sifei-liu), Guangyu Zhong, [Shalini De Mello](/index.php/person/shalini-de-mello), [Jinwei Gu](/index.php/person/jinwei-gu), Varun Jampani



[European Conference on Computer Vision (ECCV) 2018](http://faculty.ucmerced.edu/mhyang/papers/eccv2018_stpn.pdf)









[Light-weight Head Pose Invariant Gaze Tracking ](/publication/2018-06_light-weight-head-pose-invariant-gaze-tracking)

Rajeev Ranjan, [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz)



[IEEE Computer Vision and Pattern Recognition Workshop (CVPRW) 2018](http://cvpr2018.thecvf.com/program/workshops)



Best Paper (runner up) Workshop on Analysis and Modeling of Faces and Gestures





### 2017 

[Learning Affinity via Spatial Propagation Networks ](/publication/2017-12_learning-affinity-spatial-propagation-networks)

[Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), [Jinwei Gu](/person/jinwei-gu), Guangyu Zhong, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[Conference on Neural Information Processing Systems (NIPS) 2017](https://nips.cc/)









[Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network](/publication/2017-07_dynamic-facial-analysis-bayesian-filtering-recurrent-neural-network)

[Jinwei Gu](/person/jinwei-gu), [Xiaodong Yang](/person/xiaodong-yang), [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017](http://cvpr2017.thecvf.com/)









### 2016 

[Towards Selecting Robust Hand Gestures for Automotive Interfaces](/index.php/publication/2016-06_towards-selecting-robust-hand-gestures-automotive-interfaces-0)

[Shalini Gupta](/index.php/person/shalini-de-mello), [Pavlo Molchanov](/index.php/person/pavlo-molchanov), Xiaodong Yang, [Stephen Tyree](/index.php/person/stephen-tyree), [Jan Kautz](/index.php/person/jan-kautz)



[IEEE Intelligent Vehicles Symposium (IV) 2016](http://iv2016.org/)









[Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks ](/publication/2016-06_online-detection-and-classification-dynamic-hand-gestures-recurrent-3d)

[Pavlo Molchanov](/person/pavlo-molchanov), [Xiaodong Yang](/person/xiaodong-yang), [Shaline Gupta](/person/shalini-de-mello), Kihwan Kim, [Stephen Tyree](/person/stephen-tyree), [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016](http://cvpr2016.thecvf.com/)









### 2015 

[Robust Model-based 3D Head Pose Estimation](/publication/2015-12_robust-model-based-3d-head-pose-estimation)

Gregory P Meyer, [Shalini Gupta](/person/shalini-de-mello), [Iuri Frosio](/person/iuri-frosio), Dikpal Reddy, [Jan Kautz](/person/jan-kautz)



[IEEE International Conference on Computer Vision (ICCV) 2015](http://pamitc.org/iccv15/)









[Hand Gesture Recognition with 3D Convolutional Neural Networks ](/publication/2015-06_hand-gesture-recognition-3d-convolutional-neural-networks)

[Pavlo Molchanov](/person/pavlo-molchanov), [Shalini Gupta](/person/shalini-de-mello), Kihwan Kim, [Jan Kautz](/person/jan-kautz)



[IEEE Computer Vision and Pattern Recognition Workshop (CVPRW) 2015](http://www.pamitc.org/cvpr15/)



Winner (1st place) Hand Gesture Recognition Challenge





[Multi-sensor System for Driver’s Hand-Gesture Recognition ](/publication/2015-05_multi-sensor-system-driver-s-hand-gesture-recognition)

[Pavlo Molchanov](/person/pavlo-molchanov), [Shalini Gupta](/person/shalini-de-mello), Kihwan Kim, Kari Pulli



[IEEE International Conference on Automatic Face and Gesture Recognition (FG) 20…](http://www.fg2015.org/)