## Computer Vision

### Associated Publications

### 2026

[QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding](/publication/2026-04_qcaleval-benchmarking-vision-language-models-quantum-calibration-plot)

Shuxiang Cao, Zijian Zhang, Abhishek Agarwal, Grace Bratrud, Niyaz R. Beysengulov, Daniel C. Cole, Alejandro Gomez Frieiro, Elena O. Glen, Hao Hsu, Gang Huang, Raymond Jow, Greshma Shaji, Tom Lubowe, [Ligeng Zhu](/person/ligeng-zhu), Luis Mantilla Calderon, Nicola Pancotti, Joel Pendleton, Brandon Severin, Charles Etienne Staub, Sara Sussman, Antti Vepsäläinen, Neel Rajeshbhai Vora, Yilun Xu, Varinia Bernales, Daniel Bowring, Elica Kyoseva, Ivan Rungger, Giulia Semeghini, Sam Stanwyck, Timothy Costa, [Alán Aspuru-Guzik](/person/alan-aspuru-guzik), Krysta Svore













[3D-GENERALIST: Vision-Language-Action Models for Crafting 3D Worlds](/publication/2026-03_3d-generalist-vision-language-action-models-crafting-3d-worlds)

Fan-Yun Sun, Shengguang Wu, Christian Jacobsen, Thomas Yim, Haoming Zou, [Alex Zook](/person/alex-zook), Shangru Li, Yu-Hsin Chou, Ethem Can, Xunlei Wu, Clemens Eppner, [Valts Blukis](/person/valts-blukis), [Jonathan Tremblay](/person/jonathan-tremblay), Jiajun Wu, [Stan Birchfield](/person/stan-birchfield), Nick Haber



[International Conference on 3D Vision 2026](https://3dvconf.github.io/2026/)









[Alpha-Vision: A Real-Time Always-on Vision Processor with 787µs Face Detection Latency in <5mW](/publication/2026-02_alpha-vision-real-time-always-vision-processor-787ms-face-detection-latency)

[Ben Keller](/person/ben-keller), [Rangharajan Venkatesan](/person/rangharajan-venkatesan), [Steve Dai](/person/steve-dai), [Jason Clemons](/person/jason-clemons), [Matt Fojtik](/person/matt-fojtik), [Muya Chang](/person/muya-chang), Thierry Tambe, [Nathaniel Pinckney](/person/nathaniel-pinckney), [Stephen Tell](/person/stephen-tell), [Qijing Jenny Huang](/person/qijing-jenny-huang), [Shalini De Mello](/person/shalini-de-mello), [Brucek Khailany](/person/brucek-khailany)



[ISSCC 2026](https://www.isscc.org/)









### 2025

[Play4D: Accelerated and Interactive Free-viewpoint Video Streaming for Virtual Reality and Light Field Displays](/publication/2025-12_play4d-accelerated-and-interactive-free-viewpoint-video-streaming-virtual)

[Jonghyun Kim](/person/jonghyun-kim), [Michael Stengel](/person/michael-stengel), [Amrita Mazumdar](/person/amrita-mazumdar), [Tianye Li](/person/tianye-li), [Cheng Sun](/person/cheng-sun), [David Luebke](/person/david-luebke), [Shalini De Mello](/person/shalini-de-mello)



[ACM SIGGRAPH Emerging Technologies 2025](https://sa2025.conference-schedule.org/presentation/?id=emt_131&sess=sess182)









[Beyond Behavior Cloning in Autonomous Driving: a Survey of Closed-Loop Training Techniques](/publication/2025-12_beyond-behavior-cloning-autonomous-driving-survey-closed-loop-training)

[Peter Karkus](/person/peter-karkus), [Maximilian Igl](/person/maximilian-igl), [Yuxiao Chen](/person/yuxiao-chen), Kashyap Chitta, Jef Packer, [Bertrand Douillard](/person/bertrand-douillard), [Thomas Tian](/person/thomas-tian), Alexander Naumann, Guillermo Garcia-Cobo, Shuhan Tan, [Alperen Degirmenci](/person/alperen-degirmenci), Alexander Popov, Nikolai Smolyanskiy, Urs Muller, [Boris Ivanovic](/person/boris-ivanovic), [Marco Pavone](/person/marco-pavone)













[RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion](/index.php/publication/2025-12_rayst3r-predicting-novel-depth-maps-zero-shot-object-completion)

Bardienus P. Duisterhof, Jan Oberst, [Bowen Wen](/index.php/person/bowen-wen), [Stan Birchfield](/index.php/person/stan-birchfield), Deva Ramanan, Jeffrey Ichnowski



[NeurIPS 2025](https://neurips.cc/)









[Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation](/publication/2025-11_seeing-what-matters-generalizable-ai-generated-video-detection-forensic)

Riccardo Corvi, Davide Cozzolino, [Ekta Prashnani](/person/ekta-prashnani), [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano), Luisa Verdoliva



[Advances in Neural Information Processing Systems (NeurIPS) 2025](https://neurips.cc/virtual/2025/loc/san-diego/poster/117010)









[Attention on the Sphere](/publication/2025-11_attention-sphere)

[Boris Bonev](/person/boris-bonev), Max Rietmann, Andrea Paris, Alberto Carpentieri, Thorsten Kurth



[NeurIPS 2025](https://neurips.cc/virtual/2025/poster/117783)









[Alpamayo 1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail](/publication/2025-10_alpamayo-r1)

[Marco Pavone](/person/marco-pavone), and many other contributors (listed on page 33 of the paper)













[Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers](/publication/2025-10_task-oriented-human-grasp-synthesis-context-and-task-aware-diffusers)

An-Lun Liu, [Yu-Wei Chao](/person/yu-wei-chao), Yi-Ting Chen



[IEEE/CVF International Conference on Computer Vision (ICCV) 2025](https://iccv.thecvf.com)









[Pedestrian Collision Avoidance in Hemianopia during Natural Walking in Immersive Virtual Reality](/index.php/publication/2025-10_pedestrian-collision-avoidance-hemianopia-during-natural-walking-immersive)

Jonathan K. Doyon, Sujin Kim, Alex D. Hwang, [Jae-Hyun Jung](/index.php/person/jae-hyun-jung)



[arXiv](https://arxiv.org/abs/2510.04218)









[Real-time 3D Visualization of Radiance Fields on Light Field Displays](/publication/2025-08_real-time-3d-visualization-radiance-fields-light-field-displays)

[Jonghyun Kim](/person/jonghyun-kim), [Cheng Sun](/person/cheng-sun), [Michael Stengel](/person/michael-stengel), Matthew Chan, Andrew Russell, Jaehyun Jung, Wil Braithewaite, [Shalini De Mello](/person/shalini-de-mello), [David Luebke](/person/david-luebke)



[arXiv](https://arxiv.org/abs/2508.18540)









[GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control](/publication/2025-08_gen3c-3d-informed-world-consistent-video-generation-precise-camera-control)

Xuanchi Ren, Tianchang Shen, Jiahui Huang, Huan Ling, Yifan Lu, [Merlin Nimier-David](/person/merlin-nimier-david), [Thomas Müller](/person/thomas-muller), [Alex Keller](/person/alex-keller), [Sanja Fidler](/person/sanja-fidler), Jun Gao



[CVPR 2025](https://ieeexplore.ieee.org/document/11092782)









[MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss](/publication/2025-08_maisi-v2-accelerated-3d-high-resolution-medical-image-synthesis-rectified-flow)

[Can Zhao](/person/can-zhao), Pengfei Guo, [Dong Yang](/person/dong-yang), Yucheng Tang, [Yufan He](/person/yufan-he), Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, [Daguang Xu](/person/daguang-xu)



[AAAI 2026](https://arxiv.org/abs/2508.05772)









[Radiance Surfaces: Optimizing Surface Representations with a 5D Radiance Field Loss](/index.php/publication/2025-07_radiance-surfaces-optimizing-surface-representations-5d-radiance-field-loss)

Ziyi Zhang, Nicolas Roussel, [Thomas Müller](/index.php/person/thomas-muller), [Tizian Zeltner](/index.php/person/tizian-zeltner), [Merlin Nimier-David](/index.php/person/merlin-nimier-david), [Fabrice Rousselle](/index.php/person/fabrice-rousselle), Wenzel Jakob



[SIGGRAPH 2025](https://s2025.siggraph.org/)









[Identity-Motion Trade-offs in Text-to-Video Generation](/publication/2025-07_identity-motion-trade-offs-text-video-generation)

[Yuval Atzmon](/person/yuval-atzmon), Rinon Gal, [Yoad Tewel](/person/yoad-tewel), [Yoni Kasten](/person/yoni-kasten), [Gal Chechik](/person/gal-chechik)



[BMVC 2025](https://bmvc2025.bmva.org/proceedings/159/)









[FoundationStereo: Zero-Shot Stereo Matching](/publication/2025-06_foundationstereo-zero-shot-stereo-matching)

[Bowen Wen](/person/bowen-wen), Matthew Trepte, Joseph Aribido, [Jan Kautz](/person/jan-kautz), Orazio Gallo, [Stan Birchfield](/person/stan-birchfield)



[CVPR 2025](https://cvpr.thecvf.com/)



Best Paper Nomination





[Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds](/publication/2025-06_adapting-unknown-training-free-audio-visual-event-perception-dynamic-thresholds)

Eitan Shaar, Ariel Shaulov, [Gal Chechik](/person/gal-chechik), Lior Wolf



[CVPR 2025](https://cvpr.thecvf.com/virtual/2025/poster/34008)









[RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression](/publication/2025-06_rl-rc-dot-block-level-rl-agent-task-aware-video-compression)

Uri Gadot, Assaf Shocher, [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik), [Assaf Hallak](/person/assaf-hallak)



[CVPR 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Gadot_RL-RC-DoT_A_Block-level_RL_agent_for_Task-Aware_Video_Compression_CVPR_2025_paper.pdf)









[TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features](/publication/2025-06_tritex-learning-texture-single-mesh-triplane-semantic-features)

Dana Cohen-Bar, Daniel Cohen-Or, [Gal Chechik](/person/gal-chechik), [Yoni Kasten](/person/yoni-kasten)



[CVPR 2025](https://cvpr.thecvf.com/virtual/2025/poster/34561)









[BLADE: Single-view Body Mesh Estimation through Accurate Depth Estimation](/publication/2025-06_blade-single-view-body-mesh-estimation-through-accurate-depth-estimation)

Shengze Wang, [Jiefeng Li](/person/jiefeng-li), [Tianye Li](/person/tianye-li), [Ye Yuan](/person/ye-yuan), Henry Fuchs, [Koki Nagano](/person/koki-nagano), [Shalini De Mello](/person/shalini-de-mello), [Michael Stengel](/person/michael-stengel)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Wang_BLADE_Single-view_Body_Mesh_Estimation_through_Accurate_Depth_Estimation_CVPR_2025_paper.pdf)









[SimAvatar: Simulation-Ready Clothed Gaussian Avatars from Text](/publication/2025-06_simavatar-simulation-ready-clothed-gaussian-avatars-text)

[Xueting Li](/person/xueting-li), [Ye Yuan](/person/ye-yuan), [Shalini De Mello](/person/shalini-de-mello), Gilles Daviet, Jonathan Leaf, Miles Macklin, [Jan Kautz](/person/jan-kautz), [Umar Iqbal](/person/umar-iqbal)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Li_SimAvatar_Simulation-Ready_Avatars_with_Layered_Hair_and_Clothing_CVPR_2025_paper.pdf)









[Coherent 3D Portrait Video Reconstruction via Triplane Fusion](/publication/2025-06_coherent-3d-portrait-video-reconstruction-triplane-fusion)

Shengze Wang, [Xueting Li](/person/xueting-li), [Chao Liu](/person/chao-liu), Matthew Chan, [Michael Stengel](/person/michael-stengel), Henry Fuchs, [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Wang_Coherent_3D_Portrait_Video_Reconstruction_via_Triplane_Fusion_CVPR_2025_paper.pdf)









[GRS: Generating robotic simulation tasks from real-world images](/publication/2025-06_grs-generating-robotic-simulation-tasks-real-world-images-0)

[Alex Zook](/person/alex-zook), [Josef Spjut](/person/josef-spjut), [Jonathan Tremblay](/person/jonathan-tremblay)



[CVPR 2025](https://cvpr.thecvf.com/Conferences/2025)









[MambaVision: A Hybrid Mamba-Transformer Vision Backbone](/publication/2025-06_mambavision-hybrid-mamba-transformer-vision-backbone)

Ali Hatamizadeh, [Jan Kautz](/person/jan-kautz)



[The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025](https://cvpr.thecvf.com/)









[RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics](/index.php/publication/2025-06_robospatial-teaching-spatial-understanding-2d-and-3d-vision-language-models)

Chan Hee Song, [Valts Blukis](/index.php/person/valts-blukis), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Stephen Tyree](/index.php/person/stephen-tyree), Yu Su, [Stan Birchfield](/index.php/person/stan-birchfield)



[CVPR 2025](https://cvpr.thecvf.com/)









[SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation](/index.php/publication/2025-05_spot-se3-pose-trajectory-diffusion-object-centric-manipulation)

Cheng-Chun Hsu, [Bowen Wen](/index.php/person/bowen-wen), [Jie Xu](/index.php/person/jie-xu), [Yashraj Narang](/index.php/person/yashraj-narang), [Yuke Zhu](/index.php/person/yuke-zhu), Joydeep Biswas, [Stan Birchfield](/index.php/person/stan-birchfield)



[ICRA 2025](https://2025.ieee-icra.org/)









[AI 3D Selfie: Real-Time Single-Image 3D Face Reconstruction for Light-Field Displays](/publication/2025-05_ai-3d-selfie-real-time-single-image-3d-face-reconstruction-light-field-displays)

[Jonghyun Kim](/person/jonghyun-kim), [Michael Stengel](/person/michael-stengel), Matthew Chan, [Koki Nagano](/person/koki-nagano), [Shalini De Mello](/person/shalini-de-mello), [David Luebke](/person/david-luebke)



[The Society of Information Display (SID) 2025](https://sid.onlinelibrary.wiley.com/doi/abs/10.1002/sdtp.18377)









[LongVILA: Scaling Long-Context Visual Language Models for Long Videos](/publication/2025-04_longvila-scaling-long-context-visual-language-models-long-videos)

[Yukang Chen](/person/yukang-chen), Fuzhao Xue, Dacheng Li, Qinghao Hu, [Ligeng Zhu](/person/ligeng-zhu), Xiuyu Li, Yunhao Fang, Haotian Tang, Shang Yang, Zhijian Liu, Ethan He, Hongxu Yin, [Pavlo Molchanov](/person/pavlo-molchanov), [Jan Kautz](/person/jan-kautz), Linxi Fan, [Yuke Zhu](/person/yuke-zhu), Yao Lu (Jason), [Song Han](/person/song-han)



<https://openreview.net/forum?id=wCXAlfvCy6>









[Multi-student Diffusion Distillation for Better One-step Generators](/publication/2025-03_multi-student-diffusion-distillation-better-one-step-generators)

Yanke Song, Jonathan Lorraine, Weili Nie, [Karsten Kreis](/person/karsten-kreis), James Lucas



[arXiv](https://arxiv.org/pdf/2410.23274)









[LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models](/publication/2025-03_llama-mesh-unifying-3d-mesh-generation-language-models)

Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler, Xiaohui Zeng



[arXiv](https://arxiv.org/pdf/2411.09595)









[CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models](/publication/2025-02_corrfill-enhancing-faithfulness-reference-based-inpainting-correspondence)

Kuan-Hung Liu, Cheng-Kun Yang, [Min-Hung Chen](/person/min-hung-chen), Yu-Lun Liu, Yen-Yu Lin



[Winter Conference on Applications of Computer Vision (WACV) 2025](https://wacv2025.thecvf.com/)









[Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation](/publication/2025-02_semantic-prompt-learning-weakly-supervised-semantic-segmentation)

Ci-Siang Lin, Chien-Yi Wang, [Frank Wang](/person/frank-wang), [Min-Hung Chen](/person/min-hung-chen)



[Winter Conference on Applications of Computer Vision (WACV) 2025](https://wacv2025.thecvf.com/)









[Spatio-Temporal Context Prompting for Zero-Shot Action Detection](/publication/2025-02_spatio-temporal-context-prompting-zero-shot-action-detection)

Wei-Jhe Huang, [Min-Hung Chen](/person/min-hung-chen), Shang-Hong Lai



[Winter Conference on Applications of Computer Vision (WACV) 2025](https://wacv2025.thecvf.com/)









### 2024

[CosAE: Learnable Fourier Series for Image Restoration](/publication/2024-12_cosae-learnable-fourier-series-image-restoration)

[Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz)



[Advances in Neural Information Processing Systems (NeurIPS) 2024](https://proceedings.neurips.cc/paper_files/paper/2024/file/13e8be77982beb73d7ed0bbf122f9f3c-Paper-Conference.pdf)









[Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models](/publication/2024-12_warped-diffusion-solving-video-inverse-problems-image-diffusion-models)

Giannis Daras, Weili Nie, [Karsten Kreis](/person/karsten-kreis), Alexandros G. Dimakis, [Morteza Mardani](/person/morteza-mardani), [Nikola Kovachki](/person/nikola-kovachki), [Arash Vahdat](/person/arash-vahdat)



[Neural Information Processing Systems (NeurIPS) 2024](https://arxiv.org/abs/2410.16152)









[QUEEN: QUantized Efficient ENcoding for Streaming Free-viewpoint Videos](/publication/2024-12_queen-quantized-efficient-encoding-streaming-free-viewpoint-videos)

Sharath Girish, [Tianye Li](/person/tianye-li), [Amrita Mazumdar](/person/amrita-mazumdar), Abhinav Shrivastava, [David Luebke](/person/david-luebke), [Shalini De Mello](/person/shalini-de-mello)



[Advances in Neural Information Processing Systems (NeurIPS) 2024](https://proceedings.neurips.cc/paper_files/paper/2024/hash/4c9477b9e2c7ec0ad3f4f15077aaf85a-Abstract-Conference.html)









[L4GM: Large 4D Gaussian Reconstruction Model](/publication/2024-12_l4gm-large-4d-gaussian-reconstruction-model)

Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng, [Karsten Kreis](/person/karsten-kreis), Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling



[Neural Information Processing Systems (NeurIPS) 2024](https://arxiv.org/abs/2406.10324)









[Fast Encoder-Based 3D from Casual Videos via Point Track Processing](/index.php/publication/2024-12_fast-encoder-based-3d-casual-videos-point-track-processing)

[Yoni Kasten](/index.php/person/yoni-kasten), Wuyue Lu, [Haggai Maron](/index.php/person/haggai-maron)



[NeurIPS 2024](https://arxiv.org/pdf/2404.07097)









[Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities](/index.php/publication/2024-11_bayesian-example-selection-improves-context-learning-speech-text-and-visual)

Siyin Wang, [Huck Yang](/index.php/person/huck-yang), Ji Wu, Chao Zhang



[EMNLP](https://arxiv.org/pdf/2404.14716)









[From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment](/publication/2024-11_descriptive-richness-bias-unveiling-dark-side-generative-image-caption)

Yusuke Hirota, [Ryo Hachiuma](/person/ryo-hachiuma), [Huck Yang](/person/huck-yang), Yuta Nakashima



[EMNLP](https://arxiv.org/pdf/2406.13912)









[ReMatching Dynamic Reconstruction Flow](/publication/2024-11_rematching-dynamic-reconstruction-flow)

Sara Oblak, Despoina Paschalidou, Sanja Fidler, Matan Atzmon



[arXiv](https://arxiv.org/pdf/2411.00705)









[Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning](/index.php/publication/2024-10_proto-clip-vision-language-prototypical-network-few-shot-learning)

Jishnu Jaykumar P, Kamalesh Palanisamy, [Yu-Wei Chao](/index.php/person/yu-wei-chao), Xinya Du, Yu Xiang



[IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024](https://iros2024-abudhabi.org)









[Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models](/publication/2024-09_mitigating-covariate-shift-imitation-learning-autonomous-vehicles-using-latent)

Alexander Popov, Alperen Degirmenci, David Wehr, Shashank Hegde, Ryan Oldja, Alexey Kamenev, Bertrand Douillard, David Nistér, Urs Muller, Ruchi Bhargava, [Stan Birchfield](/person/stan-birchfield), Nikolai Smolyanskiy



[arXiv](https://arxiv.org/abs/2409.16663)









[TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models](/publication/2024-08_turboedit-text-based-image-editing-using-few-step-diffusion-models)

Gilad Deutch, Rinon Gal, Daniel Garibi, Or Patashnik, Daniel Cohen-Or



[SIGGRAPH Asia 2024](https://arxiv.org/abs/2408.00735)









[DoRA: Weight-Decomposed Low-Rank Adaptation](/publication/2024-07_dora-weight-decomposed-low-rank-adaptation)

Shih-Yang Liu, Chien-Yi Wang, [Hongxu Danny Yin](/person/danny-yin), [Pavlo Molchanov](/person/pavlo-molchanov), [Frank Wang](/person/frank-wang), Kwang-Ting Cheng, [Min-Hung Chen](/person/min-hung-chen)



[International Conference on Machine Learning (ICML) 2024](https://icml.cc/Conferences/2024)









[RVT-2: Learning Precise Manipulation from Few Examples](/index.php/publication/2024-07_rvt-2-learning-precise-manipulation-few-examples)

[Ankit Goyal](/index.php/person/ankit-goyal), [Valts Blukis](/index.php/person/valts-blukis), [Jie Xu](/index.php/person/jie-xu), [Yijie Guo](/index.php/person/yijie-guo), [Yu-Wei Chao](/index.php/person/yu-wei-chao), Dieter Fox



[Robotics: Science and Systems (RSS) 2024](https://roboticsconference.org/)









[Breathing Life Into Sketches Using Text-to-Video Priors](/publication/2024-07_breathing-life-sketches-using-text-video-priors)

Rinon Gal, Yael Vinker, Yuval Alaluf, Amit Bermano, Daniel Cohen-Or, Ariel Shamir, [Gal Chechik](/person/gal-chechik)



[CVPR 2024](https://arxiv.org/abs/2311.13608)









[GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning](/publication/2024-06_gavatar-animatable-3d-gaussian-avatars-implicit-mesh-learning)

[Ye Yuan](/person/ye-yuan), [Xueting Li](/person/xueting-li), Yangyi Huang, [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano), [Jan Kautz](/person/jan-kautz), [Umar Iqbal](/person/umar-iqbal)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Yuan_GAvatar_Animatable_3D_Gaussian_Avatars_with_Implicit_Mesh_Learning_CVPR_2024_paper.pdf)



Highlight





[RegionGPT: Towards Region Understanding Vision Language Model](/publication/2024-06_regiongpt-towards-region-understanding-vision-language-model)

Qiushan Guo, [Shalini De Mello](/person/shalini-de-mello), [Hongxu Danny Yin](/person/danny-yin), [Wonmin Byeon](/person/wonmin-byeon), Ka Chun Cheung, Yizhou Yu, Ping Luo, [Sifei Liu](/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Guo_RegionGPT_Towards_Region_Understanding_Vision_Language_Model_CVPR_2024_paper.pdf)









[Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata](/publication/2024-06_outdoor-scene-extrapolation-hierarchical-generative-cellular-automata)

Dongsu Zhang, Francis Williams, Zan Gojcic, [Karsten Kreis](/person/karsten-kreis), Sanja Fidler, Young Min Kim, Amlan Kar



[CVPR 2024 (Highlight)](https://arxiv.org/abs/2406.08292)









[What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs](/publication/2024-06_what-you-see-what-you-gan-rendering-every-pixel-high-fidelity-geometry-3d-gans)

[Alexander Trevithick](/person/alexander-trevithick), Matthew Chan, Towaki Takikawa, [Umar Iqbal](/person/umar-iqbal), [Shalini De Mello](/person/shalini-de-mello), Manmohan Chandraker, Ravi Ramamoorthi, [Koki Nagano](/person/koki-nagano)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Trevithick_What_You_See_is_What_You_GAN_Rendering_Every_Pixel_CVPR_2024_paper.pdf)









[Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models](/publication/2024-06_align-your-gaussians-text-4d-dynamic-3d-gaussians-and-composed-diffusion-models)

Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[CVPR 2024 (Highlight)](https://arxiv.org/abs/2312.13763)









[Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects](/index.php/publication/2024-06_neural-implicit-representation-building-digital-twins-unknown-articulated)

Yijia Weng, [Bowen Wen](/index.php/person/bowen-wen), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Valts Blukis](/index.php/person/valts-blukis), Dieter Fox, Leo Guibas, [Stan Birchfield](/index.php/person/stan-birchfield)



[CVPR 2024](https://cvpr.thecvf.com/Conferences/2024)









[FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects](/publication/2024-06_foundationpose-unified-6d-pose-estimation-and-tracking-novel-objects)

[Bowen Wen](/person/bowen-wen), [Wei Yang](/person/wei-yang), [Jan Kautz](/person/jan-kautz), [Stan Birchfield](/person/stan-birchfield)



[CVPR 2024](https://cvpr.thecvf.com/Conferences/2024)









[NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows](/index.php/publication/2024-06_nerfdeformer-nerf-transformation-single-view-3d-scene-flows)

Zhenggang Tang, Zhongzheng Ren, Xiaoming Zhao, [Bowen Wen](/index.php/person/bowen-wen), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Stan Birchfield](/index.php/person/stan-birchfield), Alexander Schwing



[CVPR 2024](https://cvpr.thecvf.com/Conferences/2024)









[SynH2R: Synthesizing Hand-Object Motions for Learning Human-to-Robot Handovers](/index.php/publication/2024-05_synh2r-synthesizing-hand-object-motions-learning-human-robot-handovers)

Sammy Christen, Lan Feng, [Wei Yang](/index.php/person/wei-yang), [Yu-Wei Chao](/index.php/person/yu-wei-chao), Otmar Hilliges, Jie Song



[IEEE International Conference on Robotics and Automation (ICRA) 2024](https://2024.ieee-icra.org)









[FasterViT: Fast Vision Transformers with Hierarchical Attention](/publication/2024-05_fastervit-fast-vision-transformers-hierarchical-attention)

Ali Hatamizadeh, [Greg Heinrich](/person/greg-heinrich), [Hongxu Danny Yin](/person/danny-yin), Andrew Tao, Jose M. Alvarez, [Jan Kautz](/person/jan-kautz), [Pavlo Molchanov](/person/pavlo-molchanov)



[International Conference on Learning Representations (ICLR) 2024](https://iclr.cc/)









[WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space](/publication/2024-05_wildfusion-learning-3d-aware-latent-diffusion-models-view-space)

Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger, [Karsten Kreis](/person/karsten-kreis)



[International Conference on Learning Representations (ICLR) 2024](https://arxiv.org/abs/2311.13570)









[3D Reconstruction with Generalizable Neural Fields using Scene Priors](/publication/2024-05_3d-reconstruction-generalizable-neural-fields-using-scene-priors)

Yang Fu, [Shalini De Mello](/person/shalini-de-mello), [Xueting Li](/person/xueting-li), Amey Kulkarni, [Jan Kautz](/person/jan-kautz), Xiaolong Wang, [Sifei Liu](/person/sifei-liu)



[International Conference on Learning Representations (ICLR) 2024](https://proceedings.iclr.cc/paper_files/paper/2024/hash/0bd32794b26cfc99214b89313764da8e-Abstract-Conference.html)









[LCM-Lookahead for Encoder-based Text-to-Image Personalization](/publication/2024-04_lcm-lookahead-encoder-based-text-image-personalization)

Rinon Gal, Or Lichter, Elad Richardson, Or Patashnik, Amit H Bermano, [Gal Chechik](/person/gal-chechik), Daniel Cohen-Or



[ECCV 2024](https://arxiv.org/abs/2404.03620)









[Consolidating Attention Features for Multi-view Image Editing](/publication/2024-02_consolidating-attention-features-multi-view-image-editing)

Or Patashnik, Rinon Gal, Daniel Cohen-Or, Jun-Yan Zhu, Fernando De la Torre



[SIGGRAPH Asia 2024](https://arxiv.org/abs/2402.14792)









### 2023

[Compact Neural Graphics Primitives with Learned Hash Probing](/publication/2023-12_compact-neural-graphics-primitives-learned-hash-probing)

Towaki Takikawa, [Thomas Müller](/person/thomas-muller), [Merlin Nimier-David](/person/merlin-nimier-david), Alex Evans, [Sanja Fidler](/person/sanja-fidler), Alec Jacobson, [Alex Keller](/person/alex-keller)



[SIGGRAPH Asia 2023](https://dl.acm.org/doi/10.1145/3610548.3618167)









[Generalizable One-shot 3D Neural Head Avatar](/index.php/publication/2023-12_generalizable-one-shot-3d-neural-head-avatar)

[Xueting Li](/index.php/person/xueting-li), [Shalini De Mello](/index.php/person/shalini-de-mello), [Sifei Liu](/index.php/person/sifei-liu), [Koki Nagano](/index.php/person/koki-nagano), [Umar Iqbal](/index.php/person/umar-iqbal), [Jan Kautz](/index.php/person/jan-kautz)



[Advances in Neural Information Processing Systems (NeurIPS) 2023](https://neurips.cc/)









[Point-Cloud Completion with Pretrained Text-to-image Diffusion Models](/publication/2023-12_point-cloud-completion-pretrained-text-image-diffusion-models)

[Yoni Kasten](/person/yoni-kasten), Ohad Rahamim, [Gal Chechik](/person/gal-chechik)



[NeurIPS 2023](https://arxiv.org/pdf/2306.10533.pdf)









[SceneScape: Text-Driven Consistent Scene Generation](/publication/2023-12_scenescape-text-driven-consistent-scene-generation)

Rafail Fridman, Amit Abecasis, [Yoni Kasten](/person/yoni-kasten), Tali Dekel



[NeurIPS 2023](https://arxiv.org/pdf/2302.01133.pdf)









[XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies](/publication/2023-12_xcube-large-scale-3d-generative-modeling-using-sparse-voxel-hierarchies)

Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams



[CVPR](https://arxiv.org/abs/2312.03806)









[Adaptive Shells for Efficient Neural Radiance Field Rendering](/publication/2023-12_adaptive-shells-efficient-neural-radiance-field-rendering)

Zian Wang, Tianchang Shen, [Merlin Nimier-David](/person/merlin-nimier-david), Nicholas Sharp, Jun Gao, [Alex Keller](/person/alex-keller), [Sanja Fidler](/person/sanja-fidler), [Thomas Müller](/person/thomas-muller), Zan Gojcic



[SIGGRAPH Asia 2023](https://dl.acm.org/doi/10.1145/3618390)



SIGGRAPH Asia 2023 Best Paper Award





[2D-3D Interlaced Transformer for Point Cloud Segmentation with Scene-Level Supervision](/publication/2023-10_2d-3d-interlaced-transformer-point-cloud-segmentation-scene-level-supervision)

Cheng-Kun Yang, [Min-Hung Chen](/person/min-hung-chen), Yung-Yu Chuang, Yen-Yu Lin



[IEEE/CVF International Conference on Computer Vision (ICCV) 2023](https://iccv2023.thecvf.com/)









[DreamTeacher: Pretraining Image Backbones with Deep Generative Models](/publication/2023-10_dreamteacher-pretraining-image-backbones-deep-generative-models)

Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, [Karsten Kreis](/person/karsten-kreis), Antonio Torralba, Sanja Fidler



[IEEE/CVF International Conference on Computer Vision (ICCV) 2023](https://arxiv.org/abs/2307.07487)









[ATT3D: Amortized Text-To-3D Object Synthesis](/publication/2023-10_att3d-amortized-text-3d-object-synthesis)

Jonathan Lorraine, Kevin Xie, Xiaohui Zeng, [Chen-Hsuan Lin](/person/chen-hsuan-lin), Towaki Takikawa, Nicholas Sharp, [Tsung-Yi Lin](/person/tsung-yi-lin), [Ming-Yu Liu](/person/ming-yu-liu), Sanja Fidler, James Lucas



[ICCV 2023](https://openaccess.thecvf.com/content/ICCV2023/papers/Lorraine_ATT3D_Amortized_Text-to-3D_Object_Synthesis_ICCV_2023_paper.pdf)









[Neural LiDAR Fields for Novel View Synthesis](/index.php/publication/2023-10_neural-lidar-fields-novel-view-synthesis)

Shengyu Huang, Zan Gojcic, Zian Wang, Francis Williams, [Yoni Kasten](/index.php/person/yoni-kasten), Sanja Fidler, Konrad Schindler, Or Litany



[ICCV 2023](https://nv-tlabs.github.io/nfl/assets/nfl_main.pdf)









[Syntactic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment](/publication/2023-10_syntactic-binding-diffusion-models-enhancing-attribute-correspondence-through)

Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg, [Gal Chechik](/person/gal-chechik)



[NeurIPS 2023](https://nips.cc/virtual/2023/oral/73870)



Oral presentation





[HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions](/index.php/publication/2023-10_handal-dataset-real-world-manipulable-object-categories-pose-annotations)

Andrew Guo, [Bowen Wen](/index.php/person/bowen-wen), Jianhe Yuan, [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Stephen Tyree](/index.php/person/stephen-tyree), [Jeff Smith](/index.php/person/jeff-smith), [Stan Birchfield](/index.php/person/stan-birchfield)



[IROS 2023](https://ieee-iros.org/)









[Norm-guided latent space exploration for text-to-image generation](/publication/2023-10_norm-guided-latent-space-exploration-text-image-generation)

Dvir Samuel, Rami Ben-Ari, Nir Darshan, [Haggai Maron](/person/haggai-maron), [Gal Chechik](/person/gal-chechik)



[NeurIPS 2023](https://nips.cc/virtual/2023/poster/70922)









[Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection](/publication/2023-08_online-overexposed-pixels-hallucination-videos-adaptive-reference-frame)

Yazhou Xing, [Amrita Mazumdar](/person/amrita-mazumdar), Anjul Patney, [Chao Liu](/person/chao-liu), [Hongxu Danny Yin](/person/danny-yin), Qifeng Chen, [Jan Kautz](/person/jan-kautz), [Iuri Frosio](/person/iuri-frosio)



[arXiv](https://arxiv.org/abs/2308.15462)









[Differentially Private Diffusion Models](/publication/2023-08_differentially-private-diffusion-models)

Tim Dockhorn, Tianshi Cao, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[Transactions on Machine Learning Research (TMLR) 2023](https://arxiv.org/abs/2210.09929)









[Flexible Isosurface Extraction for Gradient-Based Mesh Optimization](/publication/2023-08_flexible-isosurface-extraction-gradient-based-mesh-optimization)

Tianchang Shen, [Jacob Munkberg](/person/jacob-munkberg), [Jon Hasselgren](/person/jon-hasselgren), Kangxue Yin, Zian Wang, Wenzheng Chen, Zan Gojcic, Sanja Fidler, Nicholas Sharp, Jun Gao



[ACM Transactions on Graphics (SIGGRAPH 2023)](https://dl.acm.org/doi/abs/10.1145/3592430)









[Live 3D Portrait: Real-Time Radiance Fields for Single-Image Portrait View Synthesis](/publication/2023-08_live-3d-portrait-real-time-radiance-fields-single-image-portrait-view-synthesis)

Alexander Trevithick, Matthew Chan, [Michael Stengel](/person/michael-stengel), Eric R. Chan, [Chao Liu](/person/chao-liu), [Zhiding Yu](/person/zhiding-yu), Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi, [Koki Nagano](/person/koki-nagano)



[ACM Transactions on Graphics (SIGGRAPH 2023)](https://s2023.siggraph.org/)









[Learning Physically Simulated Tennis Players from Broadcast Videos](/publication/2023-08_learning-physically-simulated-tennis-players-broadcast-videos)

Haotian Zhang, [Ye Yuan](/person/ye-yuan), Viktor Makoviychuk, Yunrong Guo, Sanja Fidler, Jason Peng, Kayvon Fatahalian



[SIGGRAPH 2023 (Best Paper Honorable Mention)](https://s2023.siggraph.org/)









[SSIF: Single-shot Implicit Morphable Faces With Consistent Texture Parameterization](/publication/2023-08_ssif-single-shot-implicit-morphable-faces-consistent-texture-parameterization)

Connor Zhizhen Lin, [Koki Nagano](/person/koki-nagano), [Jan Kautz](/person/jan-kautz), Eric R. Chan, [Umar Iqbal](/person/umar-iqbal), Leonidas Guibas, Gordon Wetzstein, Sameh Khamis



[SIGGRAPH 2023](https://s2023.siggraph.org/)









[Global Context Vision Transformers](/publication/2023-07_global-context-vision-transformers)

Ali Hatamizadeh, [Hongxu Danny Yin](/person/danny-yin), [Greg Heinrich](/person/greg-heinrich), [Jan Kautz](/person/jan-kautz), [Pavlo Molchanov](/person/pavlo-molchanov)



[International Conference on Machine Learning (ICML) 2023](https://icml.cc/Conferences/2023)









[Task-Aware Risk Estimation of Perception Failures for Autonomous Vehicles](/publication/2023-07_task-aware-risk-estimation-perception-failures-autonomous-vehicles)

Pasquale Antonante, [Sushant Veer](/person/sushant-veer), [Karen Leung](/person/karen-leung), [Xinshuo Weng](/person/xinshuo-weng), Luca Carlone, [Marco Pavone](/person/marco-pavone)



[Robotics: Science and Systems (RSS) 2023](https://roboticsconference.org/)









[AnyTeleop: A General Vision-Based Dexterous Robot Arm-Hand Teleoperation System](/index.php/publication/2023-06_anyteleop-general-vision-based-dexterous-robot-arm-hand-teleoperation-system)

Yuzhe Qin, [Wei Yang](/index.php/person/wei-yang), Binghao Huang, Karl Van Wyk, Hao Su, Xiaolong Wang, [Yu-Wei Chao](/index.php/person/yu-wei-chao), Dieter Fox



[Robotics: Science and Systems (RSS) 2023](https://roboticsconference.org/program/papers/015/)









[TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation](/publication/2023-06_tta-cope-test-time-adaptation-category-level-object-pose-estimation)

Taeyeop Lee, [Jonathan Tremblay](/person/jonathan-tremblay), [Valts Blukis](/person/valts-blukis), [Bowen Wen](/person/bowen-wen), Byeong-Uk Lee, Inkyu Shin, [Stan Birchfield](/person/stan-birchfield), In So Kweon, Kuk-Jin Yoon



[CVPR 2023](https://cvpr2023.thecvf.com/)









[Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models](/publication/2023-06_align-your-latents-high-resolution-video-synthesis-latent-diffusion-models)

Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://arxiv.org/abs/2304.08818)









[Affordance Diffusion: Synthesizing Hand-Object Interactions](/index.php/publication/2023-06_affordance-diffusion-synthesizing-hand-object-interactions)

Yufei Ye, [Xueting Li](/index.php/person/xueting-li), Abhinav Gupta, [Shalini De Mello](/index.php/person/shalini-de-mello), [Stan Birchfield](/index.php/person/stan-birchfield), Jiaming Song, Shubham Tulsiani, [Sifei Liu](/index.php/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)









[FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization](/publication/2023-06_freenerf-improving-few-shot-neural-rendering-free-frequency-regularization)

Jiawei Yang, [Marco Pavone](/person/marco-pavone), [Yue Wang](/person/yue-wang)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)









[NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models](/publication/2023-06_neuralfield-ldm-scene-generation-hierarchical-latent-diffusion-models)

Seung Wook Kim, Bradley Brown, Kangxue Yin, [Karsten Kreis](/person/karsten-kreis), Katja Schwarz, Daiqing Li, Robin Rombach, Antonio Torralba, Sanja Fidler



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://arxiv.org/abs/2304.09787)









[Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation](/publication/2023-06_object-pose-estimation-statistical-guarantees-conformal-keypoint-detection-and)

[Heng Yang](/person/heng-yang), [Marco Pavone](/person/marco-pavone)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)



Selected as a Highlight Paper





[Neural Congealing: Aligning Images to a Joint Semantic Atlas](/publication/2023-06_neural-congealing-aligning-images-joint-semantic-atlas)

Dolev Ofri-Amar, Michal Geyer, [Yoni Kasten](/person/yoni-kasten), Tali Dekel



[CVPR 2023](https://openaccess.thecvf.com/content/CVPR2023/html/Ofri-Amar_Neural_Congealing_Aligning_Images_to_a_Joint_Semantic_Atlas_CVPR_2023_paper.html)









[Learning Human-to-Robot Handovers from Point Clouds](/publication/2023-06_learning-human-robot-handovers-point-clouds)

Sammy Christen, [Wei Yang](/person/wei-yang), [Claudia Pérez D’Arpino](/person/cdarpino), Otmar Hilliges, Dieter Fox, [Yu-Wei Chao](/person/yu-wei-chao)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com)



Highlight





[Zero-shot Pose Transfer for Unrigged Stylized 3D Characters](/publication/2023-06_zero-shot-pose-transfer-unrigged-stylized-3d-characters)

Jiashun Wang, [Xueting Li](/person/xueting-li), [Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), Orazio Gallo, Xiaolong Wang, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://openaccess.thecvf.com/content/CVPR2023/papers/Wang_Zero-Shot_Pose_Transfer_for_Unrigged_Stylized_3D_Characters_CVPR_2023_paper.pdf)









[GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields](/index.php/publication/2023-06_gazenerf-3d-aware-gaze-redirection-neural-radiance-fields)

Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li, [Shalini De Mello](/index.php/person/shalini-de-mello), Hyung Jin Chang, Xucong Zhang, Otmar Hilliges



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)









[Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models](/publication/2023-06_open-vocabulary-panoptic-segmentation-text-image-diffusion-models)

Jiarui Xu, [Sifei Liu](/person/sifei-liu), [Arash Vahdat](/person/arash-vahdat), [Wonmin Byeon](/person/wonmin-byeon), Xiaolong Wang, [Shalini De Mello](/person/shalini-de-mello)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)



Highlight, top 10%





[BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects](/publication/2023-06_bundlesdf-neural-6-dof-tracking-and-3d-reconstruction-unknown-objects)

[Bowen Wen](/person/bowen-wen), [Jonathan Tremblay](/person/jonathan-tremblay), [Valts Blukis](/person/valts-blukis), [Stephen Tyree](/person/stephen-tyree), [Thomas Müller](/person/thomas-muller), Alex Evans, Dieter Fox, [Jan Kautz](/person/jan-kautz), [Stan Birchfield](/person/stan-birchfield)



[CVPR 2023](https://cvpr2023.thecvf.com/)









[Magic3D: High-Resolution Text-to-3D Content Creation](/publication/2023-06_magic3d-high-resolution-text-3d-content-creation)

[Chen-Hsuan Lin](/person/chen-hsuan-lin), Jun Gao, Luming Tang, Towaki Takikawa, Xiaohui Zeng, Xun Huang, [Karsten Kreis](/person/karsten-kreis), Sanja Fidler, [Ming-Yu Liu](/person/ming-yu-liu), [Tsung-Yi Lin](/person/tsung-yi-lin)



[CVPR 2023 (Highlight)](https://cvpr2023.thecvf.com/)









[Planning for Multi-Object Manipulation with Graph Neural Network Relational Classifiers](/publication/2023-06_planning-multi-object-manipulation-graph-neural-network-relational-classifiers)

Yixuan Huang, Adam Conkey, [Tucker Hermans](/person/tucker-hermans)



[IEEE International Conference on Robotics and Automation (ICRA) 2023](https://www.icra2023.org/)









[Neuralangelo: High-Fidelity Neural Surface Reconstruction](/publication/2023-06_neuralangelo-high-fidelity-neural-surface-reconstruction)

[Max Zhaoshuo Li](/person/max-zhaoshuo-li), [Thomas Müller](/person/thomas-muller), Alex Evans, Russell H. Taylor, Mathias Unberath, [Ming-Yu Liu](/person/ming-yu-liu), [Chen-Hsuan Lin](/person/chen-hsuan-lin)



[CVPR 2023](https://cvpr2023.thecvf.com/)



The Best Inventions of 2023, TIME Magazine





[Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation](/publication/2023-05_parallel-inversion-neural-radiance-fields-robust-pose-estimation)

Yunzhi Lin, [Thomas Müller](/person/thomas-muller), [Jonathan Tremblay](/person/jonathan-tremblay), [Bowen Wen](/person/bowen-wen), [Stephen Tyree](/person/stephen-tyree), Alex Evans, Patricio A. Vela, [Stan Birchfield](/person/stan-birchfield)



[ICRA 2023](https://www.icra2023.org/)









[Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models](/publication/2023-05_planning-occluded-traffic-agents-using-bi-level-variational-occlusion-models)

Filippos Christianos, [Peter Karkus](/person/peter-karkus), [Boris Ivanovic](/person/boris-ivanovic), Stefano V. Albrecht, [Marco Pavone](/person/marco-pavone)



[IEEE International Conference on Robotics and Automation (ICRA) 2023](https://www.icra2023.org/)









[FewSOL: A Dataset for Few-Shot Object Learning in Robotic Environments](/index.php/publication/2023-05_fewsol-dataset-few-shot-object-learning-robotic-environments)

Jishnu Jaykumar P, [Yu-Wei Chao](/index.php/person/yu-wei-chao), Yu Xiang



[IEEE International Conference on Robotics and Automation (ICRA) 2023](https://www.icra2023.org)









[The Best Defense is a Good Offense: Adversarial Augmentation against Adversarial Attacks](/publication/2023-05_best-defense-good-offense-adversarial-augmentation-against-adversarial-attacks)

[Iuri Frosio](/person/iuri-frosio), [Jan Kautz](/person/jan-kautz)



[CVPR 2023](https://cvpr2023.thecvf.com/)









[RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control](/index.php/publication/2023-05_rgb-only-reconstruction-tabletop-scenes-collision-free-manipulator-control)

Zhenggang Tang, [Balakumar Sundaralingam](/index.php/person/balakumar-sundaralingam), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Bowen Wen](/index.php/person/bowen-wen), [Ye Yuan](/index.php/person/ye-yuan), [Stephen Tyree](/index.php/person/stephen-tyree), [Charles Loop](/index.php/person/charles-loop), Alexander Schwing, [Stan Birchfield](/index.php/person/stan-birchfield)



[ICRA 2023](https://www.icra2023.org/)









[Subpixel Deblurring of Anti-Aliased Raster Clip Art](/publication/2023-05_subpixel-deblurring-anti-aliased-raster-clip-art)

Jinfan Yang, [Nicholas Vining](/person/nicholas-vining), Shakiba Kheradmand, Nathan Carr, Leonid Sigal, Alla Sheffer



[Computer Graphics Forum (Proc. Eurographics 2023)](https://diglib.eg.org/handle/10.1111/cgf14744)









[GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation](/publication/2023-05_gpvit-high-resolution-non-hierarchical-vision-transformer-group-propagation)

Chenhongyi Yang, Jiarui Xu, [Shalini De Mello](/person/shalini-de-mello), Elliot J. Crowley, Xiaolong Wang



[International Conference on Learning Representations (ICLR) 2023](https://iclr.cc/virtual/2023/poster/11986)



Notable top 25%, Oral





[Robust and Controllable Object-Centric Learning through Energy-based Models](/publication/2023-05_robust-and-controllable-object-centric-learning-through-energy-based-models)

Ruixiang Zhang, [Gerry Che](/person/gerry-che), [Boris Ivanovic](/person/boris-ivanovic), Renhao Wang, [Marco Pavone](/person/marco-pavone), Yoshua Bengio, Liam Paull



[International Conference on Learning Representations (ICLR) 2023](https://iclr.cc/Conferences/2023)









[Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis](/publication/2023-02_frido-feature-pyramid-diffusion-complex-scene-image-synthesis)

Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, [Frank Wang](/person/frank-wang)



[AAAI 2023](https://aaai.org/Conferences/AAAI-23/)









[Target-free Text-guided Image Manipulation](/publication/2023-02_target-free-text-guided-image-manipulation)

Wan-Cyuan Fan, Cheng-Fu Yang, Chiao-An Yang, [Frank Wang](/person/frank-wang)



[AAAI 2023](https://aaai.org/Conferences/AAAI-23/)









[Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and Beyond](/publication/2023-01_self-supervised-pyramid-representation-learning-multi-label-visual-analysis-and)

Cheng-Yen Hsieh, Chih-Jung Chang, Fu-En Yang, [Frank Wang](/person/frank-wang)



[WACV 2023](https://wacv2023.thecvf.com/home)









### 2022 

[Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation](/publication/2022-12_learning-robust-real-world-dexterous-grasping-policies-implicit-shape)

Qiuyu Chen, Karl Van Wyk, [Yu-Wei Chao](/person/yu-wei-chao), [Wei Yang](/person/wei-yang), Arsalan Mousavian, Abhishek Gupta, Dieter Fox



[The Conference on Robot Learning (CoRL) 2022](https://corl2022.org/)









[Robust Trajectory Prediction against Adversarial Attacks](/publication/2022-12_robust-trajectory-prediction-against-adversarial-attacks)

[Yulong Cao](/person/yulong-cao), [Danfei Xu](/person/danfei-xu), [Xinshuo Weng](/person/xinshuo-weng), Z. Morley Mao, Anima Anandkumar, [Chaowei Xiao](/person/chaowei-xiao), [Marco Pavone](/person/marco-pavone)



[Conference on Robot Learning (CoRL) 2022](https://corl2022.org/)



Selected for Oral Presentation





[Task-Relevant Failure Detection for Trajectory Predictors in Autonomous Vehicles](/publication/2022-12_task-relevant-failure-detection-trajectory-predictors-autonomous-vehicles)

Alec Farid, [Sushant Veer](/person/sushant-veer), [Boris Ivanovic](/person/boris-ivanovic), [Karen Leung](/person/karen-leung), [Marco Pavone](/person/marco-pavone)



[Conference on Robot Learning (CoRL) 2022](https://corl2022.org/)









[MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare](/index.php/publication/2022-12_megapose-6d-pose-estimation-novel-objects-render-compare)

Yann Labbe, Lucas Manuelli, Arsalan Mousavian, [Stephen Tyree](/index.php/person/stephen-tyree), [Stan Birchfield](/index.php/person/stan-birchfield), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), et al.



[CoRL 2022](https://corl2022.org/)









[Motion Policy Networks](/publication/2022-12_motion-policy-networks)

Adam Fishman, [Adithya Murali](/person/adithya-murali), Clemens Eppner, Bryan Peele, Byron Boots, Dieter Fox



[Conference on Robot Learning (CoRL), 2022](https://arxiv.org/abs/2210.12209)









["This is my unicorn, Fluffy": Personalizing frozen vision-language representations](/publication/2022-11_my-unicorn-fluffy-personalizing-frozen-vision-language-representations)

Niv Cohen, Rinon Gal, [Eli Meirom](/person/eli-meirom), [Gal Chechik](/person/gal-chechik), [Yuval Atzmon](/person/yuval-atzmon)



[ECCV 2022](https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136800544.pdf)









[Embodied Scene-aware Human Pose Estimation](/publication/2022-11_embodied-scene-aware-human-pose-estimation)

Zhengyi Luo, Shun Iwase, [Ye Yuan](/person/ye-yuan), Kris Kitani



[NeurIPS 2022](https://nips.cc/Conferences/2022)









[Structural Pruning via Latency-Saliency Knapsack](/publication/2022-11_structural-pruning-latency-saliency-knapsack)

Maying Shen, [Hongxu Danny Yin](/person/danny-yin), [Pavlo Molchanov](/person/pavlo-molchanov), Lei Mao, Jianna Liu, Jose M. Alvarez



[NeurIPS 2022](https://nips.cc/Conferences/2022/ScheduleMultitrack?event=52841)









[GENIE: Higher-Order Denoising Diffusion Solvers](/publication/2022-11_genie-higher-order-denoising-diffusion-solvers)

Tim Dockhorn, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[Neural Information Processing Systems (NeurIPS) 2022](https://arxiv.org/abs/2210.05475)









[SPoVT: Semantic-Prototype Variational Transformer for Dense Point Cloud Semantic Completion](/publication/2022-11_spovt-semantic-prototype-variational-transformer-dense-point-cloud-semantic)

Sheng-Yu Huang, Hao-Yu Hsu, [Yu-Chiang Frank Wang](/person/frank-wang)



[NeurIPS 2022](https://nips.cc/)









[Paraphrasing Is All You Need for Novel Object Captioning](/publication/2022-11_paraphrasing-all-you-need-novel-object-captioning)

Cheng-Fu Yang, Yao-Hung Hubert Tsai, Wan-Cyuan Fan, Ruslan Salakhutdinov, Louis-Philippe Morency, [Frank Wang](/person/frank-wang)



[NeurIPS 2022](https://nips.cc/)









[6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark](/index.php/publication/2022-11_6-dof-pose-estimation-household-objects-robotic-manipulation-accessible-dataset)

[Stephen Tyree](/index.php/person/stephen-tyree), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Stan Birchfield](/index.php/person/stan-birchfield), et al.



[IROS 2022](https://iros2022.org/)









[Heterogeneous-Agent Trajectory Forecasting Incorporating Class Uncertainty](/publication/2022-10_heterogeneous-agent-trajectory-forecasting-incorporating-class-uncertainty)

[Boris Ivanovic](/person/boris-ivanovic), Kuan-Hui Lee, Pavel Tokmakov, Blake Wulfe, Adrien Gaidon, [Marco Pavone](/person/marco-pavone)



[IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022](https://iros2022.org/)









[AdvDO: Realistic Adversarial Attacks for Trajectory Prediction](/publication/2022-10_advdo-realistic-adversarial-attacks-trajectory-prediction)

[Yulong Cao](/person/yulong-cao), [Chaowei Xiao](/person/chaowei-xiao), Anima Anandkumar, [Danfei Xu](/person/danfei-xu), [Marco Pavone](/person/marco-pavone)



[European Conference on Computer Vision (ECCV) 2022](https://eccv2022.ecva.net/)









[Text2LIVE: Text-Driven Layered Image and Video Editing](/publication/2022-10_text2live-text-driven-layered-image-and-video-editing)

Omer Bar-Tal, Dolev Ofri-Amar, Rafail Fridman, [Yoni Kasten](/person/yoni-kasten), Tali Dekel



[ECCV 2022](https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136750705.pdf)









[LANA: Latency Aware Network Acceleration](/publication/2022-10_lana-latency-aware-network-acceleration)

[Pavlo Molchanov](/person/pavlo-molchanov), Jimmy Hall, [Hongxu Danny Yin](/person/danny-yin), [Jan Kautz](/person/jan-kautz), Nicolo Fusi, [Arash Vahdat](/person/arash-vahdat)



[European Conference on Computer Vision (ECCV), 2022](https://arxiv.org/abs/2107.10624)









[Audio-Visual Segmentation](/publication/2022-10_audio-visual-segmentation)

Jinxin Zhou, Yiran Zhong, [Stan Birchfield](/person/stan-birchfield), et al.



[ECCV 2022](https://eccv2022.ecva.net/)









[Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising](/index.php/publication/2022-10_shape-light-and-material-decomposition-images-using-monte-carlo-rendering-and)

[Jon Hasselgren](/index.php/person/jon-hasselgren), Nikolai Hofmann, [Jacob Munkberg](/index.php/person/jacob-munkberg)



[NeurIPS 2022](https://nvlabs.github.io/nvdiffrecmc/)









[Variable Bitrate Neural Fields](/index.php/vbnf)

Towaki Takikawa, Alex Evans, [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Thomas Müller](/index.php/person/thomas-muller), Morgan McGuire, Alec Jacobson, Sanja Fidler



[ACM SIGGRAPH 2022 Conference Proceedings](https://s2022.siggraph.org/)









[An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion](/publication/2022-08_image-worth-one-word-personalizing-text-image-generation-using-textual)

Rinon Gal, Yuval Alaluf, [Yuval Atzmon](/person/yuval-atzmon), Or Patashnik, Amit H. Bermano, [Gal Chechik](/person/gal-chechik), Daniel Cohen-Or



[ICLR 2023](https://iclr.cc/virtual/2023/oral/12700)



Top 25%





[Instant Neural Graphics Primitives with a Multiresolution Hash Encoding](/publication/2022-07_instant-neural-graphics-primitives-multiresolution-hash-encoding)

[Thomas Müller](/person/thomas-muller), Alex Evans, Christoph Schied, [Alex Keller](/person/alex-keller)



[ACM Transactions on Graphics (SIGGRAPH 2022)](https://s2022.siggraph.org)



Best Technical Paper, SIGGRAPH 2022; The Best Inventions of 2022, TIME





[CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs](/publication/2022-07_coordgan-self-supervised-dense-correspondences-emerge-gans)

Jiteng Mu, [Shalini De Mello](/person/shalini-de-mello), [Zhiding Yu](/person/zhiding-yu), Nuno Vasconcelos, Xiaolong Wang, [Sifei Liu](/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[Whose Track Is It Anyway? Improving Robustness to Tracking Errors with Affinity-Based Prediction](/publication/2022-06_whose-track-it-anyway-improving-robustness-tracking-errors-affinity-based)

[Xinshuo Weng](/person/xinshuo-weng), [Boris Ivanovic](/person/boris-ivanovic), Kris Kitani, [Marco Pavone](/person/marco-pavone)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning](/publication/2022-06_scept-scene-consistent-policy-based-trajectory-predictions-planning)

[Yuxiao Chen](/person/yuxiao-chen), [Boris Ivanovic](/person/boris-ivanovic), [Marco Pavone](/person/marco-pavone)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps](/publication/2022-06_polymorphic-gan-generating-aligned-samples-across-multiple-domains-learned)

Seung Wook Kim, [Karsten Kreis](/person/karsten-kreis), Daiqing Li, Antonio Torralba, Sanja Fidler



[Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Oral)](https://arxiv.org/abs/2206.02903)









[FreeSOLO: Learning to Segment Objects without Annotations](/publication/2022-06_freesolo-learning-segment-objects-without-annotations)

Xinlong Wang, [Zhiding Yu](/person/zhiding-yu), [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz), Anima Anandkumar, Chunhua Shen, Jose M. Alvarez



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations](/publication/2022-06_bigdatasetgan-synthesizing-imagenet-pixel-wise-annotations)

Daiqing Li, Huan Ling, Seung Wook Kim, [Karsten Kreis](/person/karsten-kreis), Adela Barriuso, Sanja Fidler, Antonio Torralba



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://arxiv.org/abs/2201.04684)









[IFOR: Iterative Flow Minimization for Robotic Object Rearrangement](/index.php/publication/2022-06_ifor-iterative-flow-minimization-robotic-object-rearrangement)

[Ankit Goyal](/index.php/person/ankit-goyal), Arsalan Mousavian, Chris Paxton, [Yu-Wei Chao](/index.php/person/yu-wei-chao), Brian Okorn, Jia Deng, Dieter Fox



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[Efficient Geometry-aware 3D Generative Adversarial Networks](/publication/2022-06_efficient-geometry-aware-3d-generative-adversarial-networks)

Eric R. Chan, Connor Z. Lin, Matthew A. Chan, [Koki Nagano](/person/koki-nagano), Boxiao Pan, [Shalini De Mello](/person/shalini-de-mello), Orazio Gallo, Leonidas Guibas, [Jonathan Tremblay](/person/jonathan-tremblay), Sameh Khamis, [Tero Karras](/person/tero-karras), Gordon Wetzstein



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)



Oral





[GroupViT: Semantic Segmentation Emerges from Text Supervision](/publication/2022-06_groupvit-semantic-segmentation-emerges-text-supervision)

Jiarui Xu, [Shalini De Mello](/person/shalini-de-mello), [Sifei Liu](/person/sifei-liu), [Wonmin Byeon](/person/wonmin-byeon), [Thomas Breuel](/person/thomas-breuel), [Jan Kautz](/person/jan-kautz), Xiaolong Wang



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras](/index.php/publication/2022-06_glamr-global-occlusion-aware-human-mesh-recovery-dynamic-cameras)

[Ye Yuan](/index.php/person/ye-yuan), [Umar Iqbal](/index.php/person/umar-iqbal), [Pavlo Molchanov](/index.php/person/pavlo-molchanov), Kris Kitani, [Jan Kautz](/index.php/person/jan-kautz)



IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Ora…









[Interaction-Dynamics-Aware Perception Zones for Obstacle Detection Safety Evaluation](/publication/2022-06_interaction-dynamics-aware-perception-zones-obstacle-detection-safety)

Sever Topan, [Karen Leung](/person/karen-leung), [Yuxiao Chen](/person/yuxiao-chen), Pritish Tupekar, [Edward Schmerling](/person/ed-schmerling), Jonas Nilsson, Michael Cox, [Marco Pavone](/person/marco-pavone)



[IEEE Intelligent Vehicles Symposium (IV) 2022](https://iv2022.com/)









[Injecting Planning-Awareness into Prediction and Detection Evaluation](/publication/2022-06_injecting-planning-awareness-prediction-and-detection-evaluation)

[Boris Ivanovic](/person/boris-ivanovic), [Marco Pavone](/person/marco-pavone)



[IEEE Intelligent Vehicles Symposium (IV) 2022](https://iv2022.com/)









[MTP: Multi-Hypothesis Tracking and Prediction for Reduced Error Propagation](/publication/2022-06_mtp-multi-hypothesis-tracking-and-prediction-reduced-error-propagation)

[Xinshuo Weng](/person/xinshuo-weng), [Boris Ivanovic](/person/boris-ivanovic), [Marco Pavone](/person/marco-pavone)



[IEEE Intelligent Vehicles Symposium (IV) 2022](https://iv2022.com/)









[A-ViT: Adaptive Tokens for Efficient Vision Transformer](/index.php/publication/2022-06_vit-adaptive-tokens-efficient-vision-transformer)

[Hongxu Danny Yin](/index.php/person/danny-yin), [Arash Vahdat](/index.php/person/arash-vahdat), Jose M. Alvarez, Arun Mallya, [Jan Kautz](/index.php/person/jan-kautz), [Pavlo Molchanov](/index.php/person/pavlo-molchanov)



IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Ora…









[GradViT: Gradient Inversion of Vision Transformers](/publication/2022-05_gradvit-gradient-inversion-vision-transformers)

Ali Hatamizadeh, [Hongxu Danny Yin](/person/danny-yin), [Holger Roth](/person/holger-roth), [Wenqi Li](/person/wenqi-li), [Jan Kautz](/person/jan-kautz), [Daguang Xu](/person/daguang-xu), [Pavlo Molchanov](/person/pavlo-molchanov)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[When to Prune? A Policy towards Early Structural Pruning](/index.php/publication/2022-05_when-prune-policy-towards-early-structural-pruning)

Maying Shen, [Pavlo Molchanov](/index.php/person/pavlo-molchanov), [Hongxu Danny Yin](/index.php/person/danny-yin), Jose M. Alvarez



IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022









[Propagating State Uncertainty Through Trajectory Forecasting](/publication/2022-05_propagating-state-uncertainty-through-trajectory-forecasting)

[Boris Ivanovic](/person/boris-ivanovic), Yifeng Lin, Shubham Shrivastava, Punarjay Chakravarty, [Marco Pavone](/person/marco-pavone)



[IEEE International Conference on Robotics and Automation (ICRA) 2022](https://www.icra2022.org/)









[HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers](/index.php/publication/2022-05_handoversim-simulation-framework-and-benchmark-human-robot-object-handovers)

[Yu-Wei Chao](/index.php/person/yu-wei-chao), Chris Paxton, Yu Xiang, [Wei Yang](/index.php/person/wei-yang), [Balakumar Sundaralingam](/index.php/person/balakumar-sundaralingam), Tao Chen, [Adithya Murali](/index.php/person/adithya-murali), Maya Cakmak, Dieter Fox



[IEEE International Conference on Robotics and Automation (ICRA) 2022](https://www.icra2022.org)









[A Dataset and Explorer for 3D Signed Distance Functions](/index.php/sdf-explorer)

Towaki Takikawa, Andrew Glassner, Morgan McGuire



[Journal of Computer Graphics Techniques](https://www.jcgt.org)









[Learning Continuous Environment Fields via Implicit Functions](/index.php/publication/2022-04_learning-continuous-environment-fields-implicit-functions)

[Xueting Li](/index.php/person/xueting-li), [Shalini De Mello](/index.php/person/shalini-de-mello), Xiaolong Wang, Ming-Hsuan Yang, [Jan Kautz](/index.php/person/jan-kautz), [Sifei Liu](/index.php/person/sifei-liu)



[International Conference on Learning Representations (ICLR), 2022](https://iclr.cc/Conferences/2022)









[Neural Fields in Visual Computing and Beyond](/index.php/neuralfields)

Yiheng Xie, Towaki Takikawa, Shunsuke Saito, Or Litany, Shiqin Yan, Numair Khan, Federico Tombari, James Tompkin, Vincent Sitzmann, Srinath Sridhar



[Computer Graphics Forum (Eurographics 2022)](https://eg2022.univ-reims.fr/)









[Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators](/publication/2022-04_efficient-token-mixing-transformers-adaptive-fourier-neural-operators)















["This is my unicorn, Fluffy": Personalizing frozen vision-language representations](/publication/2022-04_my-unicorn-fluffy-personalizing-frozen-vision-language-representations)

Niv Cohen, Rinon Gal, [Gal Chechik](/person/gal-chechik), [Yuval Atzmon](/person/yuval-atzmon)



[ECCV 2022](https://eccv2022.ecva.net/)



Oral





[Learning Contrastive Representation for Semantic Correspondence](/index.php/publication/2022-03_learning-contrastive-representation-semantic-correspondence)

Taihong Xiao, [Sifei Liu](/index.php/person/sifei-liu), [Shalini De Mello](/index.php/person/shalini-de-mello), [Zhiding Yu](/index.php/person/zhiding-yu), [Jan Kautz](/index.php/person/jan-kautz), Ming-Hsuan Yang



[International Journal of Computer Vision (IJCV) 2022](https://link.springer.com/article/10.1007/s11263-022-01602-y)









[PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation](/publication/2022-03_predictionnet-real-time-joint-probabilistic-traffic-prediction-planning-control)

Alexey Kamenev, Lirui Wang, Ollin Boer Bohan, Ishwar Kulkarni, Bilal Kartal, Artem Molchanov, [Stan Birchfield](/person/stan-birchfield), David Nister, Nikolai Smolyanskiy



ICRA 2022









[Tackling the Generative Learning Trilemma with Denoising Diffusion GANs](/publication/2022-03_tackling-generative-learning-trilemma-denoising-diffusion-gans-0)

Zhisheng Xiao, [Karsten Kreis](/person/karsten-kreis), [Arash Vahdat](/person/arash-vahdat)



[International Conference on Learning Representations (ICLR) 2022 (Spotlight)](https://arxiv.org/abs/2112.07804)









[Score-Based Generative Modeling with Critically-Damped Langevin Diffusion](/publication/2022-03_score-based-generative-modeling-critically-damped-langevin-diffusion)

Tim Dockhorn, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[International Conference on Learning Representations (ICLR) 2022 (Spotlight)](https://arxiv.org/abs/2112.07068)









[Single-Stage Keypoint-Based Category-Level Object Pose Estimation from an RGB Image](/index.php/publication/2022-02_single-stage-keypoint-based-category-level-object-pose-estimation-rgb-image)

Yunzhi Lin, [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Stephen Tyree](/index.php/person/stephen-tyree), Patricio A. Vela, [Stan Birchfield](/index.php/person/stan-birchfield)



ICRA 2022









[Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation](/publication/2022-01_keypoint-based-category-level-object-pose-tracking-rgb-sequence-uncertainty)

Yunzhi Lin, [Jonathan Tremblay](/person/jonathan-tremblay), [Stephen Tyree](/person/stephen-tyree), Patricio A. Vela, [Stan Birchfield](/person/stan-birchfield)



ICRA 2022









[Displacement-Invariant Cost Computation for Efficient Stereo Matching](/publication/2022-01_displacement-invariant-cost-computation-efficient-stereo-matching)

Yiran Zhong, [Charles Loop](/person/charles-loop), [Wonmin Byeon](/person/wonmin-byeon), [Stan Birchfield](/person/stan-birchfield), et al.



[IJCV](https://link.springer.com/article/10.1007/s11263-022-01595-8)









[RTMV: A Ray-Traced Multi-View Synthetic Dataset for Novel View Synthesis](/index.php/publication/2022-01_rtmv-ray-traced-multi-view-synthetic-dataset-novel-view-synthesis)

[Jonathan Tremblay](/index.php/person/jonathan-tremblay), Moustafa Meshry, Alex Evans, [Jan Kautz](/index.php/person/jan-kautz), [Alex Keller](/index.php/person/alex-keller), Sameh Khamis, [Charles Loop](/index.php/person/charles-loop), Nate Morrical, [Thomas Müller](/index.php/person/thomas-muller), [Koki Nagano](/index.php/person/koki-nagano), Towaki Takikawa, [Stan Birchfield](/index.php/person/stan-birchfield)



[ECCV 2022 Workshop on Learning to Generate 3D Shapes and Scenes](https://learn3dg.github.io/)









### 2021 

[EditGAN: High-Precision Semantic Image Editing](/publication/2021-12_editgan-high-precision-semantic-image-editing)

Huan Ling, [Karsten Kreis](/person/karsten-kreis), Daiqing Li, Seung Wook Kim, Antonio Torralba, Sanja Fidler



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2111.03186)









[KAMA: 3D Keypoint Aware Body Mesh Articulation](/publication/2021-12_kama-3d-keypoint-aware-body-mesh-articulation)

[Umar Iqbal](/person/umar-iqbal), Kevin Xie, Kelly Guo, [Jan Kautz](/person/jan-kautz), [Pavlo Molchanov](/person/pavlo-molchanov)



[International Conference on 3D Vision](https://3dv2021.surrey.ac.uk/)









[Standard vs. Learning-based Codecs for Real Time Endoscopic Video Transmission](/publication/2021-11_standard-vs-learning-based-codecs-real-time-endoscopic-video-transmission)

[Iuri Frosio](/person/iuri-frosio), Aldo Marzullo, Martina Golini, Elena De Momi, Michele Catellani, Francesco Calimeri, Giuseppe Fiameni



[AIABI-2021 Italian Workshop on Artificial Intelligence and Applications for Bus…](http://ceur-ws.org/Vol-3102/)









[Extracting Triangular 3D Models, Materials, and Lighting From Images](/publication/2021-11_extracting-triangular-3d-models-materials-and-lighting-images)

[Jacob Munkberg](/person/jacob-munkberg), [Jon Hasselgren](/person/jon-hasselgren), Tianchang Shen, Jun Gao, Wenzheng Chen, Alex Evans, [Thomas Müller](/person/thomas-muller), Sanja Fidler



[CVPR 2022 (Oral)](https://openaccess.thecvf.com/content/CVPR2022/papers/Munkberg_Extracting_Triangular_3D_Models_Materials_and_Lighting_From_Images_CVPR_2022_paper.pdf)









[Noise-Aware Video Saliency Prediction](/publication/2021-11_noise-aware-video-saliency-prediction)

[Ekta Prashnani](/person/ekta-prashnani), Orazio Gallo, [Joohwan Kim](/person/joohwan-kim), [Josef Spjut](/person/josef-spjut), Pradeep Sen, [Iuri Frosio](/person/iuri-frosio)



[The British Machine Vision Conference (BMVC) - 2021](https://www.bmvc2021.com/)









[Controllable and Compositional Generation with Latent-Space Energy-Based Models](/publication/2021-11_controllable-and-compositional-generation-latent-space-energy-based-models)

Weili Nie, [Arash Vahdat](/person/arash-vahdat), Anima Anandkumar



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2110.10873)









[Don’t Generate Me: Training Differentially Private Generative Models with Sinkhorn Divergence](/publication/2021-11_don-t-generate-me-training-differentially-private-generative-models-sinkhorn)

Tianshi Cao, Alex Bie, [Arash Vahdat](/person/arash-vahdat), Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2111.01177)









[Score-based Generative Modeling in Latent Space](/publication/2021-11_score-based-generative-modeling-latent-space)

[Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis), [Jan Kautz](/person/jan-kautz)



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2106.05931)









[A Contrastive Learning Approach for Training Variational Autoencoder Priors](/publication/2021-11_contrastive-learning-approach-training-variational-autoencoder-priors)

Jyoti Aneja, Alexander Schwing, [Jan Kautz](/person/jan-kautz), [Arash Vahdat](/person/arash-vahdat)



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2010.02917)









[MTP: Multi-Hypothesis Tracking and Prediction for Reduced Error Propagation](/publication/2021-10_mtp-multi-hypothesis-tracking-and-prediction-reduced-error-propagation)

Xinshuo Weng, [Boris Ivanovic](/person/boris-ivanovic), [Marco Pavone](/person/marco-pavone)



[CVPR](https://cvpr2021.thecvf.com/)









[Self-Supervised Object Detection via Generative Image Synthesis](/index.php/publication/2021-10_self-supervised-object-detection-generative-image-synthesis)

Siva Karthik Mustikovela, [Shalini De Mello](/index.php/person/shalini-de-mello), Aayush Prakash, [Umar Iqbal](/index.php/person/umar-iqbal), [Sifei Liu](/index.php/person/sifei-liu), Thu Nguyen-Phuoc, Carsten Rother, [Jan Kautz](/index.php/person/jan-kautz)



[International Conference on Computer Vision (ICCV) 2021](https://iccv2021.thecvf.com/)









[Self-Supervised Real-to-Sim Scene Generation](/index.php/publication/2021-10_self-supervised-real-sim-scene-generation)

Aayush Prakash, Shoubhik Debnath, Jean-Francois Lafleche, Eric Cameracci, Gavriel State, [Stan Birchfield](/index.php/person/stan-birchfield), Marc T. Law



[ICCV 2021](https://iccv2021.thecvf.com/home)









[Multi-View Fusion for Multi-Level Robotic Scene Understanding](/index.php/publication/2021-09_multi-view-fusion-multi-level-robotic-scene-understanding)

Yunzhi Lin, [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Stephen Tyree](/index.php/person/stephen-tyree), Patricio A. Vela, [Stan Birchfield](/index.php/person/stan-birchfield)



[IROS 2021](https://www.iros2021.org/)









[Weakly-Supervised Physically Unconstrained Gaze Estimation](/publication/2021-06_weakly-supervised-physically-unconstrained-gaze-estimation)

Rakshit Kothari, [Shalini De Mello](/person/shalini-de-mello), [Umar Iqbal](/person/umar-iqbal), [Wonmin Byeon](/person/wonmin-byeon), Seonwook Park, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021](http://cvpr2021.thecvf.com/)



Oral





[Learning to Track Instances without Video Annotations](/publication/2021-06_learning-track-instances-without-video-annotations)

Yang Fu, [Sifei Liu](/person/sifei-liu), [Umar Iqbal](/person/umar-iqbal), [Shalini De Mello](/person/shalini-de-mello), Humphrey Shi, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021](http://cvpr2021.thecvf.com/)



Oral





[Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes](/index.php/nglod)

Towaki Takikawa, Joey Litalien, Kangxue Yin, [Karsten Kreis](/index.php/person/karsten-kreis), [Charles Loop](/index.php/person/charles-loop), Derek Nowrouzezahrai, Alec Jacobson, Morgan McGuire, Sanja Fidler



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021](https://cvpr2021.thecvf.com/)









[Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization](/publication/2021-06_semantic-segmentation-generative-models-semi-supervised-learning-and-strong-out)

Daiqing Li, Junlin Yang, [Karsten Kreis](/person/karsten-kreis), Antonio Torralba, Sanja Fidler



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021](https://arxiv.org/abs/2104.05833)









[Binary TTC: A Temporal Geofence for Autonomous Navigation](/publication/2021-06_binary-ttc-temporal-geofence-autonomous-navigation)

[Abhishek Badki](/person/abhishek-badki), Orazio Gallo, [Jan Kautz](/person/jan-kautz), Pradeep Sen



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021](https://cvpr2021.thecvf.com/)



Best Student Paper Honorable Mention, CVPR 2021





[One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing](/publication/2021-06_one-shot-free-view-neural-talking-head-synthesis-video-conferencing)

[Ting-Chun Wang](/person/ting-chun-wang), Arun Mallya, [Ming-Yu Liu](/person/ming-yu-liu)



[CVPR](https://cvpr2021.thecvf.com/)









[Deep Two-View Structure-from-Motion Revisited](/index.php/publication/2021-06_deep-two-view-structure-motion-revisited)

Jianyuan Wang, Yiran Zhong, Yuchao Dai, [Stan Birchfield](/index.php/person/stan-birchfield), Kaihao Zhang, Nikolai Smolyanskiy, Hongdong Li



[CVPR 2021](https://cvpr2021.thecvf.com/)









[See through Gradients: Image Batch Recovery via GradInversion](/index.php/publication/2021-06_see-through-gradients-image-batch-recovery-gradinversion)

[Hongxu Danny Yin](/index.php/person/danny-yin), Arun Mallya, [Arash Vahdat](/index.php/person/arash-vahdat), Jose M. Alvarez, [Jan Kautz](/index.php/person/jan-kautz), [Pavlo Molchanov](/index.php/person/pavlo-molchanov)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021](https://openaccess.thecvf.com/content/CVPR2021/papers/Yin_See_Through_Gradients_Image_Batch_Recovery_via_GradInversion_CVPR_2021_paper.pdf)









[Optimal Quantization Using Scaled Codebook](/index.php/publication/2021-06_optimal-quantization-using-scaled-codebook)

Yerlan Idelbayev, [Pavlo Molchanov](/index.php/person/pavlo-molchanov), Maying Shen, [Hongxu Danny Yin](/index.php/person/danny-yin), Miguel A. Carreira-Perpinan, Jose M. Alvarez



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021](https://openaccess.thecvf.com/content/CVPR2021/html/Idelbayev_Optimal_Quantization_Using_Scaled_Codebook_CVPR_2021_paper.html)









[DexYCB: A Benchmark for Capturing Hand Grasping of Objects](/index.php/publication/2021-06_dexycb-benchmark-capturing-hand-grasping-objects)

[Yu-Wei Chao](/index.php/person/yu-wei-chao), [Wei Yang](/index.php/person/wei-yang), Yu Xiang, [Pavlo Molchanov](/index.php/person/pavlo-molchanov), Ankur Handa, [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Yashraj Narang](/index.php/person/yashraj-narang), Karl Van Wyk, [Umar Iqbal](/index.php/person/umar-iqbal), [Stan Birchfield](/index.php/person/stan-birchfield), [Jan Kautz](/index.php/person/jan-kautz), Dieter Fox



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021](http://cvpr2021.thecvf.com)









[VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models](/publication/2021-06_vaebm-symbiosis-between-variational-autoencoders-and-energy-based-models)

Zhisheng Xiao, [Karsten Kreis](/person/karsten-kreis), [Jan Kautz](/person/jan-kautz), [Arash Vahdat](/person/arash-vahdat)



[International Conference on Learning Representations (ICLR) 2021 (Spotlight)](https://arxiv.org/abs/2010.00654)









[NViSII: A Scriptable Tool for Photorealistic Image Generation](/index.php/publication/2021-05_nvisii-scriptable-tool-photorealistic-image-generation)

Nathan Morrical, [Jonathan Tremblay](/index.php/person/jonathan-tremblay), Yunzhi Lin, [Stephen Tyree](/index.php/person/stephen-tyree), [Stan Birchfield](/index.php/person/stan-birchfield), Valerio Pascucci, Ingo Wald



SDG Workshop at ICLR 2021









[RGB-D Local Implicit Function for Depth Completion of Transparent Objects](/index.php/publication/2021-03_rgb-d-local-implicit-function-depth-completion-transparent-objects)

Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox



[CVPR 2021](http://cvpr2021.thecvf.com/)









[Reactive Human-to-Robot Handovers of Arbitrary Objects](/index.php/publication/2021-03_reactive-human-robot-handovers-arbitrary-objects)

[Wei Yang](/index.php/person/wei-yang), Chris Paxton, Arsalan Mousavian, [Yu-Wei Chao](/index.php/person/yu-wei-chao), Maya Cakmak, Dieter Fox



ICRA 2021



Best Paper in Human-Robot Interaction, ICRA 2021





[Robust Vision-Based Cheat Detection in Competitive Gaming](/publication/2021-03_robust-vision-based-cheat-detection-competitive-gaming)

Aditya Jonnalagadda, [Iuri Frosio](/person/iuri-frosio), Seth Schneider, Morgan McGuire, [Joohwan Kim](/person/joohwan-kim)



[I3D ’21](http://i3dsymposium.github.io/2021/)









[Self-Supervised Learning for Domain Adaptation on Point-Clouds](/publication/2021-01_self-supervised-learning-domain-adaptation-point-clouds)

Idan Achituve, [Haggai Maron](/person/haggai-maron), [Gal Chechik](/person/gal-chechik)



[Winter Conference on Applications of Computer Vision (WACV), 2021](https://arxiv.org/pdf/2003.12641.pdf)









[From Generalized Zero-Shot Learning to Long-Tail with Class Descriptors](/publication/2021-01_generalized-zero-shot-learning-long-tail-class-descriptors)

Dvir Samuel, [Yuval Atzmon](/person/yuval-atzmon), [Gal Chechik](/person/gal-chechik)



[Winter Conference on Applications of Computer Vision (WACV) 2021](http://wacv2021.thecvf.com/home)









[Data-Free Knowledge Distillation for Object Detection](/index.php/publication/2021-01_data-free-knowledge-distillation-object-detection)

Akshay Chawla, [Hongxu Danny Yin](/index.php/person/danny-yin), [Pavlo Molchanov](/index.php/person/pavlo-molchanov), Jose M. Alvarez



[WACV 2021](https://openaccess.thecvf.com/content/WACV2021/html/Chawla_Data-Free_Knowledge_Distillation_for_Object_Detection_WACV_2021_paper.html)









### 2020 

[Online Adaptation for Consistent Mesh Reconstruction in the Wild](/publication/2020-12_online-adaptation-consistent-mesh-reconstruction-wild)

Xueting Li, [Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), Kihwan Kim, Xiaolong Wang, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[Neural Information Processing Systems (NeurIPS) 2020](https://nips.cc/virtual/2020/public/poster_aba3b6fd5d186d28e06ff97135cade7f.html)









[Self-Learning Transformations for Improving Gaze and Head Redirection](/publication/2020-12_self-learning-transformations-improving-gaze-and-head-redirection)

Yufeng Zheng, Seonwook Park, Xucong Zhang, [Shalini De Mello](/person/shalini-de-mello), Otmar Hilliges



[Neural Information Processing Systems (NeurIPS) 2020](https://proceedings.neurips.cc/paper/2020)









[A Causal View of Compositional Zero-Shot Recognition](/publication/2020-12_causal-view-compositional-zero-shot-recognition)

[Yuval Atzmon](/person/yuval-atzmon), Felix Kreuk, Uri Shalit, [Gal Chechik](/person/gal-chechik)



[Neural Information Processing Systems (NeurIPS) 2020 (Spotlight)](https://neurips.cc/Conferences/2020/)









[Neural Networks with Recurrent Generative Feedback](/publication/2020-12_neural-networks-recurrent-generative-feedback)

Yujia Huang, James Gornet, Sihui Dai, [Zhiding Yu](/person/zhiding-yu), Tan Nguyen, Doris Y. Tsao, Anima Anandkumar



[Conference on Neural Information Processing Systems (NeurIPS) 2020](https://nips.cc/Conferences/2020)









[Learning Deformable Tetrahedral Meshes for 3D Reconstruction](/index.php/publication/2020-12_learning-deformable-tetrahedral-meshes-3d-reconstruction)

Jun Gao, Wenzheng Chen, Tommy Xiang, Clement Fuji Tsang, Alec Jacobson, Morgan McGuire



[NeurIPS](https://nips.cc/)









[Variational Amodal Object Completion](/publication/2020-12_variational-amodal-object-completion)

Huan Ling, David Acuna, [Karsten Kreis](/person/karsten-kreis), Seung Wook Kim, Sanja Fidler



[Neural Information Processing Systems (NeurIPS) 2020](https://papers.nips.cc/paper/2020/hash/bacadc62d6e67d7897cef027fa2d416c-Abstract.html)









[Generative View Synthesis: From Single-View Semantics to Novel-View Images](/index.php/publication/2020-12_generative-view-synthesis-single-view-semantics-novel-view-images)

Tewodros Habtegebrial, Varun Jampani, Orazio Gallo, Didier Stricker



[NeurIPS](https://nips.cc/virtual/2020/public/index.html)









[Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning](/publication/2020-12_bongard-logo-new-benchmark-human-level-concept-learning-and-reasoning)

Weili Nie, [Zhiding Yu](/person/zhiding-yu), Lei Mao, Ankit B. Patel, [Yuke Zhu](/person/yuke-zhu), Anima Anandkumar



[Conference on Neural Information Processing Systems (NeurIPS) 2020 (Spotlight)](https://nips.cc/Conferences/2020)









[Neural FFTs for Universal Texture Image Synthesis](/publication/2020-12_neural-ffts-universal-texture-image-synthesis)

[Morteza Mardani](/person/morteza-mardani), Guilin Liu, Aysegul Dundar, Shiqiu Liu, Andrew Tao, Bryan Catanzaro



[NeurIPS 2020](https://proceedings.neurips.cc/paper/2020/hash/a23156abfd4a114c35b930b836064e8b-Abstract.html)









[ZEST: Zero-shot Learning from Text Descriptions using Textual Similarity and Visual Summarization](/publication/2020-11_zest-zero-shot-learning-text-descriptions-using-textual-similarity-and-visual)

Tzuf Paz-Argaman, [Yuval Atzmon](/person/yuval-atzmon), [Gal Chechik](/person/gal-chechik), Reut Tsarfaty



[Findings of EMNLP](https://2020.emnlp.org/)









[Learning Object Permanence from Video](/publication/2020-10_learning-object-permanence-video)

Aviv Shamsian, Ofri Kleinfeld, Amir Globerson, [Gal Chechik](/person/gal-chechik)



[ECCV 2020](https://arxiv.org/abs/2006.15327)









[LAMP: Large Deep Nets with Automated Model Parallelism for Image Segmentation](/index.php/publication/2020-10_lamp-large-deep-nets-automated-model-parallelism-image-segmentation)

Wentao Zhu, [Can Zhao](/index.php/person/can-zhao), [Wenqi Li](/index.php/person/wenqi-li), [Holger Roth](/index.php/person/holger-roth), [Ziyue Xu](/index.php/person/ziyue-xu), [Daguang Xu](/index.php/person/daguang-xu)



[MICCAI 2020](https://miccai2020.org/en/)









[UNAS: Differentiable Architecture Search Meets Reinforcement Learning](/publication/2020-08_unas-differentiable-architecture-search-meets-reinforcement-learning)

[Arash Vahdat](/person/arash-vahdat), Arun Mallya, [Ming-Yu Liu](/person/ming-yu-liu), [Jan Kautz](/person/jan-kautz)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020](https://arxiv.org/abs/1912.07651)









[Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification](/publication/2020-08_joint-disentangling-and-adaptation-cross-domain-person-re-identification)

Yang Zou, Xiaodong Yang, [Zhiding Yu](/person/zhiding-yu), B. V. K. Vijaya Kumar, [Jan Kautz](/person/jan-kautz)



[European Conference on Computer Vision (ECCV) 2020 (Oral)](https://eccv2020.eu/)









[UFO2: A Unified Framework towards Omni-supervised Object Detection](/publication/2020-08_ufo2-unified-framework-towards-omni-supervised-object-detection)

Zhongzheng Ren, [Zhiding Yu](/person/zhiding-yu), Xiaodong Yang, [Ming-Yu Liu](/person/ming-yu-liu), Alexander G. Schwing, [Jan Kautz](/person/jan-kautz)



[European Conference on Computer Vision (ECCV) 2020](https://eccv2020.eu/)









[Weakly-Supervised 3D Hand Pose Estimation via Biomechanical Constraints](/publication/2020-08_weakly-supervised-3d-hand-pose-estimation-biomechanical-constraints)

Adrian Spurr, [Umar Iqbal](/person/umar-iqbal), [Pavlo Molchanov](/person/pavlo-molchanov), Otmar Hilliges, [Jan Kautz](/person/jan-kautz)



[European Conference on Computer Vision, 2020](https://eccv2020.eu/)









[World-Consistent Video-to-Video Synthesis](/publication/2020-08_world-consistent-video-video-synthesis)

Arun Mallya, [Ting-Chun Wang](/person/ting-chun-wang), Karan Sapra, [Ming-Yu Liu](/person/ming-yu-liu)



[ECCV](https://eccv2020.eu/)









[Self-supervised Single-view 3D Reconstruction via Semantic Consistency](/publication/2020-08_self-supervised-single-view-3d-reconstruction-semantic-consistency)

Xueting Li, [Sifei Liu](/person/sifei-liu), Kihwan Kim, [Shalini De Mello](/person/shalini-de-mello), Varun Jampani, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[European Conference on Computer Vision (ECCV) 2020](https://eccv2020.eu/)









[Indirect Object-to-Robot Pose Estimation from an External Monocular RGB Camera](/index.php/publication/2020-07_indirect-object-robot-pose-estimation-external-monocular-rgb-camera)

[Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Stephen Tyree](/index.php/person/stephen-tyree), Terry Mosier, [Stan Birchfield](/index.php/person/stan-birchfield)



IROS 2020









[Weakly Supervised One-stage Vision and Language Disease Detection using Large Scale Pneumonia and Pneumothorax Studies](/index.php/publication/2020-07_weakly-supervised-one-stage-vision-and-language-disease-detection-using-large)

Leo Tam, Xiaosong Wang, Evrim Turkbey, Kevin Lu, Yuhong Wen, [Daguang Xu](/index.php/person/daguang-xu)



[MICCAI 2020](https://arxiv.org/abs/2007.15778)









[Semi-Supervised StyleGAN for Disentanglement Learning](/publication/2020-07_semi-supervised-stylegan-disentanglement-learning)

Weili Nie, [Tero Karras](/person/tero-karras), Animesh Garg, Shoubhik Debnath, Anjul Patney, Ankit B. Patel, Anima Anandkumar



[International Conference on Machine Learning (ICML) 2020](https://icml.cc/virtual/2020)









[Angular Visual Hardness](/publication/2020-07_angular-visual-hardness)

Beidi Chen, Weiyang Liu, [Zhiding Yu](/person/zhiding-yu), [Jan Kautz](/person/jan-kautz), Anshumali Shrivastava, Animesh Garg, Anima Anandkumar



[International Conference on Machine Learning (ICML) 2020](https://icml.cc/virtual/2020)









[Automated Synthetic-to-Real Generalization](/index.php/publication/2020-07_automated-synthetic-real-generalization)

Wuyang Chen, [Zhiding Yu](/index.php/person/zhiding-yu), Zhangyang Wang, Anima Anandkumar



[International Conference on Machine Learning (ICML) 2020](https://icml.cc/virtual/2020)









[NVAE: A Deep Hierarchical Variational Autoencoder](/publication/2020-07_nvae-deep-hierarchical-variational-autoencoder)

[Arash Vahdat](/person/arash-vahdat), [Jan Kautz](/person/jan-kautz)



[Neural Information Processing Systems (NeurIPS) 2020 (Spotlight)](https://arxiv.org/abs/2007.03898)









[Contrastive Learning for Weakly Supervised Phrase Grounding](/publication/2020-06_contrastive-learning-weakly-supervised-phrase-grounding)

Tanmay Gupta, [Arash Vahdat](/person/arash-vahdat), [Gal Chechik](/person/gal-chechik), Xiaodong Yang, [Jan Kautz](/person/jan-kautz), Derek Hoiem



[European Conference on Computer Vision (ECCV) 2020 (Spotlight)](https://arxiv.org/abs/2006.09920)









[Meshlet Priors for 3D Mesh Reconstruction](/publication/2020-06_meshlet-priors-3d-mesh-reconstruction)

Abhishek Badki, Orazio Gallo, [Jan Kautz](/person/jan-kautz), Pradeep Sen



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









[Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection](/publication/2020-06_instance-aware-context-focused-and-memory-efficient-weakly-supervised-object)

Zhongzheng Ren, [Zhiding Yu](/person/zhiding-yu), Xiaodong Yang, [Ming-Yu Liu](/person/ming-yu-liu), Yong Jae Lee, Alexander G. Schwing, [Jan Kautz](/person/jan-kautz)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









[Regularizing Neural Networks via Minimizing Hyperspherical Energy](/publication/2020-06_regularizing-neural-networks-minimizing-hyperspherical-energy)

Weiyang Liu, Rongmei Lin, Zhen Liu, Chen Feng, [Zhiding Yu](/person/zhiding-yu), James M. Rehg, Li Xiong, Le Song



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









[Bi3D: Stereo Depth Estimation via Binary Classifications](/publication/2020-06_bi3d-stereo-depth-estimation-binary-classifications)

Abhishek Badki, Alejandro Troccoli, Kihwan Kim, [Jan Kautz](/person/jan-kautz), Pradeep Sen, Orazio Gallo



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









[Self-Supervised Viewpoint Learning From Image Collections](/publication/2020-06_self-supervised-viewpoint-learning-image-collections)

Siva Karthik Mustikovela, Varun Jampani, [Shalini De Mello](/person/shalini-de-mello), [Sifei Liu](/person/sifei-liu), [Umar Iqbal](/person/umar-iqbal), Carsten Rother, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









[Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion](/index.php/publication/2020-06_dreaming-distill-data-free-knowledge-transfer-deepinversion)

[Hongxu Danny Yin](/index.php/person/danny-yin), [Pavlo Molchanov](/index.php/person/pavlo-molchanov), Jose M. Alvarez, Zhizhong Li, Arun Mallya, Derek Hoiem, Niraj K. Jha, [Jan Kautz](/index.php/person/jan-kautz)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020 (Ora…](https://openaccess.thecvf.com/content_CVPR_2020/papers/Yin_Dreaming_to_Distill_Data-Free_Knowledge_Transfer_via_DeepInversion_CVPR_2020_paper.pdf)









[Novel View Synthesis of Dynamic Scenes with Globally Coherent Depths](/publication/2020-06_novel-view-synthesis-dynamic-scenes-globally-coherent-depths)

Jae Shin Yoon, Kihwan Kim, Orazio Gallo, Hyunsoo Park, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









[Two-shot Spatially-varying BRDF and Shape Estimation](/publication/2020-06_two-shot-spatially-varying-brdf-and-shape-estimation)

Mark Boss, Varun Jampani, Kihwan Kim, Hendrik P.A. Lensch, [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020](http://cvpr2020.thecvf.com/)









[MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views](/publication/2020-06_mvlidarnet-real-time-multi-class-scene-understanding-autonomous-driving-using)

Ke Chen, Ryan Oldja, Nikolai Smolyanskiy, [Stan Birchfield](/person/stan-birchfield), Alexander Popov, David Wehr, Ibrahim Eden, Joachim Pehserl



IROS 2020









[Learning Canonical Representations for Scene Graph to Image Generation](/publication/2020-06_learning-canonical-representations-scene-graph-image-generation)

Roei Herzig, Amir Bar, Huijuan Xu, [Gal Chechik](/person/gal-chechik), Trevor Darrell, Amir Globerson



[ECCV 2020](https://arxiv.org/abs/1912.07414)









[6-DOF Grasping for Target-driven Object Manipulation in Clutter](/index.php/publication/2020-06_6-dof-grasping-target-driven-object-manipulation-clutter)

Adithyavairavan Murali, Arsalan Mousavian, Clemens Eppner, Chris Paxton, Dieter Fox



[ICRA 2020](https://www.icra2020.org)



Best Paper Finalist in Robot Manipulation, ICRA 2020; Best Student Paper Finalist, ICRA 2020





[Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild](/publication/2020-06_weakly-supervised-3d-human-pose-learning-multi-view-images-wild)

[Umar Iqbal](/person/umar-iqbal), [Pavlo Molchanov](/person/pavlo-molchanov), [Jan Kautz](/person/jan-kautz)



[IEEE Computer Vision and Pattern Recognition](http://cvpr2020.thecvf.com/)









[Self-supervised 6D Object Pose Estimation for Robot Manipulation](/publication/2020-05_self-supervised-6d-object-pose-estimation-robot-manipulation)

Xinke Deng, Yu Xiang, Arsalan Mousavian, Clemens Eppner, Timothy Bretl, Dieter Fox



[2020 IEEE International Conference on Robotics and Automation (ICRA)](https://ieeexplore.ieee.org/document/9196714)









[Camera-to-Robot Pose Estimation from a Single Image](/publication/2020-05_camera-robot-pose-estimation-single-image)

Timothy E. Lee, [Jonathan Tremblay](/person/jonathan-tremblay), Thang To, Jia Cheng, Terry Mosier, Oliver Kroemer, Dieter Fox, [Stan Birchfield](/person/stan-birchfield)



ICRA 2020









[Toward Sim-to-Real Directional Semantic Grasping](/publication/2020-05_toward-sim-real-directional-semantic-grasping)

Shariq Iqbal, [Jonathan Tremblay](/person/jonathan-tremblay), Thang To, Jia Cheng, Erik Leitch, Andy Campbell, Kirby Leung, Duncan McKay, [Stan Birchfield](/person/stan-birchfield)



ICRA 2020









[How to Close Sim-Real Gap? Transfer with Segmentation!](/publication/2020-05_how-close-sim-real-gap-transfer-segmentation)

Mengyuan Yan, Qingyun Sun, [Iuri Frosio](/person/iuri-frosio), [Stephen Tyree](/person/stephen-tyree), [Jan Kautz](/person/jan-kautz)



arXiv









[DexPilot: Vision Based Teleoperation of Dexterous Robotic Hand-Arm System](/index.php/publication/2020-05_dexpilot-vision-based-teleoperation-dexterous-robotic-hand-arm-system)

Ankur Handa, Karl Van Wyk, [Wei Yang](/index.php/person/wei-yang), Jacky Liang, [Yu-Wei Chao](/index.php/person/yu-wei-chao), Qian Wan, [Stan Birchfield](/index.php/person/stan-birchfield), Nathan Ratliff, Dieter Fox



[ICRA 2020](http://icra2020.org/)









[SymGAN: Orientation Estimation without Annotation for Symmetric Objects](/publication/2020-03_symgan-orientation-estimation-without-annotation-symmetric-objects)

Phil Ammirato, [Jonathan Tremblay](/person/jonathan-tremblay), [Ming-Yu Liu](/person/ming-yu-liu), Alexander Berg, Dieter Fox



[WACV](https://wacv20.wacv.net/)









[NRMVS: Non-Rigid Multi-view Stereo](/publication/2020-03_nrmvs-non-rigid-multi-view-stereo)

Matthias Innmann, Kihwan Kim, Jinwei Gu, Matthias Niessner, [Charles Loop](/person/charles-loop), Marc Stamminger, [Jan Kautz](/person/jan-kautz)



[IEEE Winter Conference on Applications of Computer Vision (WACV ’20)](https://wacv20.wacv.net/)









[NeurReg: Neural Registration and Its Application to Image Segmentation](/publication/2020-03_neurreg-neural-registration-and-its-application-image-segmentation)

Wentao Zhu, [Andriy Myronenko](/person/andriy-myronenko), [Ziyue Xu](/person/ziyue-xu), [Wenqi Li](/person/wenqi-li), [Holger Roth](/person/holger-roth), Yufang Huang, Fausto Milletari, [Daguang Xu](/person/daguang-xu)



[WACV](http://openaccess.thecvf.com/content_WACV_2020/papers/Zhu_NeurReg_Neural_Registration_and_Its_Application_to_Image_Segmentation_WACV_2020_paper.pdf)









[Domain Stylization: A Fast Covariance Matching Framework towards Domain Adaptation](/publication/2020-01_domain-stylization-fast-covariance-matching-framework-towards-domain-adaptation)

Aysegul Dundar, [Ming-Yu Liu](/person/ming-yu-liu), [Zhiding Yu](/person/zhiding-yu), [Ting-Chun Wang](/person/ting-chun-wang), John Zedlewski, [Jan Kautz](/person/jan-kautz)



[IEEE Transactions on Pattern Analysis and Machine Intelligence](https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=34)









[Displacement-Invariant Cost Computation for Efficient Stereo Matching](/index.php/publication/2020-01_displacement-invariant-cost-computation-efficient-stereo-matching)

Yiran Zhong, [Charles Loop](/index.php/person/charles-loop), [Wonmin Byeon](/index.php/person/wonmin-byeon), [Stan Birchfield](/index.php/person/stan-birchfield), Yuchao Dai, Kaihao Zhang, Alexey Kamenev, [Thomas Breuel](/index.php/person/thomas-breuel), Hongdong Li, [Jan Kautz](/index.php/person/jan-kautz)



arXiv









[Improving Deep Stereo Network Generalization with Geometric Priors](/index.php/publication/2020-01_improving-deep-stereo-network-generalization-geometric-priors)

Jialiang Wang, Varun Jampani, Deqing Sun, [Charles Loop](/index.php/person/charles-loop), [Stan Birchfield](/index.php/person/stan-birchfield), [Jan Kautz](/index.php/person/jan-kautz)



arXiv









### 2019 

[Few-Shot Video-to-Video Synthesis](/publication/2019-12_few-shot-video-video-synthesis)

[Ting-Chun Wang](/person/ting-chun-wang), [Ming-Yu Liu](/person/ming-yu-liu), Andrew Tao, Guilin Liu, [Jan Kautz](/person/jan-kautz), Bryan Catanzaro



[NeurIPS](https://www.nips.cc/)









[Joint-task Self-supervised Learning for Temporal Correspondence](/publication/2019-12_joint-task-self-supervised-learning-temporal-correspondence)

Xueting Li, [Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), Xiaolong Wang, [Jan Kautz](/person/jan-kautz), Ming-Hsuan Yang



[Neural Information Processing Systems (NeurIPS) 2019](https://sites.google.com/view/uvc2019)









[Dance to Music](/publication/2019-12_dance-music)

Hsin-Ying Lee, [Xiaodong Yang](/person/xiaodong-yang), [Ming-Yu Liu](/person/ming-yu-liu), [Ting-Chun Wang](/person/ting-chun-wang), Yu-Ding Lu, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[NeurIPS](https://www.nips.cc)









[Joint Optimization for Cooperative Image Captioning](/publication/2019-11_joint-optimization-cooperative-image-captioning)

Gilad Vered, Gal Oren, [Yuval Atzmon](/person/yuval-atzmon), [Gal Chechik](/person/gal-chechik)



[International Conference on Computer Vision (ICCV)](http://openaccess.thecvf.com/content_ICCV_2019/papers/Vered_Joint_Optimization_for_Cooperative_Image_Captioning_ICCV_2019_paper.)









[Content-Consistent Generation of Realistic Eyes with Style](/index.php/publication/2019-11_content-consistent-generation-realistic-eyes-style)

Marcel Bühler, Seonwook Park, [Shalini De Mello](/index.php/person/shalini-de-mello), Xucong, Otmar Hilliges



International Conference on Computer Vision Workshop (ICCVW) 2019



Winner (1st place) Synthetic Eye Generation Challenge





[Few-Shot Adaptive Gaze Estimation](/publication/2019-10_few-shot-adaptive-gaze-estimation)

Seonwook Park, [Shalini De Mello](/person/shalini-de-mello), [Pavlo Molchanov](/person/pavlo-molchanov), [Umar Iqbal](/person/umar-iqbal), Otmar Hilliges, [Jan Kautz](/person/jan-kautz)



[International Conference on Computer Vision (ICCV) 2019](http://iccv2019.thecvf.com/program/overview)



Oral





[Neural Inverse Rendering of an Indoor Scene from a Single Image](/publication/2019-10_neural-inverse-rendering-indoor-scene-single-image)

Soumyadip Sengupta, Jinwei Gu, Kihwan Kim, Guilin Liu, David W. Jacobs, [Jan Kautz](/person/jan-kautz)



[IEEE International Conference on Computer Vision (ICCV 2019)](http://iccv2019.thecvf.com/)









[PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data](/index.php/publication/2019-10_pamtri-pose-aware-multi-task-learning-vehicle-re-identification-using-highly)

Zheng Tang, Milind Naphade, [Stan Birchfield](/index.php/person/stan-birchfield), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), William Hodge, Ratnesh Kumar, Shuo Wang, [Xiaodong Yang](/index.php/person/xiaodong-yang)



[ICCV 2019](http://iccv2019.thecvf.com/)









[Few-Shot Unsupervised Image-to-Image Translation](/index.php/publication/2019-10_few-shot-unsupervised-image-image-translation)

[Ming-Yu Liu](/index.php/person/ming-yu-liu), Xun Huang, Arun Mallya, [Tero Karras](/index.php/person/tero-karras), [Timo Aila](/index.php/person/timo-aila), [Jaakko Lehtinen](/index.php/person/jaakko-lehtinen), [Jan Kautz](/index.php/person/jan-kautz)



[ICCV](http://iccv2019.thecvf.com/)









[Meta-Sim: Learning to Generate Synthetic Datasets](/publication/2019-10_meta-sim-learning-generate-synthetic-datasets)

Amlan Kar, Aayush Prakash, [Ming-Yu Liu](/person/ming-yu-liu), Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler



[ICCV](http://iccv2019.thecvf.com/)









[Confidence Regularized Self-Training](/publication/2019-10_confidence-regularized-self-training)

Yang Zou, [Zhiding Yu](/person/zhiding-yu), Xiaofeng Liu, B. V. K. Vijaya Kumar, Jinsong Wang



[IEEE/CVF International Conference on Computer Vision (ICCV) 2019 (Oral)](http://iccv2019.thecvf.com/)









[SENSE: A Shared Encoder Network for Scene-flow Estimation](/index.php/publication/2019-10_sense-shared-encoder-network-scene-flow-estimation)

Huaizu Jiang, Deqing Sun, Varun Jampani, Zhaoyang Lv, Erik Learned-Miller, [Jan Kautz](/index.php/person/jan-kautz)



[International Conference on Computer Vision](http://iccv2019.thecvf.com)









[Extreme View Synthesis](/publication/2019-10_extreme-view-synthesis)

Inchang Choi, Orazio Gallo, Alejandro Troccoli, Min H. Kim, [Jan Kautz](/person/jan-kautz)



[IEEE International Conference on Computer Vision](http://iccv2019.thecvf.com/)









[PointFlow: 3D Point Cloud Generation with Continuous Normalizing Flows](/publication/2019-10_pointflow-3d-point-cloud-generation-continuous-normalizing-flows)

Guandao Yang, Xun Huang, Zekun Hao, [Ming-Yu Liu](/person/ming-yu-liu), Serge Belongie, Bharath Hariharan



[ICCV](http://iccv2019.thecvf.com/)









[6-DOF GraspNet: Variational Grasp Generation for Object Manipulation](/publication/2019-10_6-dof-graspnet-variational-grasp-generation-object-manipulation)

Arsalan Mousavian, Clemens Eppner, Dieter Fox



[ICCV 2019](http://iccv2019.thecvf.com/)









[Neural Turtle Graphics for Modeling City Road Layouts](/publication/2019-10_neural-turtle-graphics-modeling-city-road-layouts)

Hang Chu, Daiqing Li, David Acuna, Amlan Kar, Maria Shugrina, Xinkai Wei, [Ming-Yu Liu](/person/ming-yu-liu), Antonio Torralba, Sanja Fidler



[ICCV](http://iccv2019.thecvf.com/)









[Learning Propagation for Arbitrarily-Structured Data](/index.php/publication/2019-09_learning-propagation-arbitrarily-structured-data)

[Sifei Liu](/index.php/person/sifei-liu), Xueting Li, Varun Jampani, [Shalini De Mello](/index.php/person/shalini-de-mello), [Jan Kautz](/index.php/person/jan-kautz)



[International Conference on Computer Vision (ICCV) 2019](http://openaccess.thecvf.com/content_ICCV_2019/html/Liu_Learning_Propagation_for_Arbitrarily-Structured_Data_ICCV_2019_paper.htm)









[Video Stitching for Linear Camera Arrays](/publication/2019-09_video-stitching-linear-camera-arrays)

Wei-Sheng Lai, Orazio Gallo, [Jinwei Gu](/person/jinwei-gu), Deqing Sun, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[British Machine Vision Conference](https://bmvc2019.org/)









[Few-Shot Viewpoint Estimation](/index.php/publication/2019-09_few-shot-viewpoint-estimation)

Hung-Yu Tseng, [Shalini De Mello](/index.php/person/shalini-de-mello), [Jonathan Tremblay](/index.php/person/jonathan-tremblay), [Sifei Liu](/index.php/person/sifei-liu), [Stan Birchfield](/index.php/person/stan-birchfield), Ming-Hsuan Yang, [Jan Kautz](/index.php/person/jan-kautz)



[British Machine Vision Conference (BMVC) 2019](https://bmvc2019.org/programme/)









[Pixel-Adaptive Convolutional Neural Networks](/publication/2019-06_pixel-adaptive-convolutional-neural-networks)

Hang Su, Varun Jampani, Deqing Sun, Orazio Gallo, Erik Learned-Miller, [Jan Kautz](/person/jan-kautz)



[Computer Vision and Pattern Recognition (CVPR), 2019](http://cvpr2019.thecvf.com)









[CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification](/index.php/publication/2019-06_cityflow-city-scale-benchmark-multi-target-multi-camera-vehicle-tracking-and-re)

Zheng Tang, Milind Naphade, [Ming-Yu Liu](/index.php/person/ming-yu-liu), [Xiaodong Yang](/index.php/person/xiaodong-yang), [Stan Birchfield](/index.php/person/stan-birchfield), Shuo Wang, Ratnesh Kumar, David Anastasiu, Jenq-Neng Hwang



[CVPR 2019](http://cvpr2019.thecvf.com/)









[SCOPS: Self-Supervised Co-Part Segmentation](/publication/2019-06_scops-self-supervised-co-part-segmentation)

Wei-Chih Hung, Varun Jampani, [Sifei Liu](/person/sifei-liu), [Pavlo Molchanov](/person/pavlo-molchanov), Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[CVPR 2019](https://varunjampani.github.io/scops/)









[SIDOD: A Synthetic Image Dataset for 3D Object Pose Recognition with Distractors](/index.php/publication/2019-06_sidod-synthetic-image-dataset-3d-object-pose-recognition-distractors)

Mona Jalal, [Josef Spjut](/index.php/person/josef-spjut), [Ben Boudaoud](/index.php/person/ben-boudaoud), Margrit Betke



[WiCV](https://wicvworkshop.github.io/CVPR2019/index.html)









[Semantic Image Synthesis with Spatially-Adaptive Normalization](/publication/2019-06_semantic-image-synthesis-spatially-adaptive-normalization)

Taesung Park, [Ming-Yu Liu](/person/ming-yu-liu), [Ting-Chun Wang](/person/ting-chun-wang), Jun-Yan Zhu



[CVPR](http://cvpr2019.thecvf.com/)









[Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera](/publication/2019-06_neural-rgb-d-sensing-depth-and-uncertainty-video-camera)

Chao Liu, Jinwei Gu, Kihwan Kim, Srinivasa Narasimhan, [Jan Kautz](/person/jan-kautz)



[IEEE CVPR 2019 (Oral)](http://cvpr2019.thecvf.com/)









[Joint Discriminative and Generative Learning for Person Re-identification](/publication/2019-06_joint-discriminative-and-generative-learning-person-re-identification)

Zhedong Zheng, [Xiaodong Yang](/person/xiaodong-yang), [Zhiding Yu](/person/zhiding-yu), Liang Zheng, Yi Yang, [Jan Kautz](/person/jan-kautz)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019](http://cvpr2019.thecvf.com/)









[Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments](/publication/2019-06_putting-humans-scene-learning-affordance-3d-indoor-environments)

Xueting Li, [Sifei Liu](/person/sifei-liu), Kihwan Kim, Xiaolong Wang, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[CVPR 2019](https://sites.google.com/view/3d-affordance-cvpr19)









[STEP: Spatio-Temporal Progressive Learning for Video Action Detection](/publication/2019-06_step-spatio-temporal-progressive-learning-video-action-detection)

Xitong Yang, [Xiaodong Yang](/person/xiaodong-yang), [Ming-Yu Liu](/person/ming-yu-liu), Fanyi Xiao, Larry Davis, [Jan Kautz](/person/jan-kautz)



[CVPR](http://cvpr2019.thecvf.com/)









[PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image](/publication/2019-06_planercnn-3d-plane-detection-and-reconstruction-single-image)

Chen Liu, Kihwan Kim, Jinwei Gu, Yasutaka Furukawa, [Jan Kautz](/person/jan-kautz)



[IEEE CVPR 2019 (Oral)](http://cvpr2019.thecvf.com/)









[Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation](/publication/2019-06_competitive-collaboration-joint-unsupervised-learning-depth-camera-motion)

Anurag Ranjan, Varun Jampani, Lukas Balles, Kihwan Kim, Deqing Sun, Jonas Wulff, Michael J. Black



[IEEE CVPR 2019](http://cvpr2019.thecvf.com/)









[Structured Domain Randomization: Bridging the Reality Gap by Context-Aware Synthetic Data](/index.php/publication/2019-04_structured-domain-randomization-bridging-reality-gap-context-aware-synthetic)

Aayush Prakash, Shaad Boochoon, Mark Brophy, David Acuna, Eric Cameracci, Gavriel State, [Omer Shapira](/index.php/person/omer-shapira), [Stan Birchfield](/index.php/person/stan-birchfield)



[ICRA 2019](https://ras.papercept.net/conferences/conferences/ICRA19/program/ICRA19_ContentListWeb_1.html)









[Learning Linear Transformations for Fast Image and Video Style Transfer](/publication/2019-04_learning-linear-transformations-fast-image-and-video-style-transfer)

Xueting Li, [Sifei Liu](/person/sifei-liu), Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[CVPR 2019](https://sites.google.com/view/linear-style-transfer-cvpr19)









[Informative Object Annotations: Tell Me Something I Don't Know](/publication/2019-03_informative-object-annotations-tell-me-something-i-dont-know)

Lior Bracha, [Gal Chechik](/person/gal-chechik)



Computer Vision and Pattern Recognition









[Adaptive Confidence Smoothing for Generalized Zero-Shot Learning](/index.php/publication/2019-03_adaptive-confidence-smoothing-generalized-zero-shot-learning)

[Yuval Atzmon](/index.php/person/yuval-atzmon), [Gal Chechik](/index.php/person/gal-chechik)



Computer Vision and Pattern Recognition (CVPR) 2019









[Unsupervised Stylish Image Description Generation via Domain Layer Norm](/publication/2019-02_unsupervised-stylish-image-description-generation-domain-layer-norm)

Cheng-Kuan Chen, Zhu-Feng Pan, [Ming-Yu Liu](/person/ming-yu-liu), Min Sun



[AAAI](https://aaai.org/Conferences/AAAI-19/)









[Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation](/publication/2019-01_models-matter-so-does-training-empirical-study-cnns-optical-flow-estimation)

Deqing Sun, [Xiaodong Yang](/person/xiaodong-yang), [Ming-Yu Liu](/person/ming-yu-liu), [Jan Kautz](/person/jan-kautz)



[TPAMI](https://ieeexplore.ieee.org/document/8621052)









[A Fusion Approach for Multi-Frame Optical Flow Estimation](/publication/2019-01_fusion-approach-multi-frame-optical-flow-estimation)

Zhile Ren, Orazio Gallo, Deqing Sun, Ming-Hsuan Yang, Erik B. Sudderth, [Jan Kautz](/person/jan-kautz)



[IEEE Winter Conference on Applications of Computer Vision (WACV)](http://wacv19.wacv.net/)









### 2018 

[Localization-Aware Active Learning for Object Detection](/index.php/publication/2018-12_localization-aware-active-learning-object-detection)

Chieh-Chi Kao, Teng-Yok Lee, Pradeep Sen, [Ming-Yu Liu](/index.php/person/ming-yu-liu)



[ACCV](https://arxiv.org/pdf/1801.05124.pdf)









[Context-aware Synthesis and Placement of Object Instances](/index.php/publication/2018-12_context-aware-synthesis-and-placement-object-instances)

Donghoon Lee, [Sifei Liu](/index.php/person/sifei-liu), Jinwei Gu, [Ming-Yu Liu](/index.php/person/ming-yu-liu), Ming-Hsuan Yang, [Jan Kautz](/index.php/person/jan-kautz)



NIPS









[Video-to-Video Synthesis](/publication/2018-12_video-video-synthesis)

[Ting-Chun Wang](/person/ting-chun-wang), [Ming-Yu Liu](/person/ming-yu-liu), Jun-Yan Zhu, Guilin Liu, Andrew Tao, [Jan Kautz](/person/jan-kautz), Bryan Catanzaro



[NIPS](https://arxiv.org/abs/1808.06601)









[Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction](/publication/2018-12_mapping-images-scene-graphs-permutation-invariant-structured-prediction)

Roei Herzig, Moshiko Raboh, [Gal Chechik](/person/gal-chechik), Jonathan Berant, Amir Globerson



[Neural Information Processing Systems (NeurIPS)](http://papers.nips.cc/paper/7951-mapping-images-to-scene-graphs-with-permutation-invariant-structured-prediction)









[Learning towards Minimum Hyperspherical Energy](/publication/2018-12_learning-towards-minimum-hyperspherical-energy)

Weiyang Liu, Rongmei Lin, Zhen Liu, Lixin Liu, [Zhiding Yu](/person/zhiding-yu), Bo Dai, Le Song



[Conference on Neural Information Processing Systems (NeurIPS) 2018](https://nips.cc/Conferences/2018)









[Deep Object Pose Estimation for Semantic Robotic Grasping of Household Objects](/publication/2018-09_deep-object-pose-estimation-semantic-robotic-grasping-household-objects)

[Jonathan Tremblay](/person/jonathan-tremblay), Thang To, Bala Sundaralingam, Yu Xiang, Dieter Fox, [Stan Birchfield](/person/stan-birchfield)



[Conference on Robot Learning (CoRL) 2018](http://www.robot-learning.org/)









[Hand Pose Estimation via Latent 2.5D Heatmap Regression](/publication/2018-09_hand-pose-estimation-latent-25-d-heatmap-regression)

[Umar Iqbal](/person/umar-iqbal), [Pavlo Molchanov](/person/pavlo-molchanov), [Thomas Breuel](/person/thomas-breuel), Juergen Gall, [Jan Kautz](/person/jan-kautz)



ECCV 2018









[Separating Reflection and Transmission Images in the Wild](/index.php/publication/2018-09_separating-reflection-and-transmission-images-wild)

Patrick Wieschollek, Orazio Gallo, [Jinwei Gu](/index.php/person/jinwei-gu), [Jan Kautz](/index.php/person/jan-kautz)



European Conference on Computer Vision (ECCV)









[Tackling 3D ToF Artifacts Through Learning and the FLAT Dataset](/publication/2018-09_tackling-3d-tof-artifacts-through-learning-and-flat-dataset)

Qi Guo, [Iuri Frosio](/person/iuri-frosio), Orazio Gallo, Todd Zickler, [Jan Kautz](/person/jan-kautz)



[ECCV 2018](https://eccv2018.org/)









[Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation](/publication/2018-09_learning-rigidity-dynamic-scenes-moving-camera-3d-motion-field-estimation)

Zhaoyang Lv, Kihwan Kim, Alejandro Troccoli, Deqing Sun, James M. Rehg, [Jan Kautz](/person/jan-kautz)



[European Conference on Computer Vision (ECCV 2018)](https://eccv2018.org/)









[Simultaneous Edge Alignment and Learning](/index.php/publication/2018-09_simultaneous-edge-alignment-and-learning)

[Zhiding Yu](/index.php/person/zhiding-yu), Weiyang Liu, Yang Zou, Chen Feng, Srikumar Ramalingam, B. V. K. Vijaya Kumar, [Jan Kautz](/index.php/person/jan-kautz)



[European Conference on Computer Vision (ECCV) 2018](https://eccv2018.org/)









[Switchable Temporal Propagation Network](/publication/2018-09_switchable-temporal-propagation-network)

[Sifei Liu](/person/sifei-liu), Guangyu Zhong, [Shalini De Mello](/person/shalini-de-mello), [Jinwei Gu](/person/jinwei-gu), Varun Jampani



[European Conference on Computer Vision (ECCV) 2018](http://faculty.ucmerced.edu/mhyang/papers/eccv2018_stpn.pdf)









[Image Inpainting for Irregular Holes Using Partial Convolutions](/publication/2018-09_image-inpainting-irregular-holes-using-partial-convolutions)

Guilin Liu, Fitsum A. Reda, Kevin Shih, [Ting-Chun Wang](/person/ting-chun-wang), Andrew Tao, Bryan Catanzaro



[ECCV](http://openaccess.thecvf.com/ECCV2018.py)









[Domain Adaptation for Semantic Segmentation via Class-Balanced Self-Training](/publication/2018-09_domain-adaptation-semantic-segmentation-class-balanced-self-training)

Yang Zou, [Zhiding Yu](/person/zhiding-yu), B. V. K. Vijaya Kumar, Jinsong Wang



[European Conference on Computer Vision (ECCV) 2018](https://eccv2018.org/)









[HGMR: Hierarchical Gaussian Mixtures for Adaptive 3D Registration](/index.php/publication/2018-09_hgmr-hierarchical-gaussian-mixtures-adaptive-3d-registration)

Ben Eckart, Kihwan Kim, [Jan Kautz](/index.php/person/jan-kautz)



[European Conference on Computer Vision (ECCV 2018)](https://eccv2018.org/)









[A Closed-form Solution to Photorealistic Image Stylization](/publication/2018-09_closed-form-solution-photorealistic-image-stylization)

Yijun Li, [Ming-Yu Liu](/person/ming-yu-liu), Xueting Li, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[ECCV](http://arxiv.org/abs/1802.06474)









[Multimodal Unsupervised Image-to-Image Translation](/publication/2018-09_multimodal-unsupervised-image-image-translation)

Xun Huang, [Ming-Yu Liu](/person/ming-yu-liu), Serge Belongie, [Jan Kautz](/person/jan-kautz)



[ECCV](https://arxiv.org/abs/1804.04732)









[EOE: Expected Overlap Estimation over Unstructured Point Cloud Data](/index.php/publication/2018-09_eoe-expected-overlap-estimation-over-unstructured-point-cloud-data)

Ben Eckart, Kihwan Kim, [Jan Kautz](/index.php/person/jan-kautz)



[International Conference on 3D Vision (3DV) 2018](http://3dv18.uniud.it/)









[3D MRI Brain Tumor Segmentation Using Autoencoder Regularization](/index.php/publication/2018-09_3d-mri-brain-tumor-segmentation-using-autoencoder-regularization)

[Andriy Myronenko](/index.php/person/andriy-myronenko)



[MICCAI, BrainLes, 2018](https://link.springer.com/chapter/10.1007/978-3-030-11726-9_28)









[Superpixel Sampling Networks](/publication/2018-09_superpixel-sampling-networks)

Varun Jampani, Deqing Sun, [Ming-Yu Liu](/person/ming-yu-liu), Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[European Conference on Computer Vision (ECCV), 2018](http://eccv2018.org)









[Noise2Noise: Learning Image Restoration without Clean Data](/publication/2018-07_noise2noise-learning-image-restoration-without-clean-data)

Jaakko Lehtinen, [Jacob Munkberg](/person/jacob-munkberg), [Jon Hasselgren](/person/jon-hasselgren), [Samuli Laine](/person/samuli-laine), [Tero Karras](/person/tero-karras), Miika Aittala, [Timo Aila](/person/timo-aila)



[Proc. ICML 2018](https://icml.cc/)









[Light-weight Head Pose Invariant Gaze Tracking](/publication/2018-06_light-weight-head-pose-invariant-gaze-tracking)

Rajeev Ranjan, [Shalini De Mello](/person/shalini-de-mello), [Jan Kautz](/person/jan-kautz)



[IEEE Computer Vision and Pattern Recognition Workshop (CVPRW) 2018](http://cvpr2018.thecvf.com/program/workshops)



Best Paper (runner up) Workshop on Analysis and Modeling of Faces and Gestures





[Learning Superpixels with Segmentation-Aware Affinity Loss](/index.php/publication/2018-06_learning-superpixels-segmentation-aware-affinity-losse)

Wei-Chih Tu, [Ming-Yu Liu](/index.php/person/ming-yu-liu), Varun Jampani, Deqing Sun, Shao-Yi Chien, Ming-Hsuan Yang, [Jan Kautz](/index.php/person/jan-kautz)



[CVPR](https://ieeexplore.ieee.org/document/8578164)









[MoCoGAN: Decomposing Motion and Content for Video Generation](/publication/2018-06_mocogan-decomposing-motion-and-content-video-generation)

Sergey Tulyakov, [Ming-Yu Liu](/person/ming-yu-liu), [Xiaodong Yang](/person/xiaodong-yang), [Jan Kautz](/person/jan-kautz)



[CVPR](https://arxiv.org/abs/1707.04993)









[Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation](/publication/2018-06_super-slomo-high-quality-estimation-multiple-intermediate-frames-video)

Huaizu Jiang, Deqing Sun, Varun Jampani, Ming-Hsuan Yang, Erik Learned-Miller, [Jan Kautz](/person/jan-kautz)



[CVPR 2018](http://cvpr2018.thecvf.com/)









[Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals](/publication/2018-06_depth-based-3d-hand-pose-estimation-current-achievements-future-goals)

Shanxin Yuan, Guillermo Garcia-Hernando, Bjorn Stenger, [Pavlo Molchanov](/person/pavlo-molchanov), [Jan Kautz](/person/jan-kautz), Sina Honari



[Conference on Computer Vision and Pattern Recognition](http://cvpr2018.thecvf.com)









[Improving Landmark Localization with Semi-Supervised Learning](/publication/2018-06_improving-landmark-localization-semi-supervised-learning)

Sina Honari, [Pavlo Molchanov](/person/pavlo-molchanov), [Stephen Tyree](/person/stephen-tyree), Pascal Vincent, Christopher Pal, [Jan Kautz](/person/jan-kautz)



[CVPR](http://cvpr2018.thecvf.com)









[Geometry-Aware Learning of Maps for Camera Localization](/publication/2018-06_geometry-aware-learning-maps-camera-localization)

Samarth Brahmbhatt, [Jinwei Gu](/person/jinwei-gu), Kihwan Kim, James Hays, [Jan Kautz](/person/jan-kautz)



[CVPR 2018 (Spotlight)](http://cvpr2018.thecvf.com/)









[Falling Things: A Synthetic Dataset for 3D Object Detection and Pose Estimation](/index.php/publication/2018-06_falling-things-synthetic-dataset-3d-object-detection-and-pose-estimation)

[Jonathan Tremblay](/index.php/person/jonathan-tremblay), Thang To, [Stan Birchfield](/index.php/person/stan-birchfield)



[CVPR 2018 Workshop on Real World Challenges and New Benchmarks for Deep Learnin…](https://sites.google.com/view/cvpr2018-robotic-vision)









[Decoupled Networks](/publication/2018-06_decoupled-networks)

Weiyang Liu, Zhen Liu, [Zhiding Yu](/person/zhiding-yu), Bo Dai, Rongmei Lin, Yisen Wang, James M. Rehg, Le Song



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018](http://cvpr2018.thecvf.com/)









[Deep Semantic Face Deblurring](/publication/2018-06_deep-semantic-face-deblurring)

Ziyi Shen, Wei-Sheng Lai, Tingfa Xu, [Jan Kautz](/person/jan-kautz), Ming-Hsuan Yang



[IEEE Computer Vision and Pattern Recognition (CVPR)](http://cvpr2018.thecvf.com)









[High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs](/publication/2018-06_high-resolution-image-synthesis-and-semantic-manipulation-conditional-gans)

[Ting-Chun Wang](/person/ting-chun-wang), [Ming-Yu Liu](/person/ming-yu-liu), Jun-Yan Zhu, Andrew Tao, [Jan Kautz](/person/jan-kautz), Bryan Catanzaro



[CVPR](https://arxiv.org/abs/1711.11585)









[PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume](/publication/2018-06_pwc-net-cnns-optical-flow-using-pyramid-warping-and-cost-volume)

Deqing Sun, [Xiaodong Yang](/person/xiaodong-yang), [Ming-Yu Liu](/person/ming-yu-liu), [Jan Kautz](/person/jan-kautz)



[CVPR](http://cvpr2018.thecvf.com/)









[SPLATNet: Sparse Lattice Networks for Point Cloud Processing](/publication/2018-06_splatnet-sparse-lattice-networks-point-cloud-processing)

Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[CVPR 2018 (oral)](http://cvpr2018.thecvf.com/)









[Learning Strict Identity Mappings in Deep Residual Networks](/publication/2018-06_learning-strict-identity-mappings-deep-residual-networks)

Xin Yu, [Zhiding Yu](/person/zhiding-yu), Srikumar Ramalingam



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018](http://cvpr2018.thecvf.com/)









[Making Convolutional Networks Recurrent for Visual Sequence Learning](/publication/2018-06_making-convolutional-networks-recurrent-visual-sequence-learning)

[Xiaodong Yang](/person/xiaodong-yang), [Pavlo Molchanov](/person/pavlo-molchanov), [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR)](http://xiaodongyang.org/publications/papers/prernn-cvpr18.pdf)









[PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes](/index.php/publication/2018-06_posecnn-convolutional-neural-network-6d-object-pose-estimation-cluttered-scenes)

Yu Xiang, Tanner Schmidt, Venkatraman Narayanan, Dieter Fox



[Robotics: Science and Systems (RSS)](http://www.roboticsconference.org/)









[Synthetically Trained Neural Networks for Learning Human-Readable Plans from Real-World Demonstrations](/index.php/publication/2018-05_synthetically-trained-neural-networks-learning-human-readable-plans-real-world)

[Jonathan Tremblay](/index.php/person/jonathan-tremblay), Thang To, Artem Molchanov, [Stephen Tyree](/index.php/person/stephen-tyree), [Jan Kautz](/index.php/person/jan-kautz), [Stan Birchfield](/index.php/person/stan-birchfield)



[IEEE International Conference on Robotics and Automation (ICRA) 2018](https://icra2018.org/)









[Probabilistic AND-OR Attribute Grouping for Zero-Shot Learning](/publication/2018-05_probabilistic-and-or-attribute-grouping-zero-shot-learning)

[Yuval Atzmon](/person/yuval-atzmon), [Gal Chechik](/person/gal-chechik)



[The Conference on Uncertainty in Artificial Intelligence (UAI 2018)](http://auai.org/uai2018/)









[Reblur2Deblur: Deblurring Videos via Self-Supervised Learning](/publication/2018-05_reblur2deblur-deblurring-videos-self-supervised-learning)

Huaijin Chen, [Jinwei Gu](/person/jinwei-gu), Orazio Gallo, [Ming-Yu Liu](/person/ming-yu-liu), Ashok Veeraraghavan, [Jan Kautz](/person/jan-kautz)



[IEEE International Conference on Computational Photography (ICCP)](http://iccp2018.ece.cmu.edu/)









[Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization](/index.php/publication/2018-04_training-deep-networks-synthetic-data-bridging-reality-gap-domain-randomization)

[Jonathan Tremblay](/index.php/person/jonathan-tremblay), Aayush Prakash, David Acuna, Mark Brophy, Varun Jampani, Cem Anil, Thang To, Eric Cameracci, Shaad Boochoon, [Stan Birchfield](/index.php/person/stan-birchfield)



[CVPR 2018 Workshop on Autonomous Driving](http://www.wad.ai/)









[IamNN: Iterative and Adaptive Mobile Neural Network for Efficient Image Classification](/publication/2018-04_iamnn-iterative-and-adaptive-mobile-neural-network-efficient-image)

Sam Leroux, [Pavlo Molchanov](/person/pavlo-molchanov), Pieter Simoens, Bart Dhoedt, [Thomas Breuel](/person/thomas-breuel), [Jan Kautz](/person/jan-kautz)



[International Conference on Learning Representations, Workshop](https://iclr.cc)









[On the Importance of Stereo for Accurate Depth Estimation: An Efficient Semi-Supervised Deep Neural Network Approach](/publication/2018-04_importance-stereo-accurate-depth-estimation-efficient-semi-supervised-deep)

Nikolai Smolyanskiy, Alexey Kamenev, [Stan Birchfield](/person/stan-birchfield)



[CVPR 2018 Workshop on Autonomous Driving](http://www.wad.ai)









### 2017 

[On Nearest Neighbors in Non Local Means Denoising](/publication/2017-12_nearest-neighbors-non-local-means-denoising)

[Iuri Frosio](/person/iuri-frosio), [Jan Kautz](/person/jan-kautz)



[Neural Information Processing Systems (NIPS) 2017 Workshop on Nearest Neighbors…](https://nn2017.mit.edu/)









[Sim-to-Real Transfer of Accurate Grasping with Eye-In-Hand Observations and Continuous Control](/index.php/publication/2017-12_sim-real-transfer-accurate-grasping-eye-hand-observations-and-continuous)

Mengyuan Yan, [Iuri Frosio](/index.php/person/iuri-frosio), [Stephen Tyree](/index.php/person/stephen-tyree), [Jan Kautz](/index.php/person/jan-kautz)



[NIPS 2017 Workshop on Acting and Interacting in the Real World: Challenges in …](https://sites.google.com/view/nips17robotlearning/home)









[Unsupervised Image-to-Image Translation Networks](/publication/2017-12_unsupervised-image-image-translation-networks)

[Ming-Yu Liu](/person/ming-yu-liu), [Thomas Breuel](/person/thomas-breuel), [Jan Kautz](/person/jan-kautz)



[NIPS](https://nips.cc/)









[Learning Affinity via Spatial Propagation Networks](/publication/2017-12_learning-affinity-spatial-propagation-networks)

[Sifei Liu](/person/sifei-liu), [Shalini De Mello](/person/shalini-de-mello), [Jinwei Gu](/person/jinwei-gu), Guangyu Zhong, Ming-Hsuan Yang, [Jan Kautz](/person/jan-kautz)



[Conference on Neural Information Processing Systems (NIPS) 2017](https://nips.cc/)









[Learning to Super-Resolve Blurry Face and Text Images](/publication/2017-10_learning-super-resolve-blurry-face-and-text-images)

Xiangyu Xu, Deqing Sun, Jinshan Pan, Yujin Zhang, Hanspeter Pfister, Ming-Hsuan Yang



International Conference on Computer Vision









[A Lightweight Approach for On-the-Fly Reflectance Estimation](/publication/2017-10_lightweight-approach-fly-reflectance-estimation)

Kihwan Kim, [Jinwei Gu](/person/jinwei-gu), [Stephen Tyree](/person/stephen-tyree), [Pavlo Molchanov](/person/pavlo-molchanov), Matthias Nießner, [Jan Kautz](/person/jan-kautz)



[IEEE International Conference on Computer Vision (ICCV 2017)](http://iccv2017.thecvf.com/)









[Semantic Video CNNs through Representation Warping](/index.php/publication/2017-10_semantic-video-cnns-through-representation-warping)

Raghudeep Gadde, Varun Jampani, Peter V. Gehler



[International Conference on Computer Vision (ICCV'17)](http://iccv2017.thecvf.com)









[Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting](/publication/2017-10_intrinsic3d-high-quality-3d-reconstruction-joint-appearance-and-geometry)

Robert Maier, Kihwan Kim, Daniel Cremers, [Jan Kautz](/person/jan-kautz), Matthias Nießner



[IEEE International Conference on Computer Vision (ICCV 2017)](http://iccv2017.thecvf.com/)









[Multiframe Scene Flow with Piecewise Rigid Motion](/index.php/publication/2017-10_multiframe-scene-flow-piecewise-rigid-motion)

Vladislav Golyanik, Kihwan Kim, Robert Maier, Matthias Nießner, Didier Stricker, [Jan Kautz](/index.php/person/jan-kautz)



[IEEE International Conference on 3D Vision (3DV 2017)](http://www.3dv.org)









[Cascaded Scene Flow Prediction using Semantic Segmentation](/index.php/publication/2017-10_cascaded-scene-flow-prediction-using-semantic-segmentation)

Zhile Ren, Deqing Sun, [Jan Kautz](/index.php/person/jan-kautz), Erik B. Sudderth



International Conference on 3D Vision









[Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness](/index.php/publication/2017-09_toward-low-flying-autonomous-mav-trail-navigation-using-deep-neural-networks)

Nikolai Smolyanskiy, Alexey Kamenev, Jeffrey Smith, [Stan Birchfield](/index.php/person/stan-birchfield)



[IROS 2017](https://www.iros2017.org/)









[Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network](/index.php/publication/2017-07_dynamic-facial-analysis-bayesian-filtering-recurrent-neural-network)

[Jinwei Gu](/index.php/person/jinwei-gu), [Xiaodong Yang](/index.php/person/xiaodong-yang), [Shalini De Mello](/index.php/person/shalini-de-mello), [Jan Kautz](/index.php/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017](http://cvpr2017.thecvf.com/)









[Deep 360 Pilot: Learning a Deep Agent for Piloting through 360 Sports Videos](/index.php/publication/2017-07_deep-360-pilot-learning-deep-agent-piloting-through-360-sports-videos)

Hou-Ning Hu, Yen-Chen Lin, [Ming-Yu Liu](/index.php/person/ming-yu-liu), Hsien-Tzu Cheng, Yung-Ju Chang, Min Sun



[CVPR](http://cvpr2017.thecvf.com/)









[Production-Level Facial Performance Capture Using Deep Convolutional Neural Networks](/index.php/publication/2017-07_production-level-facial-performance-capture-using-deep-convolutional-neural)

[Samuli Laine](/index.php/person/samuli-laine), [Tero Karras](/index.php/person/tero-karras), [Timo Aila](/index.php/person/timo-aila), Antti Herva, Shunsuke Saito, Ronald Yu, Hao Li, Jaakko Lehtinen



[Symposium on Computer Animation 2017](http://sca17.cs.columbia.edu/index.html)









[Polarimetric Multi-view Stereo](/publication/2017-07_polarimetric-multi-view-stereo)

Zhaopeng Cui, [Jinwei Gu](/person/jinwei-gu), Boxin Shi, Ping Tan, [Jan Kautz](/person/jan-kautz)



[IEEE CVPR 2017](http://cvpr2017.thecvf.com/)









[Reconstructing Intensity Images from Binary Spatial Gradient Cameras](/index.php/publication/2017-07_reconstructing-intensity-images-binary-spatial-gradient-cameras)

Suren Jayasuriya, Orazio Gallo, [Jinwei Gu](/index.php/person/jinwei-gu), [Timo Aila](/index.php/person/timo-aila), [Jan Kautz](/index.php/person/jan-kautz)



[IEEE Workshop on Embedded Vision (CVPR)](http://cvisioncentral.com/promotion/evw2017/)









[Computational Zoom: A Framework for Post-Capture Image Composition](/index.php/publication/2017-07_computational-zoom-framework-post-capture-image-composition)

Abhishek Badki, Orazio Gallo, [Jan Kautz](/index.php/person/jan-kautz), Pradeep Sen



[ACM SIGGRAPH](http://dl.acm.org/citation.cfm?id=J778&CFID=934261544&CFTOKEN=17503555)









[Context-aware Captions from Context-agnostic Supervision](/publication/2017-04_context-aware-captions-context-agnostic-supervision)

Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, [Gal Chechik](/person/gal-chechik)



[Computer Vision and Pattern Recognition](https://arxiv.org/abs/1701.02870)









[Learning From Noisy Large-Scale Datasets With Minimal Supervision](/publication/2017-04_learning-noisy-large-scale-datasets-minimal-supervision)

Andreas Veit, Neil Alldrin, [Gal Chechik](/person/gal-chechik), Ivan Krasin, Abhinav Gupta, Serge Belongie



[Computer Vision and Pattern Recognition](https://arxiv.org/abs/1701.01619)









### 2016 

[A Patch Memory System For Image Processing and Computer Vision](/publication/2016-10_patch-memory-system-image-processing-and-computer-vision)

[Jason Clemons](/person/jason-clemons), Chih-Chi Cheng, [Iuri Frosio](/person/iuri-frosio), Daniel Johnson, [Steve Keckler](/person/stephen-keckler)



[International Symposium on Microarchitecture (MICRO)](https://ieeexplore.ieee.org/document/7783754)









[Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification](/publication/2016-10_multilayer-and-multimodal-fusion-deep-neural-networks-video-classification)

[Xiaodong Yang](/person/xiaodong-yang), [Pavlo Molchanov](/person/pavlo-molchanov), [Jan Kautz](/person/jan-kautz)



[ACM Multimedia](http://www.acmmm.org/2016/)









[Learning to generalize to new compositions in image understanding](/publication/2016-08_learning-generalize-new-compositions-image-understanding)

[Yuval Atzmon](/person/yuval-atzmon), Jonathan Berant, Vahid Kezami, Amir Globerson, [Gal Chechik](/person/gal-chechik)



[Arxiv](https://arxiv.org/abs/1608.07639)









[Reflectance Modeling by Neural Texture Synthesis](/publication/2016-07_reflectance-modeling-neural-texture-synthesis)

Miika Aittala, [Timo Aila](/person/timo-aila), [Jaakko Lehtinen](/person/jaakko-lehtinen)



[ACM Transactions on Graphics 35(4) (proc. SIGGRAPH 2016)](http://dl.acm.org/sig.cfm?id=SP932)









[Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks](/publication/2016-06_online-detection-and-classification-dynamic-hand-gestures-recurrent-3d)

[Pavlo Molchanov](/person/pavlo-molchanov), [Xiaodong Yang](/person/xiaodong-yang), [Shalini Gupta](/person/shalini-de-mello), Kihwan Kim, [Stephen Tyree](/person/stephen-tyree), [Jan Kautz](/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016](http://cvpr2016.thecvf.com/)









[Accelerated Generative Models for 3D Point Cloud Data](/index.php/publication/2016-06_accelerated-generative-models-3d-point-cloud-data)

Ben Eckart, Kihwan Kim, Alejandro Troccoli, Alonzo Kelly, [Jan Kautz](/index.php/person/jan-kautz)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016](http://cvpr2016.thecvf.com/)









### 2015 

[Robust Model-based 3D Head Pose Estimation](/index.php/publication/2015-12_robust-model-based-3d-head-pose-estimation)

Gregory P Meyer, [Shalini Gupta](/index.php/person/shalini-de-mello), [Iuri Frosio](/index.php/person/iuri-frosio), Dikpal Reddy, [Jan Kautz](/index.php/person/jan-kautz)



[IEEE International Conference on Computer Vision (ICCV) 2015](http://pamitc.org/iccv15/)









[MLMD: Maximum Likelihood Mixture Decoupling for Fast and Accurate Point Cloud Registration](/index.php/publication/2015-10_mlmd-maximum-likelihood-mixture-decoupling-fast-and-accurate-point-cloud)

Ben Eckart, Kihwan Kim, Alejandro Troccoli, Alonzo Kelly, [Jan Kautz](/index.php/person/jan-kautz)



[IEEE International Conference on 3D Vision (3DV2015)](http://www.3dv.org/)









[Hand Gesture Recognition with 3D Convolutional Neural Networks](/publication/2015-06_hand-gesture-recognition-3d-convolutional-neural-networks)

[Pavlo Molchanov](/person/pavlo-molchanov), [Shalini Gupta](/person/shalini-de-mello), Kihwan Kim, [Jan Kautz](/person/jan-kautz)



[IEEE Computer Vision and Pattern Recognition Workshop (CVPRW) 2015](http://www.pamitc.org/cvpr15/)



Winner (1st place) Hand Gesture Recognition Challenge





[Retrieving Gray-Level Information from a Binary Sensor and its Application to Gesture Detection](/index.php/publication/2015-06_retrieving-gray-level-information-binary-sensor-and-its-application-gesture)

Orazio Gallo, [Iuri Frosio](/index.php/person/iuri-frosio), Leonardo Gasparini, Kari Pulli, Massimo Gottardi



[IEEE Computer Vision and Pattern Recognition (CVPR 2015), Embedded Vision Works…](http://www.pamitc.org/cvpr15/)









[Filtering Environment Illumination for Interactive Physically-Based Rendering in Mixed Reality](/publication/2015-06_filtering-environment-illumination-interactive-physically-based-rendering-mixed)

Soham Uday Mehta, Kihwan Kim, Dawid Pająk, Kari Pulli, [Jan Kautz](/person/jan-kautz), Ravi Ramamoorthi



[Eurographics Symposium on Rendering (EGSR 2015)](http://egsr2015.gcc.tu-darmstadt.de/)









[Camera Re-calibration after Zooming based on Sets of Conics](/publication/2015-05_camera-re-calibration-after-zooming-based-sets-conics)

[Iuri Frosio](/person/iuri-frosio), Cristina Turrini, Alberto Alzati



[The Visual Computer](http://link.springer.com/article/10.1007%2Fs00371-015-1089-8)









[Adaptive Segmentation based on a Learned Quality Metric](/publication/2015-03_adaptive-segmentation-based-learned-quality-metric)

[Iuri Frosio](/person/iuri-frosio), Ed Ratner



[Proceedings of the 10th International Conference on Computer Vision Theory and …](http://www.visapp.visigrapp.org/)









### 2014 

[DT-SLAM: Deferred Triangulation for Robust SLAM](/publication/2014-12_dt-slam-deferred-triangulation-robust-slam)

Daniel Herrera C., Kihwan Kim, Juho Kannala, Kari Pulli, Janne Heikkilä



[IEEE International Conference on 3D Vision (3DV)](http://www.3dimpvt.org/)









[Addressing System-Level Optimization with OpenVX Graphs](/index.php/publication/2014-06_addressing-system-level-optimization-openvx-graphs)

Erik Rainey, Jesse Villareal, Goksel Dedeoglu, Kari Pulli, Thierry Lepley, Frank Brill



[10th IEEE Embedded Vision Workshop](http://www.computervisioncentral.com/evw2014)









### 2013 

[WYSIWYG Computational Photography via Viewfinder Editing](/index.php/publication/2013-11_wysiwyg-computational-photography-viewfinder-editing)

Jongmin Baek, Dawid Pająk, Kihwan Kim, Kari Pulli, Marc Levoy



[Proc. ACM SIGGRAPH Asia](http://sa2013.siggraph.org/en/)









[An Energy Efficient Time-sharing Pyramid Pipeline for Multi-resolution Computer Vision](/publication/2013-10_energy-efficient-time-sharing-pyramid-pipeline-multi-resolution-computer-vision)

Qiuling Zhu, Navjot Garg, Yun-Ta Tsai, Kari Pulli



[VLSI-SOC](http://vlsisoc2013.ozyegin.edu.tr/)









[Practical SVBRDF Capture in the Frequency Domain](/index.php/publication/2013-07_practical-svbrdf-capture-frequency-domain)

Miika Aittala, Tim Weyrich, [Jaakko Lehtinen](/index.php/person/jaakko-lehtinen)



[ACM Transactions on Graphics (Proc. SIGGRAPH 2013)](http://dx.doi.org/10.1145/2461912.2461978)









### 2012 

[Detecting Regions of Interest in Dynamic Scenes with Camera Motions](/publication/2012-06_detecting-regions-interest-dynamic-scenes-camera-motions)

Kihwan Kim, Dongryeol Lee, Irfan Essa



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2012](http://www.cvpr2012.org/)









[Robust Stereo with Flash and No-flash Image Pairs](/index.php/publication/2012-06_robust-stereo-flash-and-no-flash-image-pairs)

Changyin Zhou, Alejandro Troccoli, Kari Pulli



[CVPR 2012](http://www.cvpr2012.org/)









[Realtime Computer Vision with OpenCV](/publication/2012-06_realtime-computer-vision-opencv)

Kari Pulli, Anatoly Baksheev, Kirill Kornyakov, Victor Eruhimov



[Communications of the ACM](http://cacm.acm.org/magazines/2012/6/149789-real-time-computer-vision-with-opencv/fulltext)









### 2011 

[Gaussian Process Regression Flow for Analysis of Motion Trajectories](/index.php/publication/gaussian-process-regression-flow-analysis-motion-trajectories)

Kihwan Kim, Dongryeol Lee, Irfan Essa



[IEEE International Conference on Computer Vision (ICCV) 2011](http://www.iccv2011.org/)









### 2010 

[Point Set Registration: Coherent Point Drift](/index.php/publication/2010-12_point-set-registration-coherent-point-drift)

[Andriy Myronenko](/index.php/person/andriy-myronenko), Xubo Song



[PAMI 2010](https://ieeexplore.ieee.org/document/5432191)









[VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation](/publication/_vila-u-unified-foundation-model-integrating-visual-understanding-and-generation)















 

 



 ### Researchers

 

[Adithya Murali](/index.php/person/adithya-murali)



[Alexander Trevithick](/person/alexander-trevithick)



[Amrita Mazumdar](/index.php/person/amrita-mazumdar)



[Andriy Myronenko](/person/andriy-myronenko)



[Ankit Goyal](/index.php/person/ankit-goyal)



[Arash Vahdat](/person/arash-vahdat)



[Balakumar Sundaralingam](/person/balakumar-sundaralingam)



[Benjamin Eckart](/person/ben-eckart)



[Boris Ivanovic](/person/boris-ivanovic)



[Bowen Wen](/person/bowen-wen)



[Can Zhao](/person/can-zhao)



[Charles Loop](/person/charles-loop)



[Chen-Hsuan Lin](/person/chen-hsuan-lin)



[Cheng Sun](/person/cheng-sun)



[Chi-Pin Huang](/person/chi-pin-huang)



[Chia-Wen Kuo](/person/chia-wen-kuo)



[Daguang Xu](/person/daguang-xu)



[Dvir Samuel](/person/dvir-samuel)



[Ekta Prashnani](/person/ekta-prashnani)



[Enze Xie](/person/enze-xie)



[Frank Wang](/person/frank-wang)



[Fred Yang](/person/fred-yang)



[Haggai Maron](/person/haggai-maron)



[Hanrong Ye](/person/hanrong-ye)



[Hao Zhang](/person/hao-zhang)



[Haotian Zhang](/person/haotian-zhang)



[Haoyu Yang](/person/haoyu-yang)



[Heng Yang](/person/heng-yang)



[Hongxu Danny Yin](/person/danny-yin)



[Hugo Hadfield](/index.php/person/hugo-hadfield)



[Iuri Frosio](/person/iuri-frosio)



[Jacob Munkberg](/person/jacob-munkberg)



[Jaesung Choe](/person/jaesung-choe)



[Jason Stock](/person/jason-stock)



[Jean Kossaifi](/index.php/person/jean-kossaifi)



[Jiaojiao Fan](/person/jiaojiao-fan)



[Jiaxiang Tang](/person/jiaxiang-tang)



[Jiefeng Li](/person/jiefeng-li)



[Linxi "Jim" Fan](/person/linxi-jim-fan)



[Jimmy Wu](/person/jimmy-wu)



[Jindong Jiang](/person/jindong-jiang)



[Jinwei Gu](/person/jinwei-gu)



[Joohwan Kim](/person/joohwan-kim)



[Kaichun Mo](/person/kaichun-mo)



[Koki Nagano](/person/koki-nagano)



[Loic Magne](/person/loic-magne)



[Max Zhaoshuo Li](/person/max-zhaoshuo-li)



[Merlin Nimier-David](/person/merlin-nimier-david)



[Michael Stengel](/index.php/person/michael-stengel)



[Min-Hung Chen](/person/min-hung-chen)



[Ming-Yu Liu](/person/ming-yu-liu)



[Omer Shapira](/index.php/person/omer-shapira)



[Pavlo Molchanov](/person/pavlo-molchanov)



[Peter Kocsis](/person/peter-kocsis)



[Prithvijit Chattopadhyay](/person/prithvijit-chattopadhyay)



[Qianli Ma](/person/qianli-ma)



[Ravi Ramamoorthi](/index.php/person/ravi-ramamoorthi)



[Runyu Ding](/person/runyu-ding)



[Ruth Rosenholtz](/index.php/person/ruth-rosenholtz)



[Ryo Hachiuma](/person/ryo-hachiuma)



[Sai Bangaru](/index.php/person/sai-bangaru)



[Sameer Dharur](/person/sameer-dharur)



[Samuli Laine](/person/samuli-laine)



[Sanja Fidler](/person/sanja-fidler)



[Scott Reed](/person/scott-reed)



[Seonwook Park](/index.php/person/seonwook-park)



[Seungjun Nah](/person/seungjun-nah)



[Shalini De Mello](/person/shalini-de-mello)



[Shengze Wang](/person/shengze-wang)



[Song Han](/person/song-han)



[Stan Birchfield](/index.php/person/stan-birchfield)



[Stephen Tyree](/person/stephen-tyree)



[Steve Keckler](/person/stephen-keckler)



[Steve Marschner](/index.php/person/steve-marschner)



[Thomas Breuel](/index.php/person/thomas-breuel)



[Thomas Müller](/person/thomas-muller)



[Tianye Li](/index.php/person/tianye-li)



[Tianyi Xie](/person/tianyi-xie)



[Timo Aila](/person/timo-aila)



[Tsung-Yi Lin](/person/tsung-yi-lin)



[Tucker Hermans](/person/tucker-hermans)



[Umar Iqbal](/person/umar-iqbal)



[Valts Blukis](/index.php/person/valts-blukis)



[Vinu Joseph](/person/vinu-joseph)



[Wenjie Luo](/person/wenjie-luo)



[Wonmin Byeon](/index.php/person/wonmin-byeon)



[Xiangyu Chen](/person/xiangyu-chen)



[Xiaodong Yang](/person/xiaodong-yang)



[Xin Kong](/person/xin-kong)



[Xinshuo Weng](/person/xinshuo-weng)



[Xuan Li](/person/xuan-li)



[Xueting Li](/person/xueting-li)



[Yatian Pang](/person/yatian-pang)



[Ye Yuan](/person/ye-yuan)



[Yecheng Wu](/index.php/person/yecheng-wu)



[Yin Cui](/person/yin-cui)



[Yinzhen Xu](/person/yinzhen-xu)



[Yoad Tewel](/person/yoad-tewel)



[Yogesh Balaji](/person/yogesh-balaji)



[Yoni Kasten](/person/yoni-kasten)



[Yu Zeng](/person/yu-zeng)



[Yu-Wei Chao](/person/yu-wei-chao)



[Yuchao Gu](/person/yuchao-gu)



[Yue Wang](/person/yue-wang)



[Yufei Ye](/person/yufei-ye)



[Yukang Chen](/person/yukang-chen)



[Yuke Zhu](/index.php/person/yuke-zhu)



[Yusuke Hirota](/person/yusuke-hirota)



[Yuval Atzmon](/person/yuval-atzmon)



[Yuyang Zhao](/person/yuyang-zhao)



[Zekun Hao](/person/zekun-hao)



[Zhengyi Luo](/person/zhengyi-luo)



[Zhiding Yu](/person/zhiding-yu)



[Zhijian Liu](/person/zhijian-liu)



[Zhiqi Li](/index.php/person/zhiqi-li)



[Ziyue Xu](/person/ziyue-xu)