  ## Generative AI

 ### Associated Publications

 

### 2026 

[GalaxyDiT: Efficient Video Generation with Guidance Alignment and Adaptive Proxy in Diffusion Transformers](/publication/2026-07_galaxydit-efficient-video-generation-guidance-alignment-and-adaptive-proxy)

Zoey Song, [Steve Dai](/person/steve-dai), [Ben Keller](/person/ben-keller), [Brucek Khailany](/person/brucek-khailany)



[DAC 2026](https://dac.com/2026)









[Editing Physiological Signals in Videos Using Latent Representations](/publication/2026-06_editing-physiological-signals-videos-using-latent-representations)

Tianwen Zhou, Akshay Paruchuri, [Josef Spjut](/person/josef-spjut), Kaan Akşit



[CVPR Workshop on Subtle Visual Computing](https://sites.google.com/view/svc-cvpr26)









[QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding](/publication/2026-04_qcaleval-benchmarking-vision-language-models-quantum-calibration-plot)

Shuxiang Cao, Zijian Zhang, Abhishek Agarwal, Grace Bratrud, Niyaz R. Beysengulov, Daniel C. Cole, Alejandro Gomez Frieiro, Elena O. Glen, Hao Hsu, Gang Huang, Raymond Jow, Greshma Shaji, Tom Lubowe, [Ligeng Zhu](/person/ligeng-zhu), Luis Mantilla Calderon, Nicola Pancotti, Joel Pendleton, Brandon Severin, Charles Etienne Staub, Sara Sussman, Antti Vepsäläinen, Neel Rajeshbhai Vora, Yilun Xu, Varinia Bernales, Daniel Bowring, Elica Kyoseva, Ivan Rungger, Giulia Semeghini, Sam Stanwyck, Timothy Costa, [Alán Aspuru-Guzik](/person/alan-aspuru-guzik), Krysta Svore













[3D-GENERALIST: Vision-Language-Action Models for Crafting 3D Worlds](/publication/2026-03_3d-generalist-vision-language-action-models-crafting-3d-worlds)

Fan-Yun Sun, Shengguang Wu, Christian Jacobsen, Thomas Yim, Haoming Zou, [Alex Zook](/person/alex-zook), Shangru Li, Yu-Hsin Chou, Ethem Can, Xunlei Wu, Clemens Eppner, [Valts Blukis](/person/valts-blukis), [Jonathan Tremblay](/person/jonathan-tremblay), Jiajun Wu, [Stan Birchfield](/person/stan-birchfield), Nick Haber



[International Conference on 3D Vision 2026](https://3dvconf.github.io/2026/)









[CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language](/publication/2026-03_crocodil-continuous-and-robust-conditioned-diffusion-language)

Roy Uziel, Omer Belhasin, Itay Levy, Akhiad Bercovich, Ran El-Yaniv, Ran Zilberstein, Michael Elad



[Arxiv](https://arxiv.org/abs/2603.20210)









[Learn from Your Mistakes: Self-Correcting Masked Diffusion Models](/publication/2026-02_learn-your-mistakes-self-correcting-masked-diffusion-models)

Yair Schiff, Omer Belhasin, Roy Uziel, Guanghan Wang, Marianne Arriola, Gilad Turok, Michael Elad, Volodymyr Kuleshov













[Proteina-Complexa: Scaling Atomistic Protein Binder Design with Generative Pretraining and Test-Time Compute](/publication/2026-01_proteina-complexa-scaling-atomistic-protein-binder-design-generative)

[Kieran Didi](/person/kieran-didi), Zuobai Zhang, Guoqing Zhou, Danny Reidenbach, Zhonglin Cao, Sooyoung Cha, [Tomas Geffner](/person/tomas-geffner), Christian Dallago, Jian Tang, Michael M. Bronstein, Martin Steinegger, Emine Kucukbenli, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[International Conference on Learning Representations (ICLR) 2026 (Oral)](https://arxiv.org/abs/2603.27950)









[Exploring Synthesizable Chemical Space with Iterative Pathway Refinements](/publication/2026-01_exploring-synthesizable-chemical-space-iterative-pathway-refinements)

Seul Lee, [Karsten Kreis](/person/karsten-kreis), Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Saee Paliwal, Weili Nie, [Arash Vahdat](/person/arash-vahdat)



[International Conference on Learning Representations (ICLR) 2026 (Oral)](https://arxiv.org/abs/2509.16084)









[Demystifying Data-Driven Probabilistic Medium-Range Weather Forecasting](/publication/2026-01_demystifying-data-driven-probabilistic-medium-range-weather-forecasting)

[Jean Kossaifi](/person/jean-kossaifi), [Nikola Kovachki](/person/nikola-kovachki), [Morteza Mardani](/person/morteza-mardani), [Daniel Leibovici](/person/daniel-leibovici), Suman Ravuri, Ira Shokar, Edoardo Calvello, Mohammad Shoaib Abbas, Peter Harrington, Ashay Subramaniam, [Noah Brenowitz](/person/noah-brenowitz), [Boris Bonev](/person/boris-bonev), [Wonmin Byeon](/person/wonmin-byeon), [Karsten Kreis](/person/karsten-kreis), [Dale Durran](/person/dale-durran), [Arash Vahdat](/person/arash-vahdat), [Mike Pritchard](/person/mike-pritchard), [Jan Kautz](/person/jan-kautz)













[La-Proteina: Atomistic Protein Generation via Partially Latent Flow Matching](/publication/2026-01_la-proteina-atomistic-protein-generation-partially-latent-flow-matching)

[Tomas Geffner](/person/tomas-geffner), [Kieran Didi](/person/kieran-didi), Zhonglin Cao, Danny Reidenbach, Zuobai Zhang, Christian Dallago, Emine Kucukbenli, [Karsten Kreis](/person/karsten-kreis), [Arash Vahdat](/person/arash-vahdat)



[International Conference on Learning Representations (ICLR) 2026](https://arxiv.org/abs/2507.09466)









### 2025 

[Beyond Behavior Cloning in Autonomous Driving: a Survey of Closed-Loop Training Techniques](/publication/2025-12_beyond-behavior-cloning-autonomous-driving-survey-closed-loop-training)

[Peter Karkus](/person/peter-karkus), [Maximilian Igl](/person/maximilian-igl), [Yuxiao Chen](/person/yuxiao-chen), Kashyap Chitta, Jef Packer, [Bertrand Douillard](/person/bertrand-douillard), [Thomas Tian](/person/thomas-tian), Alexander Naumann, Guillermo Garcia-Cobo, Shuhan Tan, [Alperen Degirmenci](/person/alperen-degirmenci), Alexander Popov, Nikolai Smolyanskiy, Urs Muller, [Boris Ivanovic](/person/boris-ivanovic), [Marco Pavone](/person/marco-pavone)













[Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting](/publication/2025-12_elucidated-rolling-diffusion-models-probabilistic-weather-forecasting)

Salva Rühling Cachay, [Miika Aittala](/person/miika-aittala), [Karsten Kreis](/person/karsten-kreis), [Noah Brenowitz](/person/noah-brenowitz), [Arash Vahdat](/person/arash-vahdat), [Morteza Mardani](/person/morteza-mardani), Rose Yu



[Neural Information Processing Systems (NeurIPS) 2025](https://arxiv.org/abs/2506.20024)









[ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning](/publication/2025-12_thinkact-vision-language-action-reasoning-reinforced-visual-latent-planning)

[Chi-Pin Huang](/person/chi-pin-huang), Yueh-Hua Wu, [Min-Hung Chen](/person/min-hung-chen), [Frank Wang](/person/frank-wang), [Fred Yang](/person/fred-yang)



[Neural Information Processing Systems (NeurIPS) 2025](https://arxiv.org/pdf/2507.16815)









[Align Your Flow: Scaling Continuous-Time Flow Map Distillation](/publication/2025-12_align-your-flow-scaling-continuous-time-flow-map-distillation)

Amirmojtaba Sabour, [Sanja Fidler](/person/sanja-fidler), [Karsten Kreis](/person/karsten-kreis)



[Neural Information Processing Systems (NeurIPS) 2025](https://arxiv.org/abs/2506.14603)









[Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation](/publication/2025-11_seeing-what-matters-generalizable-ai-generated-video-detection-forensic)

Riccardo Corvi, Davide Cozzolino, [Ekta Prashnani](/person/ekta-prashnani), [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano), Luisa Verdoliva



[Advances in Neural Information Processing Systems (NeurIPS) 2025](https://neurips.cc/virtual/2025/loc/san-diego/poster/117010)









[Alpamayo 1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail](/publication/2025-10_alpamayo-r1)

[Marco Pavone](/person/marco-pavone), Many other contributors found on Page 33













[VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations](/publication/2025-08_voicenong-robust-high-quality-speech-editing-model-without-hallucinations)

[Sung-Feng Huang](/person/sung-feng-huang), Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Pin-Jui Ku, Ante Jukić, [Huck Yang](/person/huck-yang), Yu Tsao, [Frank Wang](/person/frank-wang), Hung-yi Lee, [Szu-Wei Fu](/person/szu-wei-fu)



[Interspeech 2025](https://www.interspeech2025.org/home)









[Assessing Learned Models for Phase-only Hologram Compression](/publication/2025-08_assessing-learned-models-phase-only-hologram-compression)

Zicong Peng, Yicheng Zhan, [Josef Spjut](/person/josef-spjut), Kaan Akşit



[SIGGRAPH 2025 Posters](https://dl.acm.org/doi/10.1145/3721250.3742993)









[GAIA: Generative Animatable Interactive Avatars with Expression-conditioned Gaussians](/publication/2025-08_gaia-generative-animatable-interactive-avatars-expression-conditioned-gaussians)

Zhengming Yu, [Tianye Li](/person/tianye-li), Jingxiang Sun, [Omer Shapira](/person/omer-shapira), [Seonwook Park](/person/seonwook-park), [Michael Stengel](/person/michael-stengel), Matthew Chan, Xin Li, Wenping Wang, [Koki Nagano](/person/koki-nagano), [Shalini De Mello](/person/shalini-de-mello)



[ACM SIGGRAPH 2025](https://dl.acm.org/doi/10.1145/3721238.3730737)









[MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss](/publication/2025-08_maisi-v2-accelerated-3d-high-resolution-medical-image-synthesis-rectified-flow)

[Can Zhao](/person/can-zhao), Pengfei Guo, [Dong Yang](/person/dong-yang), Yucheng Tang, [Yufan He](/person/yufan-he), Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, [Daguang Xu](/person/daguang-xu)



[AAAI 2026](https://arxiv.org/abs/2508.05772)









[Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models](/publication/2025-08_fly-fail-fix-iterative-game-repair-reinforcement-learning-and-large-multimodal)

[Alex Zook](/person/alex-zook), [Josef Spjut](/person/josef-spjut), [Jonathan Tremblay](/person/jonathan-tremblay)



[Reinforcement Learning and Video Games Workshop @ RLC 2025](https://sites.google.com/view/rlvg-workshop-2025/home)









[Identity-Motion Trade-offs in Text-to-Video Generation](/publication/2025-07_identity-motion-trade-offs-text-video-generation)

[Yuval Atzmon](/person/yuval-atzmon), Rinon Gal, [Yoad Tewel](/person/yoad-tewel), [Yoni Kasten](/person/yoni-kasten), [Gal Chechik](/person/gal-chechik)



[BMVC 2025](https://bmvc2025.bmva.org/proceedings/159/)









[FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale](/publication/2025-07_fourcastnet-3-geometric-approach-probabilistic-machine-learning-weather)

[Boris Bonev](/person/boris-bonev), Thorsten Kurth, Ankur Mahesh, Mauro Bisson, [Jean Kossaifi](/person/jean-kossaifi), Karthik Kashinath, Anima Anandkumar, William D. Collins, [Mike Pritchard](/person/mike-pritchard), [Alex Keller](/person/alex-keller)













[GenMol: A Drug Discovery Generalist with Discrete Diffusion](/publication/2025-07_genmol-drug-discovery-generalist-discrete-diffusion)

Seul Lee, [Karsten Kreis](/person/karsten-kreis), Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Yuxing Peng, Saee Paliwal, Weili Nie, [Arash Vahdat](/person/arash-vahdat)



[International Conference on Machine Learning (ICML) 2025](https://arxiv.org/abs/2501.06158)









[Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow](/publication/2025-07_efficient-molecular-conformer-generation-so3-averaged-flow-matching-and-reflow)

Zhonglin Cao, Mario Geiger, Allan Dos Santos Costa, Danny Reidenbach, [Karsten Kreis](/person/karsten-kreis), [Tomas Geffner](/person/tomas-geffner), Franco Pellegrini, Guoqing Zhou, Emine Kucukbenli



[International Conference on Machine Learning (ICML) 2025](https://arxiv.org/abs/2507.09785)









[Score-based Diffusion Models in Function Space](/publication/2025-07_score-based-diffusion-models-function-space)

Jae Hyun Lim, [Nikola Kovachki](/person/nikola-kovachki), Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, [Jean Kossaifi](/person/jean-kossaifi), Vikram Voleti, Jiaming Song, [Karsten Kreis](/person/karsten-kreis), [Jan Kautz](/person/jan-kautz), Christopher Pal, [Arash Vahdat](/person/arash-vahdat), Anima Anandkumar



[Journal of Machine Learning Research (JMLR) 2025](https://arxiv.org/abs/2302.07400)









[Make It Count: Text-to-Image Generation with an Accurate Number of Objects](/publication/2025-06_make-it-count-text-image-generation-accurate-number-objects)

Lital Binyamin, [Yoad Tewel](/person/yoad-tewel), Eran Hirsch, Royi Rassin, [Gal Chechik](/person/gal-chechik)



[CVPR 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Binyamin_Make_It_Count_Text-to-Image_Generation_with_an_Accurate_Number_of_CVPR_2025_paper.pdf)









[Coherent 3D Portrait Video Reconstruction via Triplane Fusion](/publication/2025-06_coherent-3d-portrait-video-reconstruction-triplane-fusion)

Shengze Wang, [Xueting Li](/person/xueting-li), [Chao Liu](/person/chao-liu), Matthew Chan, [Michael Stengel](/person/michael-stengel), Henry Fuchs, [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Wang_Coherent_3D_Portrait_Video_Reconstruction_via_Triplane_Fusion_CVPR_2025_paper.pdf)









[SimAvatar: Simulation-Ready Clothed Gaussian Avatars from Text](/publication/2025-06_simavatar-simulation-ready-clothed-gaussian-avatars-text)

[Xueting Li](/person/xueting-li), [Ye Yuan](/person/ye-yuan), [Shalini De Mello](/person/shalini-de-mello), Gilles Daviet, Jonathan Leaf, Miles Macklin, [Jan Kautz](/person/jan-kautz), [Umar Iqbal](/person/umar-iqbal)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Li_SimAvatar_Simulation-Ready_Avatars_with_Layered_Hair_and_Clothing_CVPR_2025_paper.pdf)









[A Generative AI Game Jam Case Study from October 2024](/publication/2025-06_generative-ai-game-jam-case-study-october-2024)

[Josef Spjut](/person/josef-spjut)



[CVPR 2025 Workshop on Computer Vision For Videogames (CV2)](https://sites.google.com/view/cv2-2025/)









[Beyond the Buzz: A Pragmatic Take on Inference Disaggregation](/publication/2025-06_beyond-buzz-pragmatic-take-inference-disaggregation)

Tiyasa Mitra, Ritika Borkar, Nidhi Bhatia, Ramon Matas, Shivam Raj, Dheevatsa Mudigere, Ritchie Zhao, Maximilian Golub, Arpan Dutta, Sailaja Madduri, Dharmesh Jani, Brian Pharris, Bita Darvish Rouhani 



[Arxiv](https://arxiv.org/abs/2506.05508)









[Inference-Time Policy Steering through Human Interactions](/publication/2025-05_inference-time-policy-steering-through-human-interactions)

Yanwei Wang, Lirui Wang, Yilun Du, [Balakumar Sundaralingam](/person/balakumar-sundaralingam), [Xuning Yang](/person/xuning-yang), [Yu-Wei Chao](/person/yu-wei-chao), [Claudia Pérez D’Arpino ](/person/cdarpino), Dieter Fox, Julie Shah



[IEEE ICRA 2025](https://arxiv.org/pdf/2411.16627)









[Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond](/publication/2025-05_score-distillation-sampling-audio-source-separation-synthesis-and-beyond)

Jessie Richter-Powell, Antonio Torralba, Jonathan Lorraine



[Arxiv](https://arxiv.org/pdf/2505.04621)









[Fugatto 1 - Foundational Generative Audio Transformer Opus 1](/publication/2025-04_fugatto-1-foundational-generative-audio-transformer-opus-1)

Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, Joao Felipe Santos, Shuqi Dai, [Siddharth Gururani](/person/siddharth-gururani), Aya AIJa'fari, Alex Liu, Kevin Shih, Wei Ping, [Huck Yang](/person/huck-yang), Bryan Catanzaro



[ICLR 2025](https://openreview.net/forum?id=B2Fqu7Y2cd)









[UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation](/publication/2025-04_uniwav-towards-unified-pre-training-speech-representation-learning-and)

Alexander H. Liu, Sang-gil Lee, [Huck Yang](/person/huck-yang), Yuan Gong, [Frank Wang](/person/frank-wang), James R. Glas, Rafael Valle



[ICLR 2025](https://openreview.net/forum?id=yj9lLwMjnE)









[Minitron-SSM: Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning](/publication/2025-04_minitron-ssm-efficient-hybrid-language-model-compression-through-group-aware)

Ali Taghibakhshi, Sharath Turuvekere Sreenivas, [Saurav Muralidharan](/person/saurav-muralidharan), Marcin Chochowski, Yashaswi Karnati, Raviraj Joshi, Ameya Sunil Mahabaleshwarkar, Zijia Chen, Yoshi Suhara, Oluwatobi Olabiyi, Daniel Korzekwa, Mostofa Patwary, Mohammad Shoeybi, [Jan Kautz](/person/jan-kautz), Bryan Catanzaro, Ashwath Aithal, Nima Tajbakhsh, [Pavlo Molchanov](/person/pavlo-molchanov)



[NeurIPS 2025](https://arxiv.org/abs/2504.11409)









[Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models, ](/publication/2025-04_lightning-fast-image-inversion-and-editing-text-image-diffusion-models)

Dvir Samuel, Barak Meiri, [Haggai Maron](/person/haggai-maron), [Yoad Tewel](/person/yoad-tewel), Nir Darshan, Shai Avidan, [Gal Chechik](/person/gal-chechik), Rami Ben-Ari



[ICLR 2025](https://iclr.cc/virtual/2025/poster/28072)









[Cosmos Transfer 1: World-to-World Transfer with Adaptive Multi-Control for Physical AI](/publication/2025-03_cosmos-transfer-1-world-world-transfer-adaptive-multi-control-physical-ai)

[Ming-Yu Liu](/person/ming-yu-liu)



[Arxiv](https://arxiv.org/abs/2503.14492)









[Cosmos-Reason 1: From Physical AI Common Sense to Embodied Decisions](/publication/2025-03_cosmos-reason-1-physical-ai-common-sense-embodied-decisions)

[Tsung-Yi Lin](/person/tsung-yi-lin), [Ming-Yu Liu](/person/ming-yu-liu)













[NVIDIA Isaac GR00T N1: An Open Foundation Model for Humanoid Robots](/publication/2025-03_nvidia-isaac-gr00t-n1-open-foundation-model-humanoid-robots)

[Yuke Zhu](/person/yuke-zhu), [Linxi "Jim" Fan](/person/linxi-jim-fan), NVIDIA GEAR Team













[Multi-student Diffusion Distillation for Better One-step Generators](/publication/2025-03_multi-student-diffusion-distillation-better-one-step-generators)

Yanke Song, Jonathan Lorraine, Weili Nie, [Karsten Kreis](/person/karsten-kreis), James Lucas 



[Arxiv](https://arxiv.org/pdf/2410.23274)









[LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models](/publication/2025-03_llama-mesh-unifying-3d-mesh-generation-language-models)

Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler, Xiaohui Zeng



[Arxiv](https://arxiv.org/pdf/2411.09595)









[CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models](/publication/2025-02_corrfill-enhancing-faithfulness-reference-based-inpainting-correspondence)

Kuan-Hung Liu, Cheng-Kun Yang, [Min-Hung Chen](/person/min-hung-chen), Yu-Lun Liu, Yen-Yu Lin



[Winter Conference on Applications of Computer Vision (WACV)](https://wacv2025.thecvf.com/)









[Energy-Based Diffusion Language Models for Text Generation](/publication/2025-01_energy-based-diffusion-language-models-text-generation)

Minkai Xu, [Tomas Geffner](/person/tomas-geffner), [Karsten Kreis](/person/karsten-kreis), Weili Nie, Yilun Xu, Jure Leskovec, Stefano Ermon, [Arash Vahdat](/person/arash-vahdat)



[International Conference on Learning Representations (ICLR) 2025](https://arxiv.org/abs/2410.21357)









[ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids](/publication/2025-01_protcomposer-compositional-protein-structure-generation-3d-ellipsoids)

Hannes Stark, Bowen Jing, [Tomas Geffner](/person/tomas-geffner), Jason Yim, Tommi Jaakkola, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[International Conference on Learning Representations (ICLR) 2025 (Oral)](https://arxiv.org/abs/2503.05025)









[Truncated Consistency Models](/publication/2025-01_truncated-consistency-models)

Sangyun Lee, Yilun Xu, [Tomas Geffner](/person/tomas-geffner), Giulia Fanti, [Karsten Kreis](/person/karsten-kreis), [Arash Vahdat](/person/arash-vahdat), Weili Nie



[International Conference on Learning Representations (ICLR) 2025](https://arxiv.org/abs/2410.14895)









[Proteina: Scaling Flow-based Protein Structure Generative Models](/publication/2025-01_proteina-scaling-flow-based-protein-structure-generative-models)

[Tomas Geffner](/person/tomas-geffner), [Kieran Didi](/person/kieran-didi), Zuobai Zhang, Danny Reidenbach, Zhonglin Cao, Jason Yim, Mario Geiger, Christian Dallago, Emine Kucukbenli, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[International Conference on Learning Representations (ICLR) 2025 (Oral)](https://arxiv.org/abs/2503.00710)









[Directed Graph Generation with Heat Kernels](/publication/2025-01_directed-graph-generation-heat-kernels)

Marc T. Law, [Karsten Kreis](/person/karsten-kreis), [Haggai Maron](/person/haggai-maron)



[Transactions on Machine Learning Research (TMLR) 2025](https://openreview.net/forum?id=60Gi1w6hte)









[Cosmos World Foundation Model Platform for Physical AI](/publication/2025-01_cosmos-world-foundation-model-platform-physical-ai)

[Ming-Yu Liu](/person/ming-yu-liu), Many other contributors at https://d1qx31qr3h6wln.cloudfront.net/publications/NVIDIA%20Cosmos_4.pdf, [Jing Zhang](/person/jing-zhang)













### 2024 

[L4GM: Large 4D Gaussian Reconstruction Model](/publication/2024-12_l4gm-large-4d-gaussian-reconstruction-model)

Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng, [Karsten Kreis](/person/karsten-kreis), Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling



[Neural Information Processing Systems (NeurIPS) 2024](https://arxiv.org/abs/2406.10324)









[Molecule Generation with Fragment Retrieval Augmentation](/publication/2024-12_molecule-generation-fragment-retrieval-augmentation)

Seul Lee, [Karsten Kreis](/person/karsten-kreis), Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Saee Paliwal, [Arash Vahdat](/person/arash-vahdat), Weili Nie



[Neural Information Processing Systems (NeurIPS) 2024](https://arxiv.org/abs/2411.12078)









[FactorSim: Generative Simulation via Factorized Representation](/publication/2024-12_factorsim-generative-simulation-factorized-representation)

Fan-Yun Sun, S. I. Harini, Angela Yi, Yihan Zhou, [Alex Zook](/person/alex-zook), [Jonathan Tremblay](/person/jonathan-tremblay), Logan Cross, Jiajun Wu, Nick Haber



[NeurIPS 2024](https://neurips.cc/Conferences/2024)









[Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization](/publication/2024-12_aligning-target-aware-molecule-diffusion-models-exact-energy-optimization)

Siyi Gu, Minkai Xu, Alexander Powers, Weili Nie, [Tomas Geffner](/person/tomas-geffner), [Karsten Kreis](/person/karsten-kreis), Jure Leskovec, [Arash Vahdat](/person/arash-vahdat), Stefano Ermon



[Neural Information Processing Systems (NeurIPS) 2024](https://arxiv.org/abs/2407.01648)









[Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models](/publication/2024-12_warped-diffusion-solving-video-inverse-problems-image-diffusion-models)

Giannis Daras, Weili Nie, [Karsten Kreis](/person/karsten-kreis), Alexandros G. Dimakis, [Morteza Mardani](/person/morteza-mardani), [Nikola Kovachki](/person/nikola-kovachki), [Arash Vahdat](/person/arash-vahdat)



[Neural Information Processing Systems (NeurIPS) 2024](https://arxiv.org/abs/2410.16152)









[Diffusion-Reward Adversarial Imitation Learning](/publication/2024-12_diffusion-reward-adversarial-imitation-learning)

Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh, [Frank Wang](/person/frank-wang), [Min-Hung Chen](/person/min-hung-chen), Shao-Hua Sun



[Neural Information Processing Systems (NeurIPS)](https://neurips.cc/Conferences/2024)









[Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models](/index.php/publication/2024-12_self-taught-recognizer-toward-unsupervised-adaptation-speech-foundation-models)

Yuchen Hu, Chen Chen, [Huck Yang](/index.php/person/huck-yang), Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang



[NeurIPS](https://arxiv.org/pdf/2405.14161)









[MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting](/publication/2024-12_maskedmimic-unified-physics-based-character-control-through-masked-motion)

[Chen Tessler](/person/chen-tessler), Kelly Guo, Ofir Nabati, [Gal Chechik](/person/gal-chechik), Jason Peng



[SIGGRAPH Asia 2024](https://research.nvidia.com/labs/par/maskedmimic/)









[Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits](/publication/2024-12_detecting-undetectable-assessing-efficacy-current-spoof-detection-methods)

[Sung-Feng Huang](/person/sung-feng-huang), Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, [Huck Yang](/person/huck-yang), Yu Tsao, [Frank Wang](/person/frank-wang), Hung-yi Lee, [Szu-Wei Fu](/person/szu-wei-fu)



[IEEE SLT 2024](https://2024.ieeeslt.org/)









[DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent](/publication/2024-11_drc-coder-automated-drc-checker-code-generation-using-llm-autonomous-agent)

Chen-Chia Chang, [Chia-Tung (Mark) Ho](/person/chia-tung-mark-ho), Yaguang Li, Yiran Chen, Mark Haoxing Ren



[arXiv](https://arxiv.org/abs/2412.05311)









[From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment](/publication/2024-11_descriptive-richness-bias-unveiling-dark-side-generative-image-caption)

Yusuke Hirota, [Ryo Hachiuma](/person/ryo-hachiuma), [Huck Yang](/person/huck-yang), Yuta Nakashima



[EMNLP](https://arxiv.org/pdf/2406.13912)









[Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos](/publication/2024-09_avatar-fingerprinting-authorized-use-synthetic-talking-head-videos)

[Ekta Prashnani](/person/ekta-prashnani), [Koki Nagano](/person/koki-nagano), [Shalini De Mello](/person/shalini-de-mello), [David Luebke](/person/david-luebke), Orazio Gallo



[European Conference on Computer Vision (ECCV) 2024](https://eccv.ecva.net)









[Learning to Move Like Professional Counter-Strike Players](/publication/2024-09_learning-move-professional-counter-strike-players)

David Durst, F. Xie, V. Sarukkai, Brennan Shacklett, [Iuri Frosio](/person/iuri-frosio), [Chen Tessler](/person/chen-tessler), [Joohwan Kim](/person/joohwan-kim), C. Taylor, G. Bernstein, S. Choudhury, P. Hanrahan,, Kayvon Fatahalian



[The 23rd ACM SIGGRAPH / Eurographics Symposium on Computer Animation (SCA 2024)](https://computeranimation.org/program.html)









[VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool](/publication/2024-08_verilogcoder-autonomous-verilog-coding-agents-graph-based-planning-and-abstract)

[Chia-Tung (Mark) Ho](/person/chia-tung-mark-ho), Mark Haoxing Ren, [Brucek Khailany](/person/brucek-khailany)



[arXiv](https://arxiv.org/abs/2408.08927)









[Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling](/index.php/publication/2024-08_kilometer-scale-convection-allowing-model-emulation-using-generative-diffusion)

[Jaideep Pathak](/index.php/person/jaideep-pathak), Yair Cohen, Piyush Garg, Peter Harrington, [Noah Brenowitz](/index.php/person/noah-brenowitz), [Dale Durran](/index.php/person/dale-durran), [Morteza Mardani](/index.php/person/morteza-mardani), [Arash Vahdat](/index.php/person/arash-vahdat), Shaoming Xu, Karthik Kashinath, [Mike Pritchard](/index.php/person/mike-pritchard)













[GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators](/publication/2024-08_gentranslate-large-language-models-are-generative-multilingual-speech-and)

Yuchen Hu, Chen Chen, [Huck Yang](/person/huck-yang), Ruizhe Li, Zhehuai Chen, Eng Siong Chng



[ACL 2024](https://arxiv.org/pdf/2402.06894)









[TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models](/publication/2024-08_turboedit-text-based-image-editing-using-few-step-diffusion-models)

Gilad Deutch, Rinon Gal, Daniel Garibi, Or Patashnik, Daniel Cohen-Or



[SIGGRAPH Asia 2024](https://arxiv.org/abs/2408.00735)









[Align Your Steps: Optimizing Sampling Schedules in Diffusion Models](/publication/2024-07_align-your-steps-optimizing-sampling-schedules-diffusion-models)

Amirmojtaba Sabour, Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[International Conference on Machine Learning (ICML) 2024](https://arxiv.org/abs/2404.14507)









[DoRA: Weight-Decomposed Low-Rank Adaptation](/publication/2024-07_dora-weight-decomposed-low-rank-adaptation)

Shih-Yang Liu, Chien-Yi Wang, [Hongxu Danny Yin](/person/danny-yin), [Pavlo Molchanov](/person/pavlo-molchanov), [Frank Wang](/person/frank-wang), Kwang-Ting Cheng, [Min-Hung Chen](/person/min-hung-chen)



[International Conference on Machine Learning (ICML) 2024](https://icml.cc/Conferences/2024)









[DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents](/publication/2024-07_disco-diff-enhancing-continuous-diffusion-models-discrete-latents)

Yilun Xu, Gabriele Corso, Tommi Jaakkola, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[International Conference on Machine Learning (ICML) 2024](https://arxiv.org/abs/2407.03300)









[Breathing Life Into Sketches Using Text-to-Video Priors](/publication/2024-07_breathing-life-sketches-using-text-video-priors)

Rinon Gal, Yael Vinker, Yuval Alaluf, Amit Bermano, Daniel Cohen-Or, Ariel Shamir, [Gal Chechik](/person/gal-chechik)



[CVPR 2024](https://arxiv.org/abs/2311.13608)









[fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence](/publication/2024-07_fvdb-deep-learning-framework-sparse-large-scale-and-high-performance-spatial)

Francis Williams, Jiahui Huang, Jonathan Swartz, Gergely Klar, Vijay Thakkar, Matthew Cong, Xuanchi Ren, Ruilong Li, Clement Fuji-Tsang, Sanja Fidler, Eftychios Sifakis, Ken Museth













[SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation](/publication/2024-07_superpadl-scaling-language-directed-physics-based-control-progressive)















[Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling](/publication/2024-06_motion-i2v-consistent-and-controllable-image-video-generation-explicit-motion)

Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li



[Paper](https://arxiv.org/abs/2401.15977)









[Large Language Model (LLM) for Standard Cell Layout Design Optimization](/publication/2024-06_large-language-model-llm-standard-cell-layout-design-optimization)

[Chia-Tung (Mark) Ho](/person/chia-tung-mark-ho), Mark Haoxing Ren



[The First IEEE International Workshop on LLM-Aided Design (LAD'24)](https://arxiv.org/abs/2406.06549)



Best Paper Award





[Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer](/publication/2024-06_space-time-diffusion-features-zero-shot-text-driven-motion-transfer)

Danah Yatim, Rafail Fridman, Omer Bar-Tal, [Yoni Kasten](/person/yoni-kasten), Tali Dekel



[CVPR 2024](https://openaccess.thecvf.com/content/CVPR2024/html/Yatim_Space-Time_Diffusion_Features_for_Zero-Shot_Text-Driven_Motion_Transfer_CVPR_2024_paper.html)









[What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs](/publication/2024-06_what-you-see-what-you-gan-rendering-every-pixel-high-fidelity-geometry-3d-gans)

[Alexander Trevithick](/person/alexander-trevithick), Matthew Chan, Towaki Takikawa, [Umar Iqbal](/person/umar-iqbal), [Shalini De Mello](/person/shalini-de-mello), Manmohan Chandraker, Ravi Ramamoorthi, [Koki Nagano](/person/koki-nagano)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Trevithick_What_You_See_is_What_You_GAN_Rendering_Every_Pixel_CVPR_2024_paper.pdf)









[Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata](/publication/2024-06_outdoor-scene-extrapolation-hierarchical-generative-cellular-automata)

Dongsu Zhang, Francis Williams, Zan Gojcic, [Karsten Kreis](/person/karsten-kreis), Sanja Fidler, Young Min Kim, Amlan Kar



[CVPR 2024 (Highlight)](https://arxiv.org/abs/2406.08292)









[GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning](/publication/2024-06_gavatar-animatable-3d-gaussian-avatars-implicit-mesh-learning)

[Ye Yuan](/person/ye-yuan), [Xueting Li](/person/xueting-li), Yangyi Huang, [Shalini De Mello](/person/shalini-de-mello), [Koki Nagano](/person/koki-nagano), [Jan Kautz](/person/jan-kautz), [Umar Iqbal](/person/umar-iqbal)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Yuan_GAvatar_Animatable_3D_Gaussian_Avatars_with_Implicit_Mesh_Learning_CVPR_2024_paper.pdf)



Highlight





[Dream-in-4D: A Unified Approach for Text- and Image-guided 4D Scene Generation](/publication/2024-06_dream-4d-unified-approach-text-and-image-guided-4d-scene-generation)

Yufeng Zheng, [Xueting Li](/person/xueting-li), [Koki Nagano](/person/koki-nagano), [Sifei Liu](/person/sifei-liu), Otmar Hilliges, [Shalini De Mello](/person/shalini-de-mello)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Zheng_A_Unified_Approach_for_Text-_and_Image-guided_4D_Scene_Generation_CVPR_2024_paper.pdf)









[Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models](/publication/2024-06_align-your-gaussians-text-4d-dynamic-3d-gaussians-and-composed-diffusion-models)

Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[CVPR 2024 (Highlight)](https://arxiv.org/abs/2312.13763)









[RegionGPT: Towards Region Understanding Vision Language Model](/publication/2024-06_regiongpt-towards-region-understanding-vision-language-model)

Qiushan Guo, [Shalini De Mello](/person/shalini-de-mello), [Hongxu Danny Yin](/person/danny-yin), [Wonmin Byeon](/person/wonmin-byeon), Ka Chun Cheung, Yizhou Yu, Ping Luo, [Sifei Liu](/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Guo_RegionGPT_Towards_Region_Understanding_Vision_Language_Model_CVPR_2024_paper.pdf)









[Nemotron-4 340B](/publication/2024-06_nemotron-4-340b)















[Flexible Motion In-betweening with Diffusion Models](/publication/2024-05_flexible-motion-betweening-diffusion-models)

Setareh Cohan, Guy Tevet, Daniele Reda, Xue Bin Peng, Michiel van de Panne



[Paper](https://arxiv.org/abs/2405.11126)









[Large Language Models are Efficient Learners of Noise-Robust Speech Recognition](/publication/2024-05_large-language-models-are-efficient-learners-noise-robust-speech-recognition)

YuChen Hu, Chen Chen, [Huck Yang](/person/huck-yang), Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng



[ICLR 2024](https://iclr.cc/Conferences/2024)









[WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space](/publication/2024-05_wildfusion-learning-3d-aware-latent-diffusion-models-view-space)

Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger, [Karsten Kreis](/person/karsten-kreis)



[International Conference on Learning Representations (ICLR) 2024](https://arxiv.org/abs/2311.13570)









[It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition](/publication/2024-05_it-s-never-too-late-fusing-acoustic-information-large-language-models-automatic)

Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng, [Huck Yang](/person/huck-yang)



[ICLR 2024](https://iclr.cc/Conferences/2024)









[3D Reconstruction with Generalizable Neural Fields using Scene Priors](/publication/2024-05_3d-reconstruction-generalizable-neural-fields-using-scene-priors)

Yang Fu, [Shalini De Mello](/person/shalini-de-mello), [Xueting Li](/person/xueting-li), Amey Kulkarni, [Jan Kautz](/person/jan-kautz), Xiaolong Wang, [Sifei Liu](/person/sifei-liu)



[International Conference on Learning Representations (ICLR) 2024](https://proceedings.iclr.cc/paper_files/paper/2024/hash/0bd32794b26cfc99214b89313764da8e-Abstract-Conference.html)









[LCM-Lookahead for Encoder-based Text-to-Image Personalization](/publication/2024-04_lcm-lookahead-encoder-based-text-image-personalization)

Rinon Gal, Or Lichter, Elad Richardson, Or Patashnik, Amit H Bermano, [Gal Chechik](/person/gal-chechik), Daniel Cohen-Or



[ECCV 2024](https://arxiv.org/abs/2404.03620)









[LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis](/publication/2024-03_latte3d-large-scale-amortized-text-enhanced3d-synthesis)

Kevin Xie, Jonathan Lorraine, Tianshi Cao, Jun Gao, James Lucas, Antonio Torralba, Sanja Fidler, Xiaohui Zeng



[ECCV](https://eccv2024.ecva.net/)









[Consolidating Attention Features for Multi-view Image Editing](/publication/2024-02_consolidating-attention-features-multi-view-image-editing)

Or Patashnik, Rinon Gal, Daniel Cohen-Or, Jun-Yan Zhu, Fernando De la Torre



[SIGGRAPH Asia 2024](https://arxiv.org/abs/2402.14792)









[ConsiStory: Training-Free Consistent Text-to-Image Generation](/publication/2024-02_consistory-training-free-consistent-text-image-generation)

Yoad Tewel, Omri Kaduri, Rinon Gal, [Yoni Kasten](/person/yoni-kasten), Lior Wolf, [Gal Chechik](/person/gal-chechik), [Yuval Atzmon](/person/yuval-atzmon)



[SIGGRAPH](https://arxiv.org/abs/2402.03286)









[Generating images of rare concepts using pre-trained diffusion models](/publication/2024-01_generating-images-rare-concepts-using-pre-trained-diffusion-models)

Dvir Samuel, Rami Ben-Ari, Simon Raviv, Nir Darshan, [Gal Chechik](/person/gal-chechik)



[AAAI 2024](https://arxiv.org/abs/2304.14530)









### 2023 

[Point-Cloud Completion with Pretrained Text-to-image Diffusion Models](/publication/2023-12_point-cloud-completion-pretrained-text-image-diffusion-models)

[Yoni Kasten](/person/yoni-kasten), Ohad Rahamim, [Gal Chechik](/person/gal-chechik)



[NeurIPS 2023](https://arxiv.org/pdf/2306.10533.pdf)









[HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models](/publication/2023-12_hyporadise-open-baseline-generative-speech-recognition-large-language-models)

Chen Chen, YuChen Hu, [Huck Yang](/person/huck-yang), Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng



[NeurIPS 2023](https://openreview.net/forum?id=cAjZ3tMye6)









[SceneScape: Text-Driven Consistent Scene Generation](/publication/2023-12_scenescape-text-driven-consistent-scene-generation)

Rafail Fridman, Amit Abecasis, [Yoni Kasten](/person/yoni-kasten), Tali Dekel



[NeurIPS 2023](https://arxiv.org/pdf/2302.01133.pdf)









[Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition](/publication/2023-12_whispering-llama-cross-modal-generative-error-correction-framework-speech)

Srijith Radhakrishnan, [Huck Yang](/person/huck-yang), Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér



[EMNLP](https://aclanthology.org/2023.emnlp-main.618/)









[XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies](/publication/2023-12_xcube-large-scale-3d-generative-modeling-using-sparse-voxel-hierarchies)

Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams



[CVPR](https://arxiv.org/abs/2312.03806)









[Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models](/publication/2023-12_domain-agnostic-tuning-encoder-fast-personalization-text-image-models)

Moab Arar, Rinon Gal, [Yuval Atzmon](/person/yuval-atzmon), [Gal Chechik](/person/gal-chechik), Daniel Cohen-Or, Ariel Shamir, Amit Bermano



[SIGGRAPH Asia 2023](https://arxiv.org/abs/2307.06925)









[ChipNeMo: Domain-Adapted LLMs for Chip Design](/publication/2023-10_chipnemo-domain-adapted-llms-chip-design)

[Mingjie Liu](/person/mingjie-liu), Teo Ene, Robert Kirby, Chris Cheng, [Nathaniel Pinckney](/person/nathaniel-pinckney), [Rongjian Liang](/person/rongjian-liang), Jonah Alben, Himyanshu Anand, Sanmitra Banerjee, Ismet Bayraktaroglu, Bonita Bhaskaran, Bryan Catanzaro, Arjun Chaudhuri, Sharon Clay, Bill Dally, Laura Dang, Parikshit Deshpande, Siddhanth Dhodhi, Sameer Halepete, Eric Hill, Jiashang Hu, Sumit Jain, [Brucek Khailany](/person/brucek-khailany), George Kokai, Kishor Kunal, Xiaowei Li, Charley Lind, Hao Liu, Stuart Oberman, Sujeet Omar, Sreedhar Pratty, Jonathan Raman, Ambar Sarkar, Zhengjiang Shao, Hanfei Sun, Pratik P Suthar, Varun Tej, [Walker Turner](/person/walker-turner), Kaizhe Xu, Mark Haoxing Ren













[TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models](/publication/2023-10_texfusion-synthesizing-3d-textures-text-guided-image-diffusion-models)

Tianshi Cao, [Karsten Kreis](/person/karsten-kreis), Sanja Fidler, Nicholas Sharp, Kangxue Yin



[IEEE/CVF International Conference on Computer Vision (ICCV) 2023 (Oral)](https://arxiv.org/abs/2310.13772)









[Generative Novel View Synthesis with 3D-Aware Diffusion Models](/publication/2023-10_generative-novel-view-synthesis-3d-aware-diffusion-models)

Eric R. Chan, [Koki Nagano](/person/koki-nagano), Matthew Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy, [Miika Aittala](/person/miika-aittala), [Shalini De Mello](/person/shalini-de-mello), [Tero Karras](/person/tero-karras), Gordon Wetzstein



[International Conference on Computer Vision (ICCV) 2023](https://iccv2023.thecvf.com/)



Oral





[DreamTeacher: Pretraining Image Backbones with Deep Generative Models](/publication/2023-10_dreamteacher-pretraining-image-backbones-deep-generative-models)

Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, [Karsten Kreis](/person/karsten-kreis), Antonio Torralba, Sanja Fidler



[IEEE/CVF International Conference on Computer Vision (ICCV) 2023](https://arxiv.org/abs/2307.07487)









[ATT3D: Amortized Text-To-3D Object Synthesis](/publication/2023-10_att3d-amortized-text-3d-object-synthesis)

Jonathan Lorraine, Kevin Xie, Xiaohui Zeng, [Chen-Hsuan Lin](/person/chen-hsuan-lin), Towaki Takikawa, Nicholas Sharp, [Tsung-Yi Lin](/person/tsung-yi-lin), [Ming-Yu Liu](/person/ming-yu-liu), Sanja Fidler, James Lucas



[ICCV](https://openaccess.thecvf.com/content/ICCV2023/papers/Lorraine_ATT3D_Amortized_Text-to-3D_Object_Synthesis_ICCV_2023_paper.pdf)









[Syntactic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment](/publication/2023-10_syntactic-binding-diffusion-models-enhancing-attribute-correspondence-through)

Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg, [Gal Chechik](/person/gal-chechik)



[NeurIPS 2023](https://nips.cc/virtual/2023/oral/73870)



Oral presentation





[Norm-guided latent space exploration for text-to-image generation](/publication/2023-10_norm-guided-latent-space-exploration-text-image-generation)

Dvir Samuel, Rami Ben-Ari, Nir Darshan, [Haggai Maron](/person/haggai-maron), [Gal Chechik](/person/gal-chechik)



[NeurIPS 2023](https://nips.cc/virtual/2023/poster/70922)









[VerilogEval: Evaluating Large Language Models for Verilog Code Generation](/publication/2023-09_verilogeval-evaluating-large-language-models-verilog-code-generation)

[Mingjie Liu](/person/mingjie-liu), [Nathaniel Pinckney](/person/nathaniel-pinckney), [Brucek Khailany](/person/brucek-khailany), Mark Haoxing Ren



[2023 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)](https://arxiv.org/abs/2309.07544)









[Differentially Private Diffusion Models](/publication/2023-08_differentially-private-diffusion-models)

Tim Dockhorn, Tianshi Cao, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[Transactions on Machine Learning Research (TMLR) 2023](https://arxiv.org/abs/2210.09929)









[Flexible Isosurface Extraction for Gradient-Based Mesh Optimization](/publication/2023-08_flexible-isosurface-extraction-gradient-based-mesh-optimization)

Tianchang Shen, [Jacob Munkberg](/person/jacob-munkberg), [Jon Hasselgren](/person/jon-hasselgren), Kangxue Yin, Zian Wang, Wenzheng Chen, Zan Gojcic, Sanja Fidler, Nicholas Sharp, Jun Gao



[ACM Transactions On Graphics (SIGGRAPH 2023)](https://dl.acm.org/doi/abs/10.1145/3592430)









[AI-Mediated 3D Video Conferencing](/publication/2023-08_ai-mediated-3d-video-conferencing)

[Michael Stengel](/person/michael-stengel), [Koki Nagano](/person/koki-nagano), [Chao Liu](/person/chao-liu), Matthew Chan, Alex Trevithick, [Shalini De Mello](/person/shalini-de-mello), [Jonghyun Kim](/person/jonghyun-kim), [David Luebke](/person/david-luebke), [Amrita Mazumdar](/person/amrita-mazumdar), Shengze Wang, Mayoore Jaiswal



[ACM SIGGRAPH Emerging Technologies 2023](https://dl.acm.org/doi/abs/10.1145/3588037.3595385)









[A Hybrid Generator Architecture for Controllable Face Synthesis](/publication/2023-08_hybrid-generator-architecture-controllable-face-synthesis)

Dann Mensah, Nam Hee Kim, [Miika Aittala](/person/miika-aittala), [Samuli Laine](/person/samuli-laine), [Jaakko Lehtinen](/person/jaakko-lehtinen)



[SIGGRAPH 2023](https://s2023.siggraph.org/)









[IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research](/publication/2023-08_igb-addressing-gaps-labeling-features-heterogeneity-and-size-public-graph)

Arpandeep Khatua, [Vikram Sharma Mailthody](/person/vikram-sharma-mailthody-0), Bhagyashree Taleka, Tengfei Ma, Xiang Song, [Wen-mei Hwu](/person/wen-mei-hwu)



[KDD'23](https://kdd.org/kdd2023)









[Live 3D Portrait: Real-Time Radiance Fields for Single-Image Portrait View Synthesis](/publication/2023-08_live-3d-portrait-real-time-radiance-fields-single-image-portrait-view-synthesis)

Alexander Trevithick, Matthew Chan, [Michael Stengel](/person/michael-stengel), Eric R. Chan, [Chao Liu](/person/chao-liu), [Zhiding Yu](/person/zhiding-yu), Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi, [Koki Nagano](/person/koki-nagano)



[ACM Transactions On Graphics (SIGGRAPH 2023)](https://s2023.siggraph.org/)









[Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models](/publication/2023-08_encoder-based-domain-tuning-fast-personalization-text-image-models-0)

Rinon Gal, Moab Arar, [Yuval Atzmon](/person/yuval-atzmon), Amit Bermano, [Gal Chechik](/person/gal-chechik), Daniel Cohen-Or



[SIGGRAPH 2023](https://arxiv.org/abs/2302.12228)









[Efficient Transformer Inference with Statically Structured Sparse Attention](/publication/2023-07_efficient-transformer-inference-statically-structured-sparse-attention)

[Steve Dai](/person/steve-dai), Hasan Genc, [Rangharajan Venkatesan](/person/rangharajan-venkatesan), [Brucek Khailany](/person/brucek-khailany)



[2023 60th ACM/IEEE Design Automation Conference (DAC)](https://ieeexplore.ieee.org/xpl/conhome/10247654/proceeding)









[Physics-Informed Optical Kernel Regression Using Complex-valued Neural Fields](/publication/2023-07_physics-informed-optical-kernel-regression-using-complex-valued-neural-fields)

Guojin Chen, Zehua Pei, [Haoyu Yang](/person/haoyu-yang), Yuzhe Ma, Bei Yu, Martin Wong



[60th ACM/IEEE Design Automation Conference](https://arxiv.org/abs/2303.08435)









[StructDiffusion: Language-Guided Creation of Physically-Valid Structures using Unseen Objects](/publication/2023-07_structdiffusion-language-guided-creation-physically-valid-structures-using)

 Weiyu Liu, Yilun Du, [Tucker Hermans](/person/tucker-hermans), Sonia Chernova, Chris Paxton



[Robotics: Science and Systems (RSS) 2023](https://roboticsconference.org/)









[Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models](/publication/2023-06_align-your-latents-high-resolution-video-synthesis-latent-diffusion-models)

Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://arxiv.org/abs/2304.08818)









[NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models](/publication/2023-06_neuralfield-ldm-scene-generation-hierarchical-latent-diffusion-models)

Seung Wook Kim, Bradley Brown, Kangxue Yin, [Karsten Kreis](/person/karsten-kreis), Katja Schwarz, Daiqing Li, Robin Rombach, Antonio Torralba, Sanja Fidler



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://arxiv.org/abs/2304.09787)









[FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization](/publication/2023-06_freenerf-improving-few-shot-neural-rendering-free-frequency-regularization)

Jiawei Yang, [Marco Pavone](/person/marco-pavone), [Yue Wang](/person/yue-wang)



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)









[Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models](/publication/2023-06_open-vocabulary-panoptic-segmentation-text-image-diffusion-models)

Jiarui Xu, [Sifei Liu](/person/sifei-liu), [Arash Vahdat](/person/arash-vahdat), [Wonmin Byeon](/person/wonmin-byeon), Xiaolong Wang, [Shalini De Mello](/person/shalini-de-mello)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023](https://cvpr2023.thecvf.com/)



Hightlight top 10%





[Planning for Multi-Object Manipulation with Graph Neural Network Relational Classifiers](/publication/2023-06_planning-multi-object-manipulation-graph-neural-network-relational-classifiers)

Yixuan Huang, Adam Conkey, [Tucker Hermans](/person/tucker-hermans)



[IEEE International Conference on Robotics and Automation (ICRA)](https://www.icra2023.org/)









[Magic3D: High-Resolution Text-to-3D Content Creation](/publication/2023-06_magic3d-high-resolution-text-3d-content-creation)

[Chen-Hsuan Lin](/person/chen-hsuan-lin), Jun Gao, Luming Tang, Towaki Takikawa, Xiaohui Zeng, Xun Huang, [Karsten Kreis](/person/karsten-kreis), Sanja Fidler, [Ming-Yu Liu](/person/ming-yu-liu), [Tsung-Yi Lin](/person/tsung-yi-lin)



[CVPR 2023 (Highlight)](https://cvpr2023.thecvf.com/)









[BITS: Bi-level Imitation for Traffic Simulation](/publication/2023-05_bits-bi-level-imitation-traffic-simulation)

[Danfei Xu](/person/danfei-xu), [Yuxiao Chen](/person/yuxiao-chen), [Boris Ivanovic](/person/boris-ivanovic), [Marco Pavone](/person/marco-pavone)



[IEEE International Conference on Robotics and Automation (ICRA) 2023](https://www.icra2023.org/)









[ProgPrompt: Generating Situated Robot Task Plans Using Large Language Models](/publication/2023-05_progprompt-generating-situated-robot-task-plans-using-large-language-models)

Ishika Singh, [Valts Blukis](/person/valts-blukis), Arsalan Mousavian, [Ankit Goyal](/person/ankit-goyal), [Danfei Xu](/person/danfei-xu), [Jonathan Tremblay](/person/jonathan-tremblay), Dieter Fox, Jesse Thomason, Animesh Garg



[The International Conference on Robotics and Automation (ICRA)](https://www.icra2023.org/welcome)









[Guided Conditional Diffusion for Controllable Traffic Simulation](/publication/2023-05_guided-conditional-diffusion-controllable-traffic-simulation)

Ziyuan Zhong, Davis Rempe, [Danfei Xu](/person/danfei-xu), [Yuxiao Chen](/person/yuxiao-chen), [Sushant Veer](/person/sushant-veer), [Gerry Che](/person/gerry-che), Baishakhi Ray, [Marco Pavone](/person/marco-pavone)



[IEEE International Conference on Robotics and Automation (ICRA) 2023](https://www.icra2023.org/)









[Expanding the Deployment Envelope of Behavior Prediction via Adaptive Meta-Learning](/publication/2023-05_expanding-deployment-envelope-behavior-prediction-adaptive-meta-learning)

[Boris Ivanovic](/person/boris-ivanovic), James Harrison, [Marco Pavone](/person/marco-pavone)



[IEEE International Conference on Robotics and Automation (ICRA) 2023](https://www.icra2023.org/)









[Subpixel Deblurring of Anti-Aliased Raster Clip Art](/publication/2023-05_subpixel-deblurring-anti-aliased-raster-clip-art)

Jinfan Yang, [Nicholas Vining](/person/nicholas-vining), Shakiba Kheradmand, Nathan Carr, Leonid Sigal, Alla Sheffer



[Computer Graphics Forum (Proc. Eurographics 2023)](https://diglib.eg.org/handle/10.1111/cgf14744)









[CALM: Conditional Adversarial Latent Models for Directable Virtual Characters](/publication/2023-05_calm-conditional-adversarial-latent-models-directable-virtual-characters)

[Chen Tessler](/person/chen-tessler), [Yoni Kasten](/person/yoni-kasten), Yunrong Guo, [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik), Xue Bin Peng



[SIGGRAPH 2023](https://s2023.siggraph.org/)









[Robust and Controllable Object-Centric Learning through Energy-based Models](/publication/2023-05_robust-and-controllable-object-centric-learning-through-energy-based-models)

Ruixiang Zhang, [Gerry Che](/person/gerry-che), [Boris Ivanovic](/person/boris-ivanovic), Renhao Wang, [Marco Pavone](/person/marco-pavone), Yoshua Bengio, Liam Paull



[International Conference on Learning Representations (ICLR) 2023](https://iclr.cc/Conferences/2023)









[Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis](/publication/2023-02_frido-feature-pyramid-diffusion-complex-scene-image-synthesis)

Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, [Frank Wang](/person/frank-wang)



[AAAI 2023](https://aaai.org/Conferences/AAAI-23/)









[BufFormer: A Generative ML Framework for Scalable Buffering](/publication/2023-01_bufformer-generative-ml-framework-scalable-buffering)

[Rongjian Liang](/person/rongjian-liang), Siddhartha Nath, Anand Rajaram, Jiang Hu, Mark Haoxing Ren



[28th Asia and South Pacific Design Automation Conference](https://www.aspdac.com/aspdac2023/cfp/#:~:text=ASP%2DDAC%202023%20is%20the,silicon%20chips%20in%20the%20world.)









### 2022 

["This is my unicorn, Fluffy": Personalizing frozen vision-language representations](/publication/2022-11_my-unicorn-fluffy-personalizing-frozen-vision-language-representations)

Niv Cohen, Rinon Gal, [Eli Meirom](/person/eli-meirom), [Gal Chechik](/person/gal-chechik), [Yuval Atzmon](/person/yuval-atzmon)



[ECCV 2022](https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136800544.pdf)









[Elucidating the Design Space of Diffusion-Based Generative Models](/publication/2022-11_elucidating-design-space-diffusion-based-generative-models)

[Tero Karras](/person/tero-karras), [Miika Aittala](/person/miika-aittala), [Timo Aila](/person/timo-aila), [Samuli Laine](/person/samuli-laine)



[NeurIPS 2022 (oral)](https://nips.cc/Conferences/2022)



NeurIPS Outstanding Paper





[LION: Latent Point Diffusion Models for 3D Shape Generation](/publication/2022-11_lion-latent-point-diffusion-models-3d-shape-generation)

Xiaohui Zeng, [Arash Vahdat](/person/arash-vahdat), Francis Williams, Zan Gojcic, Or Litany, Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[Neural Information Processing Systems (NeurIPS) 2022](https://arxiv.org/abs/2210.06978)









[GENIE: Higher-Order Denoising Diffusion Solvers](/publication/2022-11_genie-higher-order-denoising-diffusion-solvers)

Tim Dockhorn, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[Neural Information Processing Systems (NeurIPS) 2022](https://arxiv.org/abs/2210.05475)









[Diffusion Models for Adversarial Purification](/publication/2022-07_diffusion-models-adversarial-purification)

Weili Nie, Brandon Guo, Yujia Huang, [Chaowei Xiao](/person/chaowei-xiao), [Arash Vahdat](/person/arash-vahdat), Anima Anandkumar



[International Conference on Machine Learning (ICML), 2022](https://arxiv.org/abs/2205.07460)









[CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs](/publication/2022-07_coordgan-self-supervised-dense-correspondences-emerge-gans)

Jiteng Mu, [Shalini De Mello](/person/shalini-de-mello), [Zhiding Yu](/person/zhiding-yu), Nuno Vasconcelos, Xiaolong Wang, [Sifei Liu](/person/sifei-liu)



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)









[Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps](/publication/2022-06_polymorphic-gan-generating-aligned-samples-across-multiple-domains-learned)

Seung Wook Kim, [Karsten Kreis](/person/karsten-kreis), Daiqing Li, Antonio Torralba, Sanja Fidler



[Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Oral)](https://arxiv.org/abs/2206.02903)









[Efficient Geometry-aware 3D Generative Adversarial Networks](/publication/2022-06_efficient-geometry-aware-3d-generative-adversarial-networks)

Eric R. Chan, Connor Z. Lin, Matthew A. Chan, [Koki Nagano](/person/koki-nagano), Boxiao Pan, [Shalini De Mello](/person/shalini-de-mello), Orazio Gallo, Leonidas Guibas, [Jonathan Tremblay](/person/jonathan-tremblay), Sameh Khamis, [Tero Karras](/person/tero-karras), Gordon Wetzstein



[IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://cvpr2022.thecvf.com/)



Oral





[BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations](/publication/2022-06_bigdatasetgan-synthesizing-imagenet-pixel-wise-annotations)

Daiqing Li, Huan Ling, Seung Wook Kim, [Karsten Kreis](/person/karsten-kreis), Adela Barriuso, Sanja Fidler, Antonio Torralba



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022](https://arxiv.org/abs/2201.04684)









[StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators](/publication/2022-05_stylegan-nada-clip-guided-domain-adaptation-image-generators)

Rinon Gal, Or Patashnik, [Haggai Maron](/person/haggai-maron), Amir Bermano, [Gal Chechik](/person/gal-chechik), Daniel Cohen-Or



[SIGGRAPH 2022](https://s2022.siggraph.org/)









[Score-Based Generative Modeling with Critically-Damped Langevin Diffusion](/publication/2022-03_score-based-generative-modeling-critically-damped-langevin-diffusion)

Tim Dockhorn, [Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis)



[International Conference on Learning Representations (ICLR) 2022 (Spotlight)](https://arxiv.org/abs/2112.07068)









[ Tackling the Generative Learning Trilemma with Denoising Diffusion GANs](/publication/2022-03_tackling-generative-learning-trilemma-denoising-diffusion-gans-0)

Zhisheng Xiao, [Karsten Kreis](/person/karsten-kreis), [Arash Vahdat](/person/arash-vahdat)



[International Conference on Learning Representations (ICLR) 2022 (Spotlight)](https://arxiv.org/abs/2112.07804)









[DeePattern: Layout Pattern Generation with Transforming Convolutional Auto-Encoder](/publication/2022-02_deepattern-layout-pattern-generation-transforming-convolutional-auto-encoder)

[Haoyu Yang](/person/haoyu-yang), Shuhe Li, Wen Chen, Piyush Pathak, Frank Gennari, Ya-Chieh Lai, Bei Yu



[IEEE Transactions on Semiconductor Manufacturing](https://ieeexplore.ieee.org/document/9665719)



Best Paper Award





### 2021 

[ATISS: Autoregressive Transformers for Indoor Scene Synthesis](/publication/2021-12_atiss-autoregressive-transformers-indoor-scene-synthesis)

Despoina Paschalidou, Amlan Kar, Maria Shugrina, [Karsten Kreis](/person/karsten-kreis), Andreas Geiger, Sanja Fidler



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2110.03675)









[EditGAN: High-Precision Semantic Image Editing](/publication/2021-12_editgan-high-precision-semantic-image-editing)

Huan Ling, [Karsten Kreis](/person/karsten-kreis), Daiqing Li, Seung Wook Kim, Antonio Torralba, Sanja Fidler



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2111.03186)









[Alias-Free Generative Adversarial Networks](/publication/2021-12_alias-free-generative-adversarial-networks)

[Tero Karras](/person/tero-karras), [Miika Aittala](/person/miika-aittala), [Samuli Laine](/person/samuli-laine), Erik Härkönen, [Janne Hellsten](/person/janne-hellsten), Jaakko Lehtinen, [Timo Aila](/person/timo-aila)



[NeurIPS 2021](https://nips.cc/Conferences/2021)









[Don’t Generate Me: Training Differentially Private Generative Models with Sinkhorn Divergence](/publication/2021-11_don-t-generate-me-training-differentially-private-generative-models-sinkhorn)

Tianshi Cao, Alex Bie, [Arash Vahdat](/person/arash-vahdat), Sanja Fidler, [Karsten Kreis](/person/karsten-kreis)



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2111.01177)









[Score-based Generative Modeling in Latent Space](/publication/2021-11_score-based-generative-modeling-latent-space)

[Arash Vahdat](/person/arash-vahdat), [Karsten Kreis](/person/karsten-kreis), [Jan Kautz](/person/jan-kautz)



[Neural Information Processing Systems (NeurIPS) 2021](https://arxiv.org/abs/2106.05931)









[Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization](/publication/2021-06_semantic-segmentation-generative-models-semi-supervised-learning-and-strong-out)

Daiqing Li, Junlin Yang, [Karsten Kreis](/person/karsten-kreis), Antonio Torralba, Sanja Fidler



[IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021](https://arxiv.org/abs/2104.05833)









[VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models](/publication/2021-06_vaebm-symbiosis-between-variational-autoencoders-and-energy-based-models)

Zhisheng Xiao, [Karsten Kreis](/person/karsten-kreis), [Jan Kautz](/person/jan-kautz), [Arash Vahdat](/person/arash-vahdat)



[International Conference on Learning Representations (ICLR) 2021 (Spotlight)](https://arxiv.org/abs/2010.00654)









### 2020 

[Training Generative Adversarial Networks with Limited Data](/publication/2020-12_training-generative-adversarial-networks-limited-data)

[Tero Karras](/person/tero-karras), [Miika Aittala](/person/miika-aittala), [Janne Hellsten](/person/janne-hellsten), [Samuli Laine](/person/samuli-laine), [Jaakko Lehtinen](/person/jaakko-lehtinen), [Timo Aila](/person/timo-aila)



[NeurIPS 2020](https://nips.cc/Conferences/2020)









[Variational Amodal Object Completion](/publication/2020-12_variational-amodal-object-completion)

Huan Ling, David Acuna, [Karsten Kreis](/person/karsten-kreis), Seung Wook Kim, Sanja Fidler



[Neural Information Processing Systems (NeurIPS) 2020](https://papers.nips.cc/paper/2020/hash/bacadc62d6e67d7897cef027fa2d416c-Abstract.html)









[Neural FFTs for Universal Texture Image Synthesis](/index.php/publication/2020-12_neural-ffts-universal-texture-image-synthesis)

[Morteza Mardani](/index.php/person/morteza-mardani), Guilin Liu, Aysegul Dundar, Shiqiu Liu, Andrew Tao, Bryan Catanzaro



[NeurIPS 2020](https://proceedings.neurips.cc/paper/2020/hash/a23156abfd4a114c35b930b836064e8b-Abstract.html)









[Semi-Supervised StyleGAN for Disentanglement Learning](/publication/2020-07_semi-supervised-stylegan-disentanglement-learning)

Weili Nie, [Tero Karras](/person/tero-karras), Animesh Garg, Shoubhik Debnath, Anjul Patney, Ankit B. Patel, Anima Anandkumar



[International Conference on Machine Learning (ICML) 2020](https://icml.cc/virtual/2020)









[Analyzing and Improving the Image Quality of StyleGAN](/publication/2020-06_analyzing-and-improving-image-quality-stylegan)

[Tero Karras](/person/tero-karras), [Samuli Laine](/person/samuli-laine), [Miika Aittala](/person/miika-aittala), [Janne Hellsten](/person/janne-hellsten), Jaakko Lehtinen, [Timo Aila](/person/timo-aila)



[CVPR 2020](http://cvpr2020.thecvf.com/)









[SymGAN: Orientation Estimation without Annotation for Symmetric Objects](/publication/2020-03_symgan-orientation-estimation-without-annotation-symmetric-objects)

Phil Ammirato, [Jonathan Tremblay](/person/jonathan-tremblay), [Ming-Yu Liu](/person/ming-yu-liu), Alexander Berg, Dieter Fox



[WACV](https://wacv20.wacv.net/)









### 2019 

[A Style-Based Generator Architecture for Generative Adversarial Networks](/publication/2019-06_style-based-generator-architecture-generative-adversarial-networks)

[Tero Karras](/person/tero-karras), [Samuli Laine](/person/samuli-laine), [Timo Aila](/person/timo-aila)



[IEEE CVPR 2019](http://cvpr2019.thecvf.com/)









### 2018 

[MoCoGAN: Decomposing Motion and Content for Video Generation](/publication/2018-06_mocogan-decomposing-motion-and-content-video-generation)

Sergey Tulyakov, [Ming-Yu Liu](/person/ming-yu-liu), [Xiaodong Yang](/person/xiaodong-yang), [Jan Kautz](/person/jan-kautz)



[CVPR](https://arxiv.org/abs/1707.04993)









[High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs](/publication/2018-06_high-resolution-image-synthesis-and-semantic-manipulation-conditional-gans)

[Ting-Chun Wang](/person/ting-chun-wang), [Ming-Yu Liu](/person/ming-yu-liu), Jun-Yan Zhu, Andrew Tao, [Jan Kautz](/person/jan-kautz), Bryan Catanzaro



[CVPR](https://arxiv.org/abs/1711.11585)









[Progressive Growing of GANs for Improved Quality, Stability, and Variation](/publication/2018-04_progressive-growing-gans-improved-quality-stability-and-variation)

[Tero Karras](/person/tero-karras), [Timo Aila](/person/timo-aila), [Samuli Laine](/person/samuli-laine), Jaakko Lehtinen



[ICLR 2018](https://iclr.cc/Conferences/2018)









[Diffusion Texture Painting](/publication/_diffusion-texture-painting)

Anita Hu, Nishkrit Desai, Hassan Abu Alhaija, Seung Wook Kim, Masha Shugrina













[VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation](/publication/_vila-u-unified-foundation-model-integrating-visual-understanding-and-generation)















 

 



 ### Researchers

 

[Alán Aspuru-Guzik](/person/alan-aspuru-guzik)



[Alperen Degirmenci](/person/alperen-degirmenci)



[Arash Vahdat](/person/arash-vahdat)



[Boris Ivanovic](/person/boris-ivanovic)



[Can Zhao](/person/can-zhao)



[Chaowei Xiao](/person/chaowei-xiao)



[Chen-Hsuan Lin](/person/chen-hsuan-lin)



[Chia-Tung (Mark) Ho](/person/chia-tung-mark-ho)



[Chia-Wen Kuo](/person/chia-wen-kuo)



[Dvir Samuel](/person/dvir-samuel)



[Ekta Prashnani](/person/ekta-prashnani)



[Elie Aljalbout](/person/elie-aljalbout)



[Enze Xie](/person/enze-xie)



[Fangyin Wei](/person/fangyin-wei)



[Frank Wang](/person/frank-wang)



[Gal Chechik](/person/gal-chechik)



[Guan-Ting (Danny) Liu](/person/guan-ting-danny-liu)



[Guanzhi Wang](/person/guanzhi-wang)



[Haggai Maron](/index.php/person/haggai-maron)



[Hanrong Ye](/person/hanrong-ye)



[Huck Yang](/person/huck-yang)



[Hugo Hadfield](/index.php/person/hugo-hadfield)



[Jason Stock](/person/jason-stock)



[Jiaojiao Fan](/person/jiaojiao-fan)



[Jiaxiang Tang](/person/jiaxiang-tang)



[Jincheng Yu](/person/jincheng-yu)



[Jinwei Gu](/person/jinwei-gu)



[Josef Spjut](/person/josef-spjut)



[Julius Berner](/person/julius-berner)



[Karsten Kreis](/person/karsten-kreis)



[Koki Nagano](/person/koki-nagano)



[Ligeng Zhu](/person/ligeng-zhu)



[Max Zhaoshuo Li](/person/max-zhaoshuo-li)



[Miika Aittala](/person/miika-aittala)



[Peter Kocsis](/person/peter-kocsis)



[Peter Xenopoulos](/person/peter-xenopoulos)



[Qianli Ma](/person/qianli-ma)



[Sameer Dharur](/person/sameer-dharur)



[Samuli Laine](/person/samuli-laine)



[Saurav Muralidharan](/person/saurav-muralidharan)



[Scott Reed](/person/scott-reed)



[Shalini De Mello](/person/shalini-de-mello)



[Shengze Wang](/person/shengze-wang)



[Shuran Song](/person/shuran-song)



[Sifei Liu](/person/sifei-liu)



[Song Bian](/person/song-bian)



[Sung-Feng Huang](/person/sung-feng-huang)



[Tero Karras](/person/tero-karras)



[Tianyi Xie](/person/tianyi-xie)



[Timo Aila](/person/timo-aila)



[Tomas Geffner](/person/tomas-geffner)



[Wei-Cheng Tseng](/person/wei-cheng-tseng)



[Wenhao Ding](/person/wenhao-ding)



[Wenjie Luo](/person/wenjie-luo)



[Xiaodong Yang](/person/xiaodong-yang)



[Ximing Lu](/person/ximing-lu)



[Xin Kong](/person/xin-kong)



[Xuan Li](/person/xuan-li)



[Yashraj Narang](/person/yashraj-narang)



[Yatian Pang](/person/yatian-pang)



[Yin Cui](/person/yin-cui)



[Yonggan Fu](/person/yonggan-fu)



[Yongxin Chen](/person/yongxin-chen)



[Yu Zeng](/person/yu-zeng)



[Yuchao Gu](/person/yuchao-gu)



[Yukang Chen](/person/yukang-chen)



[Yuqi Xie](/person/yuqi-xie)



[Yuval Atzmon](/person/yuval-atzmon)



[Yuyang Zhao](/person/yuyang-zhao)



[Zekun Hao](/person/zekun-hao)



[Zhiding Yu](/person/zhiding-yu)



[Zhijian Liu](/person/zhijian-liu)