NVIDIA Research
Shalini De Mello
CosAE: Learnable Fourier Series for Image Restoration
Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos
Dream-in-4D: A Unified Approach for Text- and Image-guided 4D Scene Generation
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning
RegionGPT: Towards Region Understanding Vision Language Model
Rendering Every Pixel for High-Fidelity Geometry in 3D GANs
3D Reconstruction with Generalizable Neural Fields using Scene Priors
Convolutional State Space Models for Long-Range Spatiotemporal Modeling
Generalizable One-shot Neural Head Avatar
Generative Novel View Synthesis with 3D-Aware Diffusion Models
Affordance Diffusion: Synthesizing Hand-Object Interactions
GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Zero-shot Pose Transfer for Unrigged Stylized 3D Characters
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
Efficient Geometry-aware 3D Generative Adversarial Networks
FreeSOLO: Learning to Segment Objects without Annotations
GroupViT: Semantic Segmentation Emerges From Text Supervision
Learning Contrastive Representation for Semantic Correspondence
Self-Supervised Object Detection via Generative Image Synthesis
Learning to Track Instances without Video Annotations
Weakly-Supervised Physically Unconstrained Gaze Estimation
Learning Continuous Environment Fields via Implicit Functions
Online Adaptation for Consistent Mesh Reconstruction in the Wild
Self-Supervised Single-View 3D Reconstruction via Semantic Consistency
Self-Supervised Viewpoint Learning from Image Collections