Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Anima Anandkumar
Latest
FB-BEV: BEV Representation from Forward-Backward View Transformations
Fast Sampling of Diffusion Models via Operator Learning
I^2SB: Image-to-Image Schrödinger Bridge
Vision Transformers Are Good Mask Auto-Labelers
VoxFormer: Sparse voxel transformer for camera-based 3D semantic scene completion
Prismer: A Vision-Language Model with An Ensemble of Experts
MinVIS: A minimal video instance segmentation framework without video-based training
Test-time prompt tuning for zero-shot generalization in vision-language models
Diffusion Models for Adversarial Purification
Panoptic SegFormer: Delving deeper into panoptic segmentation with transformers
M$^2$BEV: Multi-camera joint 3D detection and segmentation with unified birds-eye view representation
Controllable and Compositional Generation with Latent-Space Energy-Based Models
Coupled Segmentation and Edge Learning via Dynamic Graph Propagation
SegFormer: Simple and efficient design for semantic segmentation with transformers
DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision
Generating and Characterizing Scenarios for Safety Testing of Autonomous Vehicles
Angular Visual Hardness
Cite
×