MedPart: A Multi-Level Evolutionary Differentiable Hypergraph Partitioner

State-of-the-art hypergraph partitioners, such as hMETIS, usually adopt a multi-level paradigm for efficiency and scalability. However, they are prone to getting trapped in local minima due to their reliance on refinement heuristics and overlooking global structural information during coarsening. SpecPart, the most advanced academic hypergraph partitioning refinement method, improves partitioning by leveraging spectral information. Still, its success depends heavily on the quality of initial input solutions.

FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

We present FoundationPose, a unified foundation model for 6D object pose estimation and tracking, supporting both model-based and model-free setups. Our approach can be instantly applied at test-time to a novel object without fine-tuning, as long as its CAD model is given, or a small number of reference images are captured. We bridge the gap between these two setups with a neural implicit representation that allows for effective novel view synthesis, keeping the downstream pose estimation modules invariant under the same unified framework.

A 0.190-pJ/bit 25.2-Gb/s/wire Inverter-Based AC-Coupled Transceiver for Short-Reach Die-to-Die Interfaces in 5-nm CMOS

This article presents an inverter-based short-reach ac-coupled toggle (ISR-ACT) link targeted for short-reach die-to-die communication over silicon interposer or similar high-density interconnect. The ISR-ACT’s transmitter (TX) sends non-return-to-zero (NRZ) data through a small series capacitor to inject low-swing pulses into the line. These pulses are amplified and latched by a two-stage receiver (RX), where the 1st-stage transimpedance amplifier (TIA) amplifies the pulse data and positive feedback around both stages captures the data and maintains the dc level on the line.

XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

We present XCube, a novel generative model for high-resolution sparse 3D voxel grids with arbitrary attributes. Our model can generate millions of voxels with a finest effective resolution of up to $1024^3$ in a feed-forward fashion without time-consuming test-time optimization. To achieve this, we employ a hierarchical voxel latent diffusion model which generates progressively higher resolution grids in a coarse-to-fine manner using a custom framework built on the highly efficient VDB data structure.

Yunsheng Bai

Yunsheng Bai earned his PhD in Computer Science from the University of California, Los Angeles (UCLA) in June 2023, under the guidance of Professors Yizhou Sun and Wei Wang, and in collaboration with Professor Jason Cong's team. His research is dedicated to developing innovative approaches for graph-related tasks, with a keen interest in graph neural networks and large language models applied to graph-level tasks, including graph similarity, graph matching, and HLS design modeling.

Scott Reed

I am a principal research scientist at NVResearch on the Generalist Embodied Agent Research (GEAR) team. My research goal is to build generalist agents that can help humans in the real (including physical) world. Previously I worked on control and generative models at Google DeepMind since 2016. I completed my PhD under Professor Honglak Lee at the University of Michigan in 2016. 

Rachel Luo

Rachel Luo is a Research Scientist in the Autonomous Vehicles Group at NVIDIA Research. She works on improving the safety and reliability of autonomous systems. Her research interests lie at the intersection of machine learning, computer vision, and robotics, and include topics in uncertainty quantification, distribution shift, and foundation models. Prior to joining Nvidia, Rachel completed her PhD and MS in Electrical Engineering at Stanford University, and her BS in Electrical Engineering and Computer Science at MIT.

Yongxin Chen

I am a research scientist at Nvidia. I also hold the position of Associate Professor at Georgia Institute of Technology. I obtained my PhD degree from University of Minnesota in 2016. My research interests encompass machine learning, control theory, optimal transport, optimization, Markov Chain Monte Carlo, and robotics.

Graph Metanetworks for Processing Diverse Neural Architectures

Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of accounting for the symmetries and geometry of parameter spaces. However, those works developed architectures tailored to specific networks such as MLPs and CNNs without normalization layers, and generalizing such architectures to other types of networks can be challenging.

ATT3D: Amortized Text-To-3D Object Synthesis

Text-to-3D modeling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimization to create 3D objects. To address this, we amortize optimization over text prompts by training on many prompts simultaneously with a unified model, instead of separately. With this, we share computation across a prompt set, training in less time than per-prompt optimization.