Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design

An agent's functionality is largely determined by its design, i.e., skeletal structure and joint attributes (e.g., length, size, strength). However, finding the optimal agent design for a given function is extremely challenging since the problem is inherently combinatorial and the design space is prohibitively large. Additionally, it can be costly to evaluate each candidate design which requires solving for its optimal controller. To tackle these problems, our key idea is to incorporate the design procedure of an agent into its decision-making process.

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras

We present an approach for 3D global human mesh recovery from monocular videos recorded with dynamic cameras. Our approach is robust to severe and long-term occlusions and tracks human bodies even when they go outside the camera's field of view. To achieve this, we first propose a deep generative motion infiller, which autoregressively infills the body motions of occluded humans based on visible motions. Additionally, in contrast to prior work, our approach reconstructs human meshes in consistent global coordinates even with dynamic cameras.

Ye Yuan

Ye Yuan is a research scientist at NVIDIA Research. He received his Ph.D. in Robotics from Carnegie Mellon University (CMU) in 2022, where he worked with Prof. Kris Kitani. His research lies at the intersection of computer vision, robotics, and machine learning. He is particularly interested in simulation, reinforcement learning, 3D computer vision, generative models, embodied agents, and digital humans.

Bart Wronski

Bart has joined NVIDIA in 2022, coming from Google Research, where he spent 5 years working on computational photography, image processing, and machine learning.

Before that, he worked for over 8 years in various roles in video games industry (CD Projekt RED, Ubisoft Montreal, Sony Santa Monica) on graphics, engine and tooling architecture, content pipelines, and technical direction.

His main research interests are combining traditional algorithms with machine learning, and improving / automating content creation pipelines to empower artists and creative users.

Chen Tessler

I am a research scientist at the Tel-Aviv research lab (Israel). I am interested in challenges within the realm of decision making (RL, planning, control) that arise from real world challenges.

I received my B.Sc. and Ph.D. from the Technion Institute of Technology, where my research focused on the intricate connection between theory and practice in reinforcement learning.

HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers

We introduce a new simulation benchmark "HandoverSim" for human-to-robot object handovers. To simulate the giver's motion, we leverage a recent motion capture dataset of hand grasping of objects. We create training and evaluation environments for the receiver with standardized protocols and metrics. We analyze the performance of a set of baselines and show a correlation with a real-world evaluation.