Introduction
The 1st International Workshop on Interactive Physical AI (IPA 2026) at CVPR 2026 will bring together researchers from computer vision, robotics, and multimodal AI. It provides the first comprehensive forum to address the full scope of interactive physical AI systems, building on prior workshops that have explored subsets of this space. Workshop topics include (but are not limited to):
- Human-AI interaction in physical environments
- Embodied conversational AI and multimodal learning
- Full-duplex multimodal conversational models
- Social intelligence and communication for robots and avatars
- Egocentric vision and first-person perception
- Real-time audio-visual processing for interactive systems
- Safe and cooperative human-robot interaction
- Personalization and lifelong learning for physical AI
- Privacy-aware learning in interactive settings
- Physically authentic perception and generation for avatars and agents
We will host invited speakers and accept submissions of full, unpublished papers. Submissions will be peer-reviewed through a double-blind process; accepted papers will be published in the official CVPR 2026 workshop proceedings and presented at the workshop.
Advances in multimodal learning, embodied intelligence, and conversational AI are transforming how humans interact with AI systems situated alongside us in the physical world. We define such systems as Interactive Physical AI (IPA). IPA systems simultaneously:
- Perceive humans and scenes using audio-visual signals
- Generate communication signals via verbal and nonverbal behaviors (speech, prosody, backchannels, visual cues such as gaze and gestures)
- Act safely and effectively under physical-world constraints in shared spaces
Embodiments of IPA include:
- Robots (both humanoids and non-humanoids)
- Physically-grounded and environment-aware avatars (e.g., AR telepresence)
- On-device audio-visual agents
Call for Papers
Submission: We invite authors to submit unpublished papers (8-page CVPR format) to our workshop, to be presented at a poster session upon acceptance. All submissions will go through a double-blind review process. All contributions, along with any supplementary materials, must be submitted on OpenReview (link to be provided soon).
Accepted papers will be published in the official CVPR Workshops proceedings and the Computer Vision Foundation (CVF) Open Access archive.
Note: Authors of previously rejected main conference submissions are also welcome to submit their work to our workshop. When doing so, you must submit the previous reviewers' comments (named as previous_reviews.pdf) and a letter of changes (named as letter_of_changes.pdf) as part of your supplementary materials to clearly demonstrate the changes made to address the comments made by previous reviewers.
Important Dates
| Milestone | Date |
|---|---|
| Paper Submission Deadline | March 10, 2026 (23:59 PST) |
| Notification to Authors | March 24, 2026 |
| Camera-Ready Deadline | April 10, 2026 |
Tentative Schedule
- Wednesday, 3 June 2026 · 8:25 AM – 1:00 PM MDT
- Rooms 210 / 212
Note: The following schedule is tentative and will likely change before workshop day.
| Time in Denver (MDT) | Start time in UTC | Item |
|---|---|---|
| 8:25 – 8:30 | 3 Jun 2026 14:25:00 UTC | Opening Remarks |
| 8:30 – 9:10 | 3 Jun 2026 14:30:00 UTC | Oral Session A |
| 9:10 – 9:50 | 3 Jun 2026 15:10:00 UTC | Keynote Talk by Alexander Richard |
| 9:50 – 10:20 | 3 Jun 2026 15:50:00 UTC | Oral Session B |
| 10:20 – 10:30 | 3 Jun 2026 16:20:00 UTC | Coffee Break |
| 10:30 – 11:10 | 3 Jun 2026 16:30:00 UTC | Keynote Talk by Agon Serifi |
| 11:10 – 11:50 | 3 Jun 2026 17:10:00 UTC | Keynote Talk by Maja Matarić |
| 11:50 – 12:00 | 3 Jun 2026 17:50:00 UTC | Closing Remarks |
| 12:00 – 13:00 | 3 Jun 2026 18:00:00 UTC | Poster Presentation |
Keynote Speakers
Maja Matarić
- Professor at University of Southern California
- Principal Scientist at Google DeepMind
- Founding Director, Robotics and Autonomous Systems Center (RASC)
- Founding Director, Interaction Lab

Talk: The Challenges of Human-Centered AI and Robotics: What We Want, Need, and are Getting From Human-Machine Interaction
Accepted Full Papers
Peer-reviewed papers accepted to IPA 2026 through our double-blind review process. Each will be presented as a 10-minute oral talk in Oral Session A, followed by a poster during the poster session.
Invited CVPR Papers
We are honoured to host a selection of CVPR 2026 main-conference papers whose contributions advance interactive physical AI.
Oral Presentations
Poster Presentations
Organizers
NVIDIA
NVIDIA
NVIDIA
Carnegie Mellon University
NVIDIA
