Assessing Learned Models for Phase-only Hologram Compression

We evaluate the performance of four common learned models utilizing INR and VAE structures for compressing phase-only holograms in holographic displays. The evaluated models include a vanilla MLP, SIREN [Sitzmann et al. 2020], and FilmSIREN [Chan et al.

Jerome Gonthier

Jerome got his PhD in theoretical chemistry from EPFL (Lausanne, Switzerland) in 2013. He then moved to the US for a first post-doctoral appointment at GeorgiaTech, and then a second one at UC Berkeley starting in 2016. During this time, he worked to develop methods to better understand intermolecular interactions from first principles. In 2019, he moved to the quantum computing industry, working at Zapata Computing in the professional services organization to help industry customers understand the potential impact of quantum computers for chemistry simulations.

Jiaxiang Tang

Jiaxiang Tang is a research scientist at NVIDIA Research. He finished his Ph.D. at Peking University, advised by Prof. Gang Zeng. His research interest is in 3D Computer Vision, with a specialty in 3D reconstruction and generation.

Personal Website: https://me.kiui.moe/

Generative Detail Enhancement for Physically Based Materials

We present a tool for enhancing the detail of physically based materials using an off-the-shelf diffusion model and inverse rendering. Our goal is to enhance the visual fidelity of materials with detail that is often tedious to author, by adding signs of wear, aging, weathering, etc. As these appearance details are often rooted in real-world processes, we leverage a generative image model trained on a large dataset of natural images with corresponding visuals in context. Starting with a given geometry, UV mapping, and basic appearance, we render multiple views of the object.

Radiance Surfaces: Optimizing Surface Representations with a 5D Radiance Field Loss

We present a fast and simple technique to convert images into a radiance surface-based scene representation. Building on existing radiance volume reconstruction algorithms, we introduce a subtle yet impactful modification of the loss function requiring changes to only a few lines of code: instead of integrating the radiance field along rays and supervising the resulting images, we project the training images into the scene to directly supervise the spatio-directional radiance field.

Spec2RTL-Agent: Automated Hardware Code Generation from Complex Specifications Using LLM Agent Systems

Despite recent progress in generating hardware RTL code with LLMs, existing solutions still suffer from a substantial gap between practical application scenarios and the requirements of real-world RTL code development. Prior approaches either focus on overly simplified hardware descriptions or depend on extensive human guidance to process complex specifications, limiting their scalability and automation potential.

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

Neural speech editing advancements have raised concerns about their misuse in spoofing attacks. Traditional partially edited speech corpora primarily focus on cut-and-paste edits, which, while maintaining speaker consistency, often introduce detectable discontinuities. Recent methods, like A^{3}T and Voicebox, improve transitions by leveraging contextual information. To foster spoofing detection research, we introduce the Speech INfilling Edit (SINE) dataset, created with Voicebox. We detailed the process of re-implementing Voicebox training and dataset creation.