NVIDIA NVIDIA AI4Media NVIDIA SIL NVIDIA Research USC VGL

HorizonRelight: Relighting Long-horizon Videos Consistently via Diffusion Transformers

Jing Yang1,2 Mayoore Jaiswal1 Zian Wang1 Steven Zeng1 Rochelle Pereira1 Yajie Zhao2 Jianyuan Min1,*

1NVIDIA 2University of Southern California *Corresponding author

ECCV 2026

HorizonRelight relighting examples across long-horizon videos.
HorizonRelight delivers temporally consistent long-horizon relighting by propagating target-domain context across sliding-window chunks.

Overview Video

Abstract

Diffusion-based video relighting enables controllable relighting from a single input video, but modern video diffusion backbones are trained on short clips and applied to long-horizon videos through chunked sliding-window inference, often causing temporal discontinuities at chunk boundaries. We address this by reframing long-horizon relighting as temporally conditioned latent domain translation. Our framework enforces cross-chunk continuity by propagating target-domain latents across boundaries and makes this behavior learnable using masked target-domain self-conditioning, training the model to continue from temporally masked propagated context. We further introduce warm-start prompting with a relit prompt anchor from a controllable generative model, which establishes the initial target-domain state and creates a general interface for prompt-based relighting. Experiments on in-the-wild long-horizon videos show markedly improved temporal consistency, with chunk-boundary artifacts largely reduced and unwanted appearance changes across chunks greatly suppressed.

Method

Each sliding-window chunk is conditioned on target-domain latents propagated from the previous chunk, so the relit appearance is continued instead of re-inferred independently.

HorizonRelight diffusion renderer with propagated context.

Results

Prompt-based relighting and long-horizon continuation produce stable relit videos under diverse target lighting while preserving source content and temporal consistency.

Qualitative prompt-based relighting results from HorizonRelight.

BibTeX

@inproceedings{yang2026horizonrelight,
  title     = {HorizonRelight: Relighting Long-horizon Videos Consistently via Diffusion Transformers},
  author    = {Yang, Jing and Jaiswal, Mayoore and Wang, Zian and Zeng, Steven and Pereira, Rochelle and Zhao, Yajie and Min, Jianyuan},
  booktitle = {European Conference on Computer Vision (ECCV)},
  eprint    = {2606.29095},
  archivePrefix = {arXiv},
  url       = {https://arxiv.org/abs/2606.29095},
  year      = {2026}
}