Over++: Generative Video Compositing for Layer Interaction Effects

¹University of North Carolina at Chapel Hill    ²University of Maryland    ³Industrial Light & Magic
TL;DR: Generate environmental effects between any foreground and background layers.

Over++ enables effect generation and editing, with or without mask or prompt guidance. Explore our applications below.

I. Effect Generation

II. Effect Editing

III. Keyframe Masking

IV. Background Swapping



We introduce Over++, a framework for generating environmental effects and editing them through mask- or prompt-guided control. Our method trains on both paired and unpaired data, providing a unified framework that remains effective with limited training data: ≈50 paired real-world videos, 600 paired synthetic videos, and 500 unpaired synthetic videos (≈1.1K in total). This is significantly less than the ≈15K paired videos used in Force Prompting for force-guided video generation. Explore the sections below for more details:

Baseline Comparisons

Our Framework

Naively compositing the foreground over the background layer (copy-paste: $\mathcal{I}_{\text{over}} = \mathcal{I}_{\text{fg}} \oplus \mathcal{I}_{\text{bg}}$) produces a video that lacks environmental effects such as shadows or wakes. Given such an input composite and an optional binary mask ($\mathcal{M}_{\text{effect}}$) indicating the target effect regions, our model generates the desired effects within those regions.
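As a rough illustration of the naive input composite described above, here is a minimal NumPy sketch of the classic "over" operator. It assumes the foreground layer comes with an alpha matte and is not premultiplied; the function name `composite_over` and the array layout are hypothetical and are not taken from our implementation.

```python
import numpy as np


def composite_over(fg, alpha, bg):
    """Naive 'over' composite of a foreground layer on a background layer.

    Assumed layout (hypothetical, for illustration only):
      fg, bg: float arrays in [0, 1] with shape (..., H, W, 3)
      alpha:  float array in [0, 1] with shape (..., H, W, 1),
              the foreground matte (assumed to be available)
    The foreground is assumed NOT to be premultiplied by alpha.
    """
    fg = np.asarray(fg, dtype=np.float64)
    bg = np.asarray(bg, dtype=np.float64)
    alpha = np.asarray(alpha, dtype=np.float64)
    # Foreground where alpha is 1, background where alpha is 0.
    return alpha * fg + (1.0 - alpha) * bg


# Tiny two-frame example: a bright square pasted onto a black background.
fg = np.full((2, 4, 4, 3), 0.8)
bg = np.zeros((2, 4, 4, 3))
alpha = np.zeros((2, 4, 4, 1))
alpha[:, 1:3, 1:3, :] = 1.0
i_over = composite_over(fg, alpha, bg)
```

The result contains only the pasted foreground pixels and the untouched background. No shadow, reflection, or wake appears anywhere, which is the gap the model is meant to fill.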

Our method is trained on both paired and unpaired data. For unpaired data, we zero out the latent codes of $\mathcal{I}_{\text{over}}$ and $\mathcal{M}_{\text{effect}}$. (Text prompts $\mathcal{T}$ are not shown here for simplicity.)
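The zeroing step for unpaired data can be sketched as follows. This is an illustrative NumPy sketch under stated assumptions, not our training code: the helper names `zero_if_unpaired` and `build_conditions`, the `(B, C, T, H, W)` latent layout, and the per-sample `is_paired` flag are all hypothetical.

```python
import numpy as np


def zero_if_unpaired(latent, is_paired):
    """Zero the latent code of every sample whose flag is False.

    latent:    array of shape (B, ...), one latent code per sample
    is_paired: boolean array of shape (B,)
    """
    latent = np.asarray(latent)
    # Reshape the flags to (B, 1, 1, ...) so they broadcast over the latent.
    keep = np.asarray(is_paired, dtype=bool).reshape(
        (-1,) + (1,) * (latent.ndim - 1)
    )
    return np.where(keep, latent, np.zeros_like(latent))


def build_conditions(z_over, z_mask, is_paired):
    """Return the conditioning latents for a mixed paired/unpaired batch.

    z_over: latent code of the input composite, assumed shape (B, C, T, H, W)
    z_mask: latent code of the effect mask, assumed shape (B, C_m, T, H, W)
    Unpaired samples get all-zero conditions for both inputs.
    """
    return zero_if_unpaired(z_over, is_paired), zero_if_unpaired(z_mask, is_paired)


# Batch of two samples: the first is paired, the second is unpaired.
z_over = np.ones((2, 4, 1, 2, 2), dtype=np.float32)
z_mask = np.ones((2, 1, 1, 2, 2), dtype=np.float32)
c_over, c_mask = build_conditions(z_over, z_mask, is_paired=[True, False])
```

Handling both cases with one function keeps the batch shapes identical, so paired and unpaired samples can share a batch without any other change to the training loop.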

Training Data

Robustness

Failure Cases

Societal Impact

We acknowledge that powerful video editing tools, including ours, may raise ethical considerations depending on their context of use. While our work is intended to augment video compositing and professional workflows, such capabilities could potentially be misused. We therefore encourage responsible use aligned with community guidelines and emphasize transparency regarding any applied edits.

Acknowledgements

Thank you to all ILM staff who assisted in preparing this work, especially Miguel Perez Senent for the 3D boat and ocean elements used in Figure 3 (row 2) and Figure 6 (row 3), and ILM leaders Rob Bredow, Francois Chardavoine, and Greg Grusby for their assistance in clearing this work for publication.

BibTeX

@misc{qi2025overgenerativevideocompositing,
  title={Over++: Generative Video Compositing for Layer Interaction Effects}, 
  author={Luchao Qi and Jiaye Wu and Jun Myeong Choi and Cary Phillips and Roni Sengupta and Dan B Goldman},
  year={2025},
  eprint={2512.19661},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2512.19661}, 
}