Zero-Shot Video Translation via Token Warping

Supplementary Materials, More Results


Longer video results

Input (240 frames)
blue headphones, closed eyes
Input (240 frames)
dark forest, holy sword
Input (120 frames)
white hair, cartoon style


Optical flow results

Input (60 frames)
CG style
A handsome grandpa, white hair
Input
white, snow
Van Gogh style
Input
snow, scarf, cartoon style
cheongsam, scarf, cartoon style
Input
Input (52 frames)
Pink, CG style
Blue
Input
cartoon style
Input (80 frames)
cartoon style
Input
Input
cotton
sunflower
Input
cartoon style
White top
Input
Cartoon style
CG style
Input
marble sculpture
white ancient Greek sculpture


Appearance flow for large gap editing

Input
Condition
Pixar style
panda with moon


Flow in challenging cases

Single scene: our flow-based attention with aligned QKV can tolerate flow errors when the video contains a single object and a simple background.
Input
condition input
A white cat in pink background
backward flow
backward flow occlusion mask
Complex scene: the shopping bag changes color across frames, an optical flow failure caused by the scene change. We plan to address this in future work.
Input
condition input
white hair, white top and jeans, CG style
backward flow
occlusion mask
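
This page does not say how the backward flow occlusion masks shown above are computed. One common way to get such a mask is a forward-backward consistency check: follow the backward flow from each pixel of the current frame into the previous frame, read the forward flow there, and mark the pixel as occluded when the two flows do not roughly cancel. The sketch below assumes that approach. The function name `occlusion_mask`, the nearest-neighbour lookup, and the thresholds `alpha` and `beta` are illustrative choices, not taken from the paper.

```python
import numpy as np


def occlusion_mask(flow_bw, flow_fw, alpha=0.01, beta=0.5):
    """Forward-backward consistency check (illustrative sketch).

    flow_bw: (H, W, 2) flow from the current frame to the previous frame,
             stored as (dx, dy) per pixel.
    flow_fw: (H, W, 2) flow from the previous frame to the current frame.
    Returns a boolean (H, W) array, True where the pixel is treated as occluded.
    alpha and beta are assumed thresholds, not values from the paper.
    """
    h, w, _ = flow_bw.shape
    ys, xs = np.mgrid[0:h, 0:w]

    # Where each current-frame pixel lands in the previous frame.
    tx = np.rint(xs + flow_bw[..., 0]).astype(int)
    ty = np.rint(ys + flow_bw[..., 1]).astype(int)

    # Pixels whose target falls outside the frame have no valid match.
    inside = (tx >= 0) & (tx < w) & (ty >= 0) & (ty < h)
    tx = np.clip(tx, 0, w - 1)
    ty = np.clip(ty, 0, h - 1)

    # Forward flow sampled at the target (nearest neighbour for simplicity).
    fw_at_target = flow_fw[ty, tx]

    # For a consistent match the two flows cancel: flow_bw + flow_fw ~ 0.
    err = ((flow_bw + fw_at_target) ** 2).sum(axis=-1)
    mag = (flow_bw ** 2).sum(axis=-1) + (fw_at_target ** 2).sum(axis=-1)

    return (err > alpha * mag + beta) | ~inside
```

Pixels where the mask is True have no reliable correspondence in the previous frame, so a flow-guided method would typically avoid propagating information to them through the flow. In the complex-scene example the flow itself fails under the scene change, which is a case a purely geometric check like this one may not fully catch.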