Guanlong Jiao1,4, Chenyangguang Zhang2, Jia Jun Cheng Xian1,4, Zewei Zhang1,3, Renjie Liao1,4,5
1The University of British Columbia, 2ETH Zürich, 3McMaster University, 4Vector Institute, 5Canada CIFAR AI Chair
Video Results are all shown in our Project Page.
StreamGVE is a training-free video editing framework built on few-step streaming video generation models. Instead of treating editing as data-to-data transformation, StreamGVE formulates editing as source-conditioned noise-to-target generation, thus makeing it possible for few-step fast controllable video editing. StreamGVE supports few-step text-driven video editing and optional first-frame visual prompting for videos of any length. It is implemented on streaming video generation models:
Self-Forcing_StreamGVE: StreamGVE built on Self Forcing.LongLive_StreamGVE: StreamGVE built on LongLive v1.0.
conda create -n streamgve python=3.12 -y
conda activate streamgve
# Choose the PyTorch command appropriate for your CUDA version, example for CUDA 12.8:
pip install torch==2.8.0 torchvision==0.23.0 --index-url https://download.pytorch.org/whl/cu128
pip install -r requirements.txt
pip install flash-attn --no-build-isolationFor the Self Forcing-based implementation, install the local package in editable mode:
cd Self-Forcing_StreamGVE
python setup.py develop
cd ..cd Self-Forcing_StreamGVE
huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B --local-dir-use-symlinks False --local-dir wan_models/Wan2.1-T2V-1.3B
huggingface-cli download gdhe17/Self-Forcing checkpoints/self_forcing_dmd.pt --local-dir .
cd ..cd LongLive_StreamGVE
huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B --local-dir wan_models/Wan2.1-T2V-1.3B
huggingface-cli download Efficient-Large-Model/LongLive --local-dir longlive_models
cd ..You can share the checkpoints of Wan through soft links, as they are the same across projects.
Each implementation provides a ready-to-run example script.
cd Self-Forcing_StreamGVE
bash inference_edit_streamgve.shcd LongLive_StreamGVE
bash inference_edit_streamgve.shThis repository builds on the excellent open-source work of Self Forcing, LongLive, and Wan2.1. We also thank UniEdit-Flow and FiVE-Bench for helpful open-soucre code and benchmarks.
If you find this work useful, please consider citing:
TODO
