Download
1 variant available
image used to create example video: https://civitai.com/images/123232086
WAN 2.2 14B I2V Workflow with SVI 2.0 (Long Video Generation)
I couldn't find a workflow based on ComfyUI-WanVideoWrapper nodes that uses the SVI 2.0 LoRA, so I decided to create my own.
This workflow includes the following optimizations:
Block swap
TeaCache
TorchCompile
Cached prompts, text embeddings, and image embeddings
RifleXRope frames interpolation in the sampler
SageAttention
Smart image resizing to ensure the first frame resolution is divisible by 32 (better resolution for WAN 2.2)
ApplyNAG (enable negative prompt for cfg=1.0)
Feel free to clone 'Extra segment' subgraph to increase total video length.
Tested on:
Kubuntu 25.10 (kernel 6.18)
ComfyUI 16.4
Pytorch 2.10+rocm7.1; Triton 3.6.0; SageAttention-1.0.6 (for ROCm)
AMD RX 6700XT 12 Gb
32 Gb RAM
P.S. (not included in this workflow) If you want to double your output video FPS, you can add RIFE VFI node before VHS Video Combine node, then change frame_rate in VHS Video Combine node
π Prerequisite
At least 12 Gb VRAM
SageAttention installed and enabled
ROCm 7.1 / CUDA 12+ compatible system
π₯ Model links:
π¬ WAN:
(CivitAI) DaSiWa-WAN 2.2 I2V 14B BoundBite v10 | Lightspeed | GGUF - Q4 High
(CivitAI) DaSiWa-WAN 2.2 I2V 14B BoundBite v10 | Lightspeed | GGUF - Q4 Low
You can try Q5 or even Q6 versions
πΌοΈ WAN Vae:
π T5 Text encoder
You can use any T5 encoder, but don't use 'scaled' versions
ποΈ Tae for WAN 2.1 (for previews, optional)
βΎοΈ SVI 2.0 Pro LoRA
(Hugging Face) Kijai/WanVideo_comfy - SVI_v2_PRO_Wan2.2-I2V-A14B_HIGH_lora_rank_128_fp16.safetensors
(Hugging Face) Kijai/WanVideo_comfy - SVI_v2_PRO_Wan2.2-I2V-A14B_LOW_lora_rank_128_fp16.safetensors
π Folder structure
/path_to_ComfyUI/
βββ models/
β βββ unet/
β β βββ DasiwaWAN22I2V14BSynthseduction_q4High.gguf
β β βββ DasiwaWAN22I2V14BSynthseduction_q4Low.gguf
β βββ text_encoders/
β β βββ umt5-xxl-enc-fp8_e4m3fn.safetensors
β βββ vae/
β β βββ wan_2.1_vae.safetensors
β βββ vae_approx/
β β βββ taew2_1.safetensors
β βββ loras/
β βββ SVI_v2_PRO_Wan2.2-I2V-A14B_HIGH_lora_rank_128_fp16.safetensors
β βββ SVI_v2_PRO_Wan2.2-I2V-A14B_LOW_lora_rank_128_fp16.safetensorsπ¦ Used nodes
π Performance
For an image 480x608@49 (16x3+1) on AMD RX 6700XT 12 GB (AOTRITON+SageAttn+TunableOps)
First run: ~30-35 min (compilation + tuning) first segment
Subsequent runs: ~5-6 min per segment
β οΈ Known Issues & Fixes
OOM: Increase Swap block parameter, decrease Image resolution, FPS, Seconds
System freeze: Disable
fun_or_fl2v_model(if enabled); Change linux kernel to 6.18 or higherFirst run slow: Normal, compilation takes time for every new parameter set (resolution, frames, etc), especially when TUNABLE_OPS enabled.
