Zeroscope V2 XL (txt2video)

Stop! These models are not for txt2img inference!

Don’t put them in your stable-diffusion-webui/models directory and expect to make images!

So what are these?

These are new ModelScope-based models for txt2video, optimized to produce 16:9 video compositions. They were trained on 9,923 video clips and 29,769 tagged frames at 24 fps and 1024×576 resolution.

Note that these are the bigger brothers to the Zeroscope V2 576w models (https://civitai.com/models/96454/zeroscope-v2-576w-txt2video). The XL models use 15.3GB of VRAM when rendering 30 frames at 1024×576.

Where do they go?

Drop them in the \stable-diffusion-webui\models\ModelScope\t2v folder.

After downloading, you must rename text2video_pytorch_model.pt so that it has the .pth extension.

The files must be named open_clip_pytorch_model.bin and text2video_pytorch_model.pth.
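The rename and the file-name check can also be scripted. The following is only a minimal sketch, not part of the original instructions: the function name prepare_t2v_folder is hypothetical, and the usage lines assume the script is run from the directory that contains stable-diffusion-webui.

```python
from pathlib import Path

# File names the t2v folder must contain, as stated above.
REQUIRED_FILES = ("open_clip_pytorch_model.bin", "text2video_pytorch_model.pth")


def prepare_t2v_folder(t2v_dir):
    """Rename the downloaded .pt checkpoint to .pth and list missing files.

    Hypothetical helper for illustration; it is not part of any extension.
    """
    t2v_dir = Path(t2v_dir)
    downloaded = t2v_dir / "text2video_pytorch_model.pt"
    target = t2v_dir / "text2video_pytorch_model.pth"

    # Rename only if the .pt file exists and no .pth file would be overwritten.
    if downloaded.exists() and not target.exists():
        downloaded.rename(target)

    # Report any required file that is still absent from the folder.
    return [name for name in REQUIRED_FILES if not (t2v_dir / name).exists()]


if __name__ == "__main__":
    # Assumption: the WebUI lives in ./stable-diffusion-webui; adjust as needed.
    webui_root = Path("stable-diffusion-webui")
    missing = prepare_t2v_folder(webui_root / "models" / "ModelScope" / "t2v")
    print("Missing files:", missing if missing else "none")
```

An empty result means both files are in place under the expected names; any name still listed has to be downloaded or renamed by hand.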

Who made them? Original Source?

https://huggingface.co/cerspense/zeroscope_v2_XL

What else do I need?

These models are specifically for use with the txt2video Auto1111 WebUI extension.

