A Cog implementation, with txt2vid and vid2vid support, of:
- https://huggingface.co/cerspense/zeroscope_v2_XL
- https://huggingface.co/cerspense/zeroscope_v2_576w
- https://huggingface.co/camenduru/potat1

Deployed at https://replicate.com/anotherjesse/zeroscope-v2-xl

Credits:
- Text-To-Video-Finetuning - Finetune ModelScope's Text To Video model using Diffusers
- Showlab and [bryandlee](https://github.com/bryandlee/Tune-A-Video) for their Tune-A-Video contribution that made this much easier.
- lucidrains for their implementations around video diffusion.
- cloneofsimo for their diffusers implementation of LoRA.
- kabachuha for their conversion scripts, training ideas, and webui work.
- JCBrouwer for inference implementations.
- sergiobr for helpful ideas and bug fixes.
- cjwbw/damo-text-to-video for the original Cog implementation.