Longer videos and textual inversions and fp16 autocast #25
base: main
Conversation
Thank you dajes for your contribution! I've tested the fp16 autocast on my 4090 and got roughly a 3.7x speedup: from 55 s per GIF down to 15 s per GIF.
Any chance you would be interested in figuring out how to add embeddings or the context stride to this repo? https://github.com/neggles/animatediff-cli
Actually, I was able to get it to work, never mind!
Just a reminder for users with an older Maxwell card like the Tesla M40: fp16 mode actually causes a roughly 3x slowdown instead of a 3x speedup, so use fp32 on Maxwell cards. Maxwell has no dedicated fp16 hardware. Found this out the hard way.
Is this PR merged anywhere?
AFAIK this technique is used in https://github.com/neggles/animatediff-cli and https://github.com/magic-research/magic-animate
Added the ability to choose separately the length of the video and the size of the context of the temporal attention module. By using a sliding window of attention it is now possible to generate arbitrarily long GIFs.

Sliding-window parameters (a scheduling sketch follows the list):
- `--L` - the length of the generated animation.
- `--context_length` - the length of the sliding window (limited by the motion module's capacity). Defaults to `L`.
- `--context_overlap` - how much neighbouring contexts overlap. Defaults to `context_length / 2`.
- `--context_stride` - `2^context_stride` is the maximum stride between two neighbouring frames. Defaults to 0.
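The flags above can be read as a scheduling problem over frame indices. Below is a minimal, illustrative sketch (not the PR's actual code) of how overlapping context windows might be generated; `sliding_contexts` is a hypothetical helper, and it ignores `--context_stride` for simplicity. In practice the per-window outputs over overlapping frames are typically blended or averaged.

```python
def sliding_contexts(num_frames, context_length=16, context_overlap=8):
    """Yield overlapping windows of frame indices covering [0, num_frames).

    Illustrative sketch only; a real scheduler would also handle
    --context_stride, which this simplified version ignores.
    """
    context_length = min(context_length, num_frames)
    step = max(context_length - context_overlap, 1)
    start = 0
    while True:
        end = min(start + context_length, num_frames)
        # Always yield a full-size window, shifted back at the tail.
        yield list(range(end - context_length, end))
        if end == num_frames:
            break
        start += step

# Example: a 32-frame animation (--L 32) processed with a 16-frame context
# and 8-frame overlap (--context_length 16 --context_overlap 8).
for window in sliding_contexts(32, context_length=16, context_overlap=8):
    print(window[0], "...", window[-1])  # 0...15, 8...23, 16...31
```

Each window is short enough for the motion module's temporal attention, while the overlap keeps neighbouring windows consistent with each other.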
Added support for `.pt` textual inversions from civit.ai, which should be put in the `models/embeddings` directory. I'm not entirely sure this implementation is fully correct, but it works fine for me.
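For reference, here is a rough sketch of how such a `.pt` embedding is commonly loaded into a CLIP tokenizer and text encoder (transformers-style API). This is an assumption about the usual file layout, not a copy of the PR's implementation; `load_pt_embedding` is a hypothetical helper, and actual `.pt` files vary.

```python
import torch

def load_pt_embedding(path, token, tokenizer, text_encoder):
    """Register a civit.ai-style .pt textual inversion under `token`.

    Assumption: the file either stores vectors under "string_to_param"
    (A1111-style) or is a plain tensor. Illustrative only.
    """
    data = torch.load(path, map_location="cpu")
    if isinstance(data, dict) and "string_to_param" in data:
        embeds = next(iter(data["string_to_param"].values()))
    else:
        embeds = data
    embeds = embeds.reshape(-1, embeds.shape[-1])  # (num_vectors, hidden_dim)

    # One pseudo-token per embedding vector: "token", "token_1", ...
    tokens = [token if i == 0 else f"{token}_{i}" for i in range(embeds.shape[0])]
    tokenizer.add_tokens(tokens)
    text_encoder.resize_token_embeddings(len(tokenizer))
    token_ids = tokenizer.convert_tokens_to_ids(tokens)

    weight = text_encoder.get_input_embeddings().weight
    with torch.no_grad():
        for tid, vec in zip(token_ids, embeds):
            weight[tid] = vec.to(weight.dtype)
    return " ".join(tokens)  # splice this string into the prompt
```

The returned string would then be used in the prompt wherever the inversion's trigger word appears.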
Inference now automatically uses `torch.autocast` with fp16 unless `--fp32` is specified. It sped things up by about 100% (roughly 2x) in my tests.
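The autocast behaviour amounts to wrapping the sampling call roughly like this (the `pipeline` argument and flag handling are placeholders, not the repo's exact code):

```python
import torch

def run_inference(pipeline, prompt, video_length, use_fp32=False):
    """Run the (placeholder) pipeline under fp16 autocast unless --fp32 is set."""
    with torch.autocast("cuda", dtype=torch.float16, enabled=not use_fp32):
        # CUDA matmuls and convolutions inside this block run in fp16 when
        # autocast is enabled; the model weights themselves can stay in fp32.
        return pipeline(prompt, video_length=video_length)
```

As noted in the thread above, on Maxwell-class GPUs without native fp16 support this can be slower than fp32, so passing `--fp32` is the better choice there.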