Add PIA Model/Pipeline #6698

DN6 · 2024-01-24T15:27:23Z

What does this PR do?

Adds the Personalized Image Animator (PIA) Model/Pipeline to diffusers

TODO:

Update docs with official PIA Motion Adapter checkpoint links (waiting on authors to respond on which org this checkpoint should live in)

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

a-r-r-o-w

Thank you so much for working on this @DN6 ❤️ The results look amazing from my testing. In comparison to the original repo, though, there seems to be more artifacts in videos in terms of motion or abrupt movements, not sure why. Applying freeinit seems to work great though.

PR closes #6494.

src/diffusers/pipelines/pia/pipeline_pia.py

yiyixuxu

super cool:)
I left some comments and questions.

for a future PR maybe we can think about how to refactor free_init. I really don't like the free_init as a pipeline feature - It makes the pipelines hard to read, and also inconsistent with our design. I think it has a similar nature to pipelines such as attend-and-excite that are modifying the denosing loop in some way in order to enhance the generations, so essentially it should be a different pipeline. Would love to hear your thoughts and ideas:)

src/diffusers/models/unets/unet_motion_model.py

src/diffusers/pipelines/pia/pipeline_pia.py

HuggingFaceDocBuilderDev · 2024-01-25T13:19:49Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

DN6 · 2024-01-25T13:38:42Z

Thank you so much for working on this @DN6 ❤️ The results look amazing from my testing. In comparison to the original repo, though, there seems to be more artifacts in videos in terms of motion or abrupt movements, not sure why. Applying freeinit seems to work great though.

@a-r-r-o-w Could you share a snippet I can use to test with?

a-r-r-o-w · 2024-01-26T01:20:08Z

@a-r-r-o-w Could you share a snippet I can use to test with?

For comparing PIA, I've been using this colab by camenduru. I believe the problem is not with the diffusers implementation but just usual quality problems with AnimateDiff from time to time.

To get similar results from both, passing generator=torch.Generator().manual_seed(seed) does not seem to work. Doing this before generation, however, is giving same/similar results:

torch.manual_seed(seed)
# np.random.seed(seed) # probably not needed

I suspect the reason is that the original codebase (which camenduru's colab uses) has statements like torch.randn_like(latents) that do not make use of a generator, and directly call manual_seed in app.py. After experimenting a bit more, it seems like I just had a lucky streak with animatediff when using the original codebase when I first tested?!

patrickvonplaten

Left mostly nits

docs/source/en/api/pipelines/pia.md

src/diffusers/pipelines/pia/pipeline_pia.py

tests/pipelines/pia/test_pia.py

docs/source/en/api/pipelines/pia.md

patrickvonplaten · 2024-01-31T12:20:53Z

src/diffusers/pipelines/pia/pipeline_pia.py

+        if self.free_init_enabled:
+            latents = self._free_init_loop(
+                height=height,
+                width=width,
+                num_frames=num_frames,
+                batch_size=batch_size,
+                num_videos_per_prompt=num_videos_per_prompt,
+                denoise_args=denoise_args,
+                device=device,
+            )
+        else:
+            latents = self._denoise_loop(**denoise_args)


Let's not forgot to clean up the design once the PR is merged (cc @yiyixuxu )

src/diffusers/pipelines/pia/pipeline_pia.py

patrickvonplaten

Still a couple things to improve here

patrickvonplaten · 2024-01-31T16:00:14Z

Ok to merge! Failing test is unrelated

* update * update * updaet * add tests and docs * clean up * add to toctree * fix copies * pr review feedback * fix copies * fix tests * update docs * update * update * update docs * update * update * update * update

DN6 added 7 commits January 23, 2024 11:04

update

b14881e

update

1169370

updaet

7be981c

add tests and docs

5025fc3

clean up

845d7f7

add to toctree

1ba867d

fix copies

8d1794d

DN6 requested a review from patrickvonplaten January 24, 2024 15:34

DN6 mentioned this pull request Jan 24, 2024

[refactor] FreeInit #6644

Closed

a-r-r-o-w approved these changes Jan 24, 2024

View reviewed changes

yiyixuxu reviewed Jan 25, 2024

View reviewed changes

pr review feedback

1c7b31a

DN6 added 2 commits January 25, 2024 13:27

fix copies

a93a5ba

fix tests

4c9244d

patrickvonplaten approved these changes Jan 26, 2024

View reviewed changes

DN6 added 4 commits January 30, 2024 11:57

update docs

b3e4161

update

e4535a5

update

8961802

update docs

7542799