CLIP skip experiment on style coadapter #50
hafriedlander
started this conversation in
Show and tell
Replies: 1 comment
-
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I noticed sometimes when using style images that semantic elements from the style image would affect the result - for instance, a photo with many people would result in people in the output, even if the sketch image and prompt are for something else (i.e. a motorcycle).
I wondered if using earlier hidden layers from the clip image encoding would help.
The answer is "sort of". It's a sometimes useful technique, but also has some drawbacks.
Here's a couple of examples:
Starry night style, final layer results in many stars in the sky, penultimate layer reduces them while still giving a painterly style (but the colors shift too, undesirably)
Style source:
Final hidden layer:
Penultimate layer:
A random image from unsplash, when using the "Deliberate" model you get a lot of the houses from the style image appearing in the result. Penultimate still gives a similar style without repeating those elements. This is probably my best success so far with this technique:
Style source:
Final hidden layer:
Penultimate layer:
Beta Was this translation helpful? Give feedback.
All reactions