Anole add model by zucchini-nlp · Pull Request #36047 · huggingface/transformers

zucchini-nlp · 2025-02-05T11:45:32Z

What does this PR do?

Adds Anole as a new model, a new PR based on #32013

zucchini-nlp · 2025-02-05T15:26:32Z

This PR is ready. One thing to note is that image generation quality is very random, and even with the CFG we have it is not the best. The original repo is neither as good as the latest image generation models, and they have a slighly different CFG for instruct-based models

I can take a look at trying to match at least the original repo quality, but seems like Anole is not top model anymore. Their advantage was in having interleaved generation possible, I believe Janus also support it now given that it is one model doing both modalities.

So now I am not sure if it is still worth shipping Anole or not? @ArthurZucker WDYT?

HuggingFaceDocBuilderDev · 2025-02-05T15:55:33Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker · 2025-02-13T09:20:10Z

Up to you, but if janus is easier to add we should prob add it rather than this one! Unless logic is the same!
Sorry that you had to spend time on it 😾

zucchini-nlp · 2025-02-13T13:19:19Z

Yeah, same feeling here, isn't worth maintaining but this might help @yaswanth19 with Janus shipping

ArthurZucker · 2025-02-13T16:27:33Z

Okay I'll review when I have time then!

yaswanth19 · 2025-02-16T05:35:59Z

src/transformers/models/anole/modeling_anole.py

+        return hidden_states
+
+
+class AnoleVQVAEEncoderResnetBlock(nn.Module):


@zucchini-nlp Am I missing something or is AnoleVQVAEEncoderResnetBlock similar to AnoleVQVAEResnetBlock I can see it is inherited from ChameleonVQVAEEncoderResnetBlock so while unravelling a duplicate block is created because we are not overwriting the Encoder part withAnoleVQVAEResnetBlock 🤔

Yep, completely identical modules. I just wanted didn't want to use a module with encoder prefix while decoding. So in the modular I added a general Resnet inherited from EncoderResnet

yaswanth19 · 2025-02-16T06:00:45Z

src/transformers/models/anole/modeling_anole.py

+        # compute in_ch_mult, block_in and curr_res at lowest res
+        block_in = base_channels * config.channel_multiplier[self.num_resolutions - 1]
+        curr_res = resolution // 2 ** (self.num_resolutions - 1)
+        self.z_shape = (1, latent_channels, curr_res, curr_res)


Just a query ? What is the use of curr_res and also we are not using self.z_shape anywhere else.

nope, might have forgotten to remove after a small refactor

zucchini-nlp added 12 commits February 4, 2025 13:58

add only anole and delete chameleon changes

c99da7b

small update

2414640

tests

10f551e

fix copies

1b40fae

Merge remote-tracking branch 'upstream/main' into anole-add-model

9e22b9f

remove this

2a2e44c

happy CI, hopefully

1536494

isort + docs

8847135

no autoclass here

1f9909e

dont skip test because we have no pixels

cc42ef1

why i removed pixels, stupid decision

bb4ac95

Merge branch 'main' into anole-add-model

4b04981

zucchini-nlp requested a review from ArthurZucker February 5, 2025 15:26

zucchini-nlp changed the title ~~[WIP] Anole add model~~ Anole add model Feb 5, 2025

fix copies

73ec7d6

Merge branch 'main' into anole-add-model

db7a16d

yaswanth19 reviewed Feb 16, 2025

View reviewed changes

zucchini-nlp closed this Feb 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anole add model#36047

Anole add model#36047
zucchini-nlp wants to merge 14 commits intohuggingface:mainfrom
zucchini-nlp:anole-add-model

zucchini-nlp commented Feb 5, 2025

Uh oh!

zucchini-nlp commented Feb 5, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2025

Uh oh!

ArthurZucker commented Feb 13, 2025

Uh oh!

zucchini-nlp commented Feb 13, 2025

Uh oh!

ArthurZucker commented Feb 13, 2025

Uh oh!

yaswanth19 Feb 16, 2025 •

edited

Loading

Uh oh!

zucchini-nlp Feb 16, 2025

Uh oh!

yaswanth19 Feb 16, 2025 •

edited

Loading

Uh oh!

zucchini-nlp Feb 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		return hidden_states


		class AnoleVQVAEEncoderResnetBlock(nn.Module):

Conversation

zucchini-nlp commented Feb 5, 2025

What does this PR do?

Uh oh!

zucchini-nlp commented Feb 5, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2025

Uh oh!

ArthurZucker commented Feb 13, 2025

Uh oh!

zucchini-nlp commented Feb 13, 2025

Uh oh!

ArthurZucker commented Feb 13, 2025

Uh oh!

yaswanth19 Feb 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp Feb 16, 2025

Choose a reason for hiding this comment

Uh oh!

yaswanth19 Feb 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp Feb 16, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

yaswanth19 Feb 16, 2025 •

edited

Loading

yaswanth19 Feb 16, 2025 •

edited

Loading