Dynamic weight conversion is recursive #44300
run-slow: colpali, colqwen2, emu3, ernie4_5_vl_moe, fuyu, gemma3, gemma3n, glm4v, qwen2_vl, maskformer, llava, sam3, qwen3_5 |
|
This comment contains models: ["models/colpali", "models/colqwen2", "models/emu3", "models/ernie4_5_vl_moe", "models/fuyu", "models/gemma3", "models/gemma3n", "models/glm4v", "models/llava", "models/maskformer", "models/qwen2_vl", "models/qwen3_5", "models/sam3"] |
CI Results
Model CI Report: ❌ 8 new failed tests from this PR 😭
|
|
run-slow: colpali, colqwen2, ernie4_5_vl_moe, fuyu, gemma3, gemma3n, glm4v, qwen2_vl, maskformer, llava, sam3, qwen3_5 |
|
This comment contains models: ["models/colpali", "models/colqwen2", "models/ernie4_5_vl_moe", "models/fuyu", "models/gemma3", "models/gemma3n", "models/glm4v", "models/llava", "models/maskformer", "models/qwen2_vl", "models/qwen3_5", "models/sam3"] |
|
Should be ready now. The failing qwen3-5-moe test is unrelated: it has been failing on main for a long time and is related to the expert implementation. |
CI Results
The test failure analysis could not be completed. Please check the workflow run for details. |
|
[For maintainers] Suggested jobs to run (before merge) run-slow: conditional_detr, detr, edgetam, fast_vlm, maskformer, pe_audio_video, pe_video, perception_lm |
|
View the CircleCI Test Summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=44300&sha=7e3d40 |
|
@bot /repo |
|
Repo. Consistency bot fixed some files and pushed the changes. |
|
@bot /style |
|
Style fix runs successfully without any file modified. |
|
View the CircleCI Test Summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=44300&sha=df86ff |
|
View the CircleCI Test Summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=44300&sha=ebcb04 |
|
View the CircleCI Test Summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=44300&sha=99f0a0 |
* split out from timm PR
* all other VLMs
* timm backbone is not here
* oops, extra key is breaking eveerything
* .
* this test
* maybe
* fix missing keys when loading from hub
* now fix fast tests
* merge gone wrong
* fix repo
* refine the regex again!
* close the bracket
* Apply suggestions from code review
  Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
* revert unrelated
* !
* revert more
* add submodule prefix when recursing
* i'll need to fix maskformer later
* dont duplicate the same pattern twice
* fix modular
* detr
* colpali isn't working still!
* oke, so this can be fine for now
* !
* revert
* dot lost in regex and comments
* timm wrapper is weird
* skip these, timm wrapper
* bye bye timm
* make repo check happy
* Revert "bye bye timm"
  This reverts commit ca68663.
* love timm!
* Apply repo consistency fixes
* oke, the bot can't fix it so here we go
---------
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
What does this PR do?
I need the recursive feature in #44252 so that the timm backbone can define its conversion only once. It also currently makes it possible to delete "t5gemma2" from the conversion, allowing its backbones to rename weights
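
To illustrate the idea, here is a rough sketch of how rename rules declared once on a backbone could be picked up recursively and anchored at the submodule prefix. This is not the code from this PR or the transformers API: `ToyModule`, `collect_renames`, `rename_key` and the example key names are all hypothetical, and real conversion rules likely carry more than a plain regex rename.

```python
import re
from dataclasses import dataclass, field


@dataclass
class ToyModule:
    """Hypothetical stand-in for a model module (not a transformers class)."""
    # Rename rules local to this module: regex pattern -> replacement.
    renames: dict = field(default_factory=dict)
    # Child modules keyed by attribute name.
    children: dict = field(default_factory=dict)


def collect_renames(module, prefix=""):
    """Walk the module tree and return (pattern, replacement) pairs,
    each anchored at the dotted prefix of the module that declared it."""
    rules = []
    for pattern, replacement in module.renames.items():
        # Assumes prefixes are plain dotted names (no backslashes).
        rules.append(("^" + re.escape(prefix) + pattern, prefix + replacement))
    for name, child in module.children.items():
        for rule in collect_renames(child, prefix + name + "."):
            if rule not in rules:  # do not register the same pattern twice
                rules.append(rule)
    return rules


def rename_key(key, rules):
    """Apply the first matching rule to a checkpoint key, else return it unchanged."""
    for pattern, replacement in rules:
        new_key, count = re.subn(pattern, replacement, key)
        if count:
            return new_key
    return key


# The backbone declares its rename once; the parent model declares nothing.
backbone = ToyModule(renames={r"stem\.conv": "embed.proj"})
model = ToyModule(children={"vision_tower": backbone, "language_model": ToyModule()})

rules = collect_renames(model)
print(rename_key("vision_tower.stem.conv.weight", rules))
print(rename_key("language_model.stem.conv.weight", rules))
```

In this toy version the parent never repeats the backbone's rule, and because each rule is anchored at its prefix, keys under other submodules are left alone.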