Fix gradient checkpointing warning filter implementation by rolandtannous · Pull Request #97 · unslothai/unsloth-zoo

rolandtannous · 2025-03-24T15:45:51Z

Fix Logger Filter Implementation for Gradient Checkpointing Warnings

Description

This PR fixes a bug in the unsloth_compile_transformers method in compiler.py that causes an AttributeError when trying to suppress gradient checkpointing warnings. The current implementation incorrectly assumes that the model file has a logger attribute, but logger instances are typically module-level variables, not attributes.

Changes

Replaced the problematic exec() call that was looking for modeling_file.logger
Implemented a proper logging filter that gets the correct logger instance via the transformers logging system
Ensures the filter works across different model architectures (Gemma, Mistral, etc.)

Bug Details

The current code attempts to suppress warnings with:

exec("modeling_file.logger.addFilter(HideLoggingMessage('Setting `use_cache=False`'))", globals(), locals())

This fails with

AttributeError: module 'transformers.models.mistral3.modeling_mistral3' has no attribute 'logger'` because the logger is a module-level variable, not a model attribute.

Solution

The fix gets the appropriate logger directly from the transformers logging module and applies a filter to target only the specific gradient checkpointing warning message.

Testing

Verified the solution works with:

Gemma 3 models
Mistral 3 models

Related Issues

Fixes:

unsloth-zoo issue #90
unsloth issue #2146

rolandtannous · 2025-03-25T09:38:55Z

This is a bug , resulting in a runtime error and actually preventing most users from even loading models using FastLanguageModel.from_pretrained. Example:

model, tokenizer = FastModel.from_pretrained(
    model_name = "mistralai/Mistral-Small-3.1-24B-Instruct-2503",
    max_seq_length = 2048, # Choose any for long context!
    load_in_4bit = False,  # 4 bit quantization to reduce memory
    load_in_8bit = False, # [NEW!] A bit more accurate, uses 2x memory
    full_finetuning = False, # [NEW!] We have full finetuning now!
    # token = "hf_...", # use one if using gated models
)

results in the runtime error:

AttributeError: module 'transformers.models.mistral3.modeling_mistral3' has no attribute 'logger'

and prevents most people from progressing forward.

Same thing happens with other models like Gemma3.

SIde note: Oddly enough Transformers implementation of modeling_mistral3 isn't even importing the logging module at all to begin with.. This is not the reason for the runtime error though.

The underlying code in compiler.py throwing the runtime error isn't even trivial. It is just trying to suppress a category of logs by filtering it out to un-clutter the console.

Fix gradient checkpointing warning filter implementation

a62e4c6

rolandtannous mentioned this pull request Mar 25, 2025

Error finetuning: cannot import name 'CompileConfig' from 'transformers' unslothai/unsloth#2180

Closed

shimmyshimmer changed the base branch from main to nightly March 25, 2025 09:57

shimmyshimmer merged commit 454757c into unslothai:nightly Mar 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix gradient checkpointing warning filter implementation#97

Fix gradient checkpointing warning filter implementation#97
shimmyshimmer merged 1 commit into
unslothai:nightlyfrom
rolandtannous:fix/suppress-gradient-checkpointing-warning

rolandtannous commented Mar 24, 2025

Uh oh!

rolandtannous commented Mar 25, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rolandtannous commented Mar 24, 2025

Fix Logger Filter Implementation for Gradient Checkpointing Warnings

Description

Changes

Bug Details

Solution

Testing

Related Issues

Uh oh!

rolandtannous commented Mar 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rolandtannous commented Mar 25, 2025 •

edited

Loading