fix text classification for datasets 4.0.0 by gplutop7 · Pull Request #2292 · huggingface/optimum-habana

gplutop7 · 2025-10-01T12:08:06Z

This PR updates the requirements to enforce evaluate>=0.4.5, as older versions of the library crash during metric computation when used together with datasets>=4.0.0.

After upgrading datasets from 3.6.0 to 4.0.0, evaluation on GLUE tasks started failing with evaluate==0.4.3 (and reproducible with 0.4.4). The issue occurs inside the simple_accuracy function of the GLUE metric:
AttributeError: 'bool' object has no attribute 'mean'

The problem was observed when running MRPC evaluation with Habana trainer:
"path": "/root/optimum-habana-fork/examples/text-classification", "command": "PT_HPU_LAZY_MODE=1 python3 run_glue.py --task_name mrpc --learning_rate 3e-5 --max_seq_length 128 --output_dir ./output/mrpc/ --use_hpu_graphs_for_inference --model_name_or_path bert-large-uncased-whole-word-masking --gaudi_config_name Habana/bert-large-uncased-whole-word-masking --do_eval --per_device_eval_batch_size 8 --use_habana --use_lazy_mode --throughput_warmup_steps 3 --sdp_on_bf16 "

Upgrading to evaluate>=0.4.5 resolves the crash and restores correct metric computation.

regisss

LGTM

…gface#736) Co-authored-by: Grzegorz Pluto-Prondzinski <gplutopx@habana.ai>

fix text classification for datasets 4.0.0

9062039

gplutop7 requested a review from regisss as a code owner October 1, 2025 12:08

regisss approved these changes Oct 1, 2025

View reviewed changes

regisss merged commit e1aa9f3 into huggingface:v1.20-release Oct 1, 2025
1 check was pending

regisss mentioned this pull request Oct 1, 2025

Fix(requirements): set evaluate>=0.4.5 to resolve GLUE crash with datasets>=4.0.0 #2291

Merged

gplutop7 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Oct 15, 2025

Fix text classification for datasets 4.0.0 (huggingface#2292) (huggin…

55856e3

…gface#736) Co-authored-by: Grzegorz Pluto-Prondzinski <gplutopx@habana.ai>

gplutop7 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Nov 6, 2025

Fix text classification for datasets 4.0.0 (huggingface#2292) (huggin…

2039379

…gface#736) Co-authored-by: Grzegorz Pluto-Prondzinski <gplutopx@habana.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix text classification for datasets 4.0.0#2292

fix text classification for datasets 4.0.0#2292
regisss merged 1 commit into
huggingface:v1.20-releasefrom
HabanaAI:main-text_classification_datasets_4_0_0_fix_v1_20

gplutop7 commented Oct 1, 2025

Uh oh!

regisss left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gplutop7 commented Oct 1, 2025

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants