Naive Scaling by yiliu30 · Pull Request #2211 · intel/neural-compressor

yiliu30 · 2025-05-15T08:47:08Z

Type of Change

feature or bug fix or documentation or validation or others
API changed or not

Description

detail description

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: yiliu30 <yi4.liu@intel.com>

Signed-off-by: yi <yi>

Copilot

Pull Request Overview

This PR introduces a naive scaling mechanism controlled by an environment variable.

Adds the INC_FORCE_NAIVE_SCALING flag in the environment utilities.
Bypasses additional scaling logic in the scale method factory and FP8 utility functions when the flag is enabled.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
neural_compressor/torch/utils/environ.py	Introduces a new environment variable flag to enforce naive scaling.
neural_compressor/torch/algorithms/fp8_quant/_core/scale_methods/scale_method_factory.py	Adds a conditional branch to bypass additional scaling configuration when naive scaling is enabled, including a warning log.
neural_compressor/torch/algorithms/fp8_quant/_core/fp_utils.py	Forces the backoff value to 1.0 and logs a warning when naive scaling is enabled during FP8 scale calculation.

neural_compressor/torch/utils/environ.py

neural_compressor/torch/algorithms/fp8_quant/_core/scale_methods/scale_method_factory.py

neural_compressor/torch/algorithms/fp8_quant/_core/fp_utils.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Depends on intel/neural-compressor#2211. Naive scaling: `backoff=1`, no `scaling_round_method` Full pile: use all samples(> 1024) in pile for calibration, https://huggingface.co/Yi30/inc-woq-2282samples-514-g2 Signed-off-by: yiliu30 <yi4.liu@intel.com>

* add naive scaling Signed-off-by: yiliu30 <yi4.liu@intel.com> Signed-off-by: Zhang, Yanli L <yanli.l.zhang@intel.com>

yiliu30 and others added 2 commits May 13, 2025 11:50

add INC_MEASUREMENT_DUMP_PATH_PREFIX

52b9429

Signed-off-by: yiliu30 <yi4.liu@intel.com>

add naive scaling

6818f00

Signed-off-by: yi <yi>

yiliu30 changed the title ~~Naive scaling~~ Naive Scaling May 15, 2025

Merge branch 'r1-woq' into naive-scaling

fe59720

yiliu30 requested a review from Copilot May 16, 2025 00:30

Copilot AI reviewed May 16, 2025

View reviewed changes

neural_compressor/torch/utils/environ.py Outdated Show resolved Hide resolved

neural_compressor/torch/algorithms/fp8_quant/_core/scale_methods/scale_method_factory.py Show resolved Hide resolved

neural_compressor/torch/algorithms/fp8_quant/_core/fp_utils.py Show resolved Hide resolved

yiliu30 mentioned this pull request May 16, 2025

[DeepSeek][G2] Use naive scaling and full pile HabanaAI/vllm-fork#1265

Merged

Update neural_compressor/torch/utils/environ.py

60dfb0c

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

yiliu30 merged commit 0bd4390 into r1-woq May 16, 2025
0 of 2 checks passed

yiliu30 deleted the naive-scaling branch May 16, 2025 08:26

Yanli2190 added a commit to Yanli2190/neural-compressor that referenced this pull request Jan 13, 2026

Naive Scaling (intel#2211)

dcd2e06

* add naive scaling Signed-off-by: yiliu30 <yi4.liu@intel.com> Signed-off-by: Zhang, Yanli L <yanli.l.zhang@intel.com>

yiliu30 mentioned this pull request Jan 16, 2026

Porting naive scaling #2387

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Naive Scaling#2211

Naive Scaling#2211
yiliu30 merged 4 commits intor1-woqfrom
naive-scaling

yiliu30 commented May 15, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yiliu30 commented May 15, 2025

Type of Change

Description

Expected Behavior & Potential Risk

How has this PR been tested?

Dependency Change?

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant