
Conversation

@Kaihui-intel
Contributor

Type of Change

bug fix

Description

Model: meta-llama/Llama-3.1-8B-Instruct
add_bos_token defaults to False. If the model was trained or fine-tuned with a BOS token, evaluating it without one may produce incorrect results.
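
For context, here is a minimal sketch (not part of this PR) of what the flag guards against: Llama 3.1 uses a <|begin_of_text|> BOS token during training, so prompts encoded without it differ from the training-time input format. This assumes the transformers library and access to the gated model repo.

```python
# Illustrative only: shows how the encoded prompt changes when BOS is missing.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("meta-llama/Llama-3.1-8B-Instruct")

text = "The capital of France is"
ids_plain = tok(text, add_special_tokens=False)["input_ids"]
ids_bos = [tok.bos_token_id] + ids_plain  # matches the training-time format

print("without BOS:", ids_plain[:4])
print("with BOS:   ", ids_bos[:4])
# If evaluation encodes prompts without BOS while the model was trained with it,
# perplexity / accuracy numbers can degrade, which is what --add_bos_token addresses.
```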

Expected Behavior & Potential Risk

With --add_bos_token passed, evaluation of BOS-trained models such as Llama-3.1-8B-Instruct should produce correct results.

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: Kaihui-intel <[email protected]>
Contributor

Copilot AI left a comment


Pull Request Overview

This PR introduces a fix by adding an "--add_bos_token" argument to support proper evaluation when the model is fine-tuned with a beginning-of-sequence token.

  • Added an argument for "add_bos_token" in the CLI parser
  • Modified the function call to include the "add_bos_token" parameter for evaluation
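
A hedged sketch of the shape of the change summarized above; apart from the --add_bos_token flag itself, the argument set and the evaluate() entry point are illustrative and not the PR's actual code.

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--model", type=str, default="meta-llama/Llama-3.1-8B-Instruct")
parser.add_argument(
    "--add_bos_token",
    action="store_true",
    help="Prepend the BOS token when encoding evaluation prompts; needed for "
         "models trained or fine-tuned with a BOS token (e.g. Llama 3.1).",
)
args = parser.parse_args()

# Hypothetical evaluation entry point: the flag is simply forwarded so the
# tokenizer reproduces the training-time input format.
# results = evaluate(model=args.model, add_bos_token=args.add_bos_token)
```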

Co-authored-by: Copilot <[email protected]>
Contributor

@xin3he xin3he left a comment


good catch!

@XuehaoSun
Contributor

[image attachment]

@XuehaoSun XuehaoSun merged commit 1158d32 into master Apr 25, 2025
11 checks passed
@XuehaoSun XuehaoSun deleted the kaihui/llama3_eval branch April 25, 2025 02:31