Support hf generate #12477

hkvision · 2024-12-02T07:08:34Z

original generate -> simple_generate
current generate -> hf generate

jason-dai · 2024-12-02T12:52:32Z

Please also test other models.

hkvision · 2024-12-02T13:05:43Z

Please also test other models.

Sure.

python/llm/src/ipex_llm/transformers/npu_models/convert.py

hkvision · 2024-12-04T07:37:25Z

Current usage:

Default to use simple=True, if want to benchmark, need to add do_print=True
If change to simple=False, will change to hf generate. In this way, can't be used together with BenchmarkWrapper, because BenchmarkWrapper's generate doesn't have the simple argument, will have the following error:

File "C:\Users\arda\miniforge3\envs\kai-acc-lib\lib\site-packages\ipex_llm\utils\benchmark_util_4_29.py", line 1180, in _validate_model_kwargs
raise ValueError(
ValueError: The following model_kwargs are not used by the model: ['simple'] (note: typos in the generate arguments will also show up in this list)

If want to benchmark hf generate performance, just add BenchmarkWrapper without specifying simple=True when generate.

@jason-dai

jason-dai · 2024-12-04T07:46:30Z

If change to simple=False, will change to hf generate. In this way, can't be used together with BenchmarkWrapper, because BenchmarkWrapper's generate doesn't have the simple argument, will have the following error:

Can we make simple a kwarg instead?

hkvision · 2024-12-04T07:47:15Z

Won't impact current all-in-one, in all-in-one, will default always to test simple generate.
If want to test hf generate, may need to manually remove the if condition here: https://github.com/intel-analytics/ipex-llm/blob/main/python/llm/dev/benchmark/all-in-one/run.py#L650C5-L651C40 or add a new option in config.yaml.
Pending to support this if needed in the future.

hkvision · 2024-12-04T07:48:26Z

If change to simple=False, will change to hf generate. In this way, can't be used together with BenchmarkWrapper, because BenchmarkWrapper's generate doesn't have the simple argument, will have the following error:

Can we make simple a kwarg instead?

Good idea, will modify this.

hkvision · 2024-12-04T08:06:28Z

If change to simple=False, will change to hf generate. In this way, can't be used together with BenchmarkWrapper, because BenchmarkWrapper's generate doesn't have the simple argument, will have the following error:

Can we make simple a kwarg instead?

Updated.

Still simple can't work together with BenchmarkWrapper, the code looks better after making it a kwarg.

jason-dai

LGTM

hkvision · 2024-12-04T08:29:05Z

Merge it first. Will fix llama3.2 issue in a next PR very soon.

hkvision requested review from plusbang and jason-dai December 2, 2024 12:27

hkvision force-pushed the generate branch from 50b0268 to 18cabaf Compare December 2, 2024 12:32

hkvision commented Dec 3, 2024

View reviewed changes

python/llm/src/ipex_llm/transformers/npu_models/convert.py Outdated Show resolved Hide resolved

hkvision added 6 commits December 3, 2024 19:10

generate

b2ea2ef

style

c733024

update

c3717dd

remove timing

94fbc10

style

75989af

style

fcd4165

hkvision force-pushed the generate branch from 2822207 to f1719f0 Compare December 4, 2024 06:09

combine generate api

f1719f0

simple in kwargs

b13471c

jason-dai approved these changes Dec 4, 2024

View reviewed changes

hkvision merged commit 7ff4533 into intel:main Dec 4, 2024
1 check passed

hkvision deleted the generate branch December 4, 2024 08:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support hf generate #12477

Support hf generate #12477

hkvision commented Dec 2, 2024 •

edited

Loading

jason-dai commented Dec 2, 2024

hkvision commented Dec 2, 2024

hkvision commented Dec 4, 2024

jason-dai commented Dec 4, 2024

hkvision commented Dec 4, 2024

hkvision commented Dec 4, 2024

hkvision commented Dec 4, 2024

jason-dai left a comment

hkvision commented Dec 4, 2024

Support hf generate #12477

Support hf generate #12477

Conversation

hkvision commented Dec 2, 2024 • edited Loading

jason-dai commented Dec 2, 2024

hkvision commented Dec 2, 2024

hkvision commented Dec 4, 2024

jason-dai commented Dec 4, 2024

hkvision commented Dec 4, 2024

hkvision commented Dec 4, 2024

hkvision commented Dec 4, 2024

jason-dai left a comment

Choose a reason for hiding this comment

hkvision commented Dec 4, 2024

hkvision commented Dec 2, 2024 •

edited

Loading