Added support for google/flan models #2321
Conversation
```python
def match(self, model_path: str):
    return "flan" in model_path.lower()
```
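The matching rule in the diff can be exercised in isolation. The class wrapper below is illustrative only (the PR shows just the `match` method, not the surrounding adapter class):

```python
# Illustrative wrapper around the PR's match() method: the adapter
# claims any model whose path contains "flan", case-insensitively.
class FlanAdapter:
    def match(self, model_path: str) -> bool:
        return "flan" in model_path.lower()

adapter = FlanAdapter()
print(adapter.match("google/flan-t5-xl"))   # True
print(adapter.match("meta-llama/Llama-2"))  # False
```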
What is the instruction template for flan? Do we need to set it here?
Hi @merrymercy, do you mean prompt template?
yes
@wangzhen263 Any updates? IIRC, flan-t5 has a default prompt template.
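For context, flan-t5 checkpoints are typically prompted with the bare instruction text rather than a chat-style wrapper, so a minimal zero-shot "template" is close to a pass-through. The helper below is a hypothetical sketch, not code from this PR or FastChat's actual template API:

```python
# Hedged sketch of a zero-shot prompt template for flan-t5:
# the model consumes the instruction directly, with no role
# markers or chat markup, so prompt construction is trivial.
def build_flan_prompt(instruction: str) -> str:
    """Return the prompt fed to a flan-t5 model (no chat wrapper)."""
    return instruction.strip()

print(build_flan_prompt("  Translate to German: Hello.  "))
```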
bf7aa7e to a81a04c
Added AutoModelForSeq2SeqLM to init google/flan models in 8bit
Closed due to inactivity. Feel free to reopen after you address these problems.
Why are these changes needed?
Related issue number (if applicable)
Checks
I've run `format.sh` to lint the changes in this PR.