Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Changing the allowed maximum conversation length in Plato-2 #183

Open
Sidx-sys opened this issue Mar 6, 2023 · 1 comment
Open

Changing the allowed maximum conversation length in Plato-2 #183

Sidx-sys opened this issue Mar 6, 2023 · 1 comment

Comments

@Sidx-sys
Copy link

Sidx-sys commented Mar 6, 2023

Plato-2 removes all the conversation data points which exceed its constraint of conversation length. How can I increase this allowed conversation length so as to incorporate context lengths?

Please do help, I have been trying to do this for a while now. But everytime it seems like the pretrained weights that is being downloaded for the 24L model has a fixed unique vocabulary size, which leads to the training to not happen.

Thank you.

@sserdoubleh
Copy link
Collaborator

Using tool to expand position embedding: https://github.com/PaddlePaddle/Knover/blob/develop/knover/tools/expand_embedding.sh

sh knover/tools/expand_embedding ${PATH_TO_MODEL_DIR}

And then update training config file:
Add one line after

such as:

train_args="--max_src_len 384"

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants