Auto convert without tokenizer.json to prevent performance downgrade? #62

leiwen83 · 2023-08-12T08:04:57Z

As mentioned in #20 , lightllm performance would downgrade a lot if without tokenizer.json. So for those model without this file, shall it be reasonable to add some auto conversion process in the server start to workaround this case?

Thx

llehtahw · 2023-08-13T03:40:33Z

Hi @leiwen83, Were you able to use the fast tokenizer in LightLLM? If yes, would you like to open a pull request ? 😄

super-buster · 2023-08-16T02:42:18Z

You can save fast tokenizer in advane, refer to fix slow tokenizer

Then auto load fast tokenizer if you want. I have modified in my code

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto convert without tokenizer.json to prevent performance downgrade? #62

Auto convert without tokenizer.json to prevent performance downgrade? #62

leiwen83 commented Aug 12, 2023

llehtahw commented Aug 13, 2023

super-buster commented Aug 16, 2023

Auto convert without tokenizer.json to prevent performance downgrade? #62

Auto convert without tokenizer.json to prevent performance downgrade? #62

Comments

leiwen83 commented Aug 12, 2023

llehtahw commented Aug 13, 2023

super-buster commented Aug 16, 2023