
Availability of using other BERT alternatives for current languages #145

Open
gordon0414 opened this issue Jun 20, 2024 · 3 comments

gordon0414 commented Jun 20, 2024

Hello, I was wondering if we can replace the BERT models for the currently supported languages:

    Languages.JP: BASE_DIR / "bert" / "deberta-v2-large-japanese-char-wwm",
    Languages.EN: BASE_DIR / "bert" / "deberta-v3-large",
    Languages.ZH: BASE_DIR / "bert" / "chinese-roberta-wwm-ext-large",

The current models are mostly large variants, but I think smaller BERT models would still be capable for our use case.

By smaller variants, I mean models such as:
- English: microsoft/deberta-v3-small
- Japanese: ku-nlp/deberta-v2-base-japanese-char-wwm

I suppose these variants won't cause any issues, since their tokenizers are identical to the original ones.
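As a quick sanity check of that assumption, one could compare the two tokenizers' vocabularies before swapping checkpoints. A minimal sketch (`vocabs_match` is a hypothetical helper; the commented-out checkpoint names are the ones discussed above and would require network access to download):

```python
from typing import Dict

def vocabs_match(vocab_a: Dict[str, int], vocab_b: Dict[str, int]) -> bool:
    """Return True when both tokenizers map every token to the same id."""
    return vocab_a == vocab_b

# With real checkpoints this would look like:
# from transformers import AutoTokenizer
# large = AutoTokenizer.from_pretrained("ku-nlp/deberta-v2-large-japanese-char-wwm")
# base  = AutoTokenizer.from_pretrained("ku-nlp/deberta-v2-base-japanese-char-wwm")
# assert vocabs_match(large.get_vocab(), base.get_vocab())
```

If the vocabularies differ even slightly, the phoneme-to-BERT-feature alignment would break, so this check is worth doing before any fine-tuning.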

My hypothesis is that if we fine-tune our model with the new BERT models, they will integrate seamlessly with the current system.

@gordon0414 gordon0414 changed the title Availability using other BERT alternatives for current languages Availability of using other BERT alternatives for current languages Jun 20, 2024
@litagin02 (Owner)

I haven't checked, but it may work as you guess; feel free to try it.

@gordon0414 (Author)

Okay, maybe I'll try it this weekend!

@gordon0414 (Author)

The dimensional mismatch between the base and large models (a hidden size of 768 vs. 1024) looks like a significant problem. We would need techniques such as knowledge distillation or other methods to bridge it.
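One possible workaround for the mismatch, sketched below in PyTorch: insert a learned linear projection that maps the base model's 768-dim features up to the 1024 dims the current pipeline expects. The `BertDimAdapter` name and its placement are assumptions for illustration, not part of the existing code, and the adapter would need to be trained together with the fine-tuning run.

```python
import torch

class BertDimAdapter(torch.nn.Module):
    """Hypothetical adapter: project base-variant BERT features (768-dim)
    up to the 1024 dims the large-variant pipeline expects."""
    def __init__(self, in_dim: int = 768, out_dim: int = 1024):
        super().__init__()
        self.proj = torch.nn.Linear(in_dim, out_dim)

    def forward(self, bert_features: torch.Tensor) -> torch.Tensor:
        # bert_features: (batch, sequence_length, in_dim)
        return self.proj(bert_features)

adapter = BertDimAdapter()
dummy = torch.randn(1, 50, 768)   # fake base-model output
out = adapter(dummy)
print(out.shape)                  # torch.Size([1, 50, 1024])
```

This keeps the rest of the system untouched at the cost of a small extra parameter matrix, whereas knowledge distillation would instead train the smaller BERT to mimic the large one's feature space directly.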
