Dear authors,

Thanks for open-sourcing your wonderful work.

You mention GPT in Figure 3 when comparing the Pareto front across different models ("AR models of the same size"). May I ask whether this is a pre-trained GPT (e.g. GPT2-small) fine-tuned on the LM1B dataset, or a model with the GPT architecture trained from scratch on the LM1B training set?
Thank you for your question! We include both models in Figure 3. The red curve, which lies rather close to our DiffusionBERT, corresponds to an AR model trained from scratch, and the green one to a fine-tuned GPT2. In general, DiffusionBERT still falls behind pretrained AR models in terms of generation quality.
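For reference, here is a minimal sketch (not the authors' training code) of how one might set up the two baselines with Hugging Face `transformers`: fine-tuning a pretrained GPT2-small on LM1B versus training the same architecture from scratch. The dataset identifier, sequence length, and hyperparameters are illustrative assumptions, not the paper's settings.

```python
# Minimal sketch, assuming Hugging Face transformers/datasets; hyperparameters,
# max_length, and the "lm1b" Hub dataset id are illustrative assumptions.
from datasets import load_dataset
from transformers import (
    AutoConfig,
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # GPT2-small
tokenizer.pad_token = tokenizer.eos_token          # GPT-2 defines no pad token

# Pretrained-then-fine-tuned baseline (the green curve in Figure 3):
model = AutoModelForCausalLM.from_pretrained("gpt2")
# From-scratch baseline (the red curve): same architecture, random init:
# model = AutoModelForCausalLM.from_config(AutoConfig.from_pretrained("gpt2"))

# One Billion Word benchmark as mirrored on the HF Hub (assumed to match the
# split used in the paper).
dataset = load_dataset("lm1b", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)  # causal LM

args = TrainingArguments(
    output_dir="gpt2-lm1b",
    per_device_train_batch_size=32,
    num_train_epochs=1,
    learning_rate=5e-5,
)
Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    data_collator=collator,
).train()
```

Swapping `from_pretrained` for `from_config` is the only change needed to move between the two baselines, which keeps the comparison in Figure 3 architecture-matched.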