Replies: 3 comments
-
您好,打扰您一下,我还想问下,一条训练数据里面最多可以放多少个token?这是由什么决定的呀?比如说这条数据 |
Beta Was this translation helpful? Give feedback.
-
纯文本可参考 wiki_demo |
Beta Was this translation helpful? Give feedback.
-
纯文本的那个你后来 |
Beta Was this translation helpful? Give feedback.
-
Reminder
System Info
x
Reproduction
"my_demo": {
"file_name": "天龙八部.txt",
"columns": {
"prompt": "text"
}
}
我这个天龙八部.txt里面是没有将其内容按照 {"text": "document"}这种格式处理的,就是纯文本。我想问下,直接这样直接使用纯文本做预训练可以吗
Expected behavior
No response
Others
No response
Beta Was this translation helpful? Give feedback.
All reactions