Skip to content

您好,请问下,一个纯文本的txt文档来做预训练的话,dataset_info.json该如何添加这个新数据集?我需要将这个txt的内容转换成这种格式吗[ {"text": "document"}, {"text": "document"} ]?如果我不想转,就是想使用一个书本txt做预训练该如何做 #4909

Unanswered
cheun726 asked this question in Q&A
Discussion options

You must be logged in to vote

Replies: 3 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
pending This problem is yet to be addressed
3 participants
Converted from issue

This discussion was converted from issue #4900 on July 20, 2024 16:11.