We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我们的语料库有几十万行,文件大小大概1G,这些文本作为doc输入,直接就oom了,有没有处理这种情况的好方法。
The text was updated successfully, but these errors were encountered:
可以给我试试,实现过一版类似的
Sorry, something went wrong.
https://github.com/smoothnlp/SmoothNLP/blob/master/smoothnlp/algorithm/phrase/ngram_utils.py 这个库用trie树计算自由度,内存占用上比当前库要好很多
No branches or pull requests
我们的语料库有几十万行,文件大小大概1G,这些文本作为doc输入,直接就oom了,有没有处理这种情况的好方法。
The text was updated successfully, but these errors were encountered: