Jointly building the character and phrase dictionaries that serve pinyin libraries #43
Comments
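@hotoo I support this. We could split the phrase dictionary out into its own repository, which would make it easier to maintain and to collect feedback. BTW, a while ago I moved the character dictionary into a separate repository (it now uses data from the Unihan Database): https://github.com/mozillazg/pinyin-data
@gumblex Thanks for the material you provided. I need to get the phrase dictionary set up soon too 😂
The initial version is out: https://github.com/mozillazg/phrase-pinyin-data
Sure, it can be converted.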
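@mozillazg Hello, I have a question. I have a word list produced by Chinese word segmentation. How can I convert it into pinyin with tone marks so that it can be merged into the pinyin phrase dictionary? Is there a tool for this?
@yaoruyi Does each word in your list come with its pinyin (any format is fine, as long as it indicates the correct tones)? If it doesn't, the list doesn't match what we need: our goal is to jointly maintain accurate pinyin data for phrases, not just a Chinese word-segmentation lexicon.
It does have pinyin, just without tones, haha.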
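For anyone checking their own word list against the requirement above, here is a minimal sketch of how entries with and without tone information could be separated. It is only an illustration: the function names `has_tone_info` and `split_by_tone_info` are hypothetical, and the two accepted notations (tone marks such as `zhōng` and trailing tone digits such as `zhong1`) are an assumption about what "any format" might cover, not a rule from either repository.

```python
import re
import unicodedata

# Combining marks for the four pinyin tones: macron, acute, caron, grave.
TONE_MARKS = {"\u0304", "\u0301", "\u030c", "\u0300"}


def has_tone_info(syllable: str) -> bool:
    """Return True if a syllable has a tone mark ('zhōng') or a tone digit ('zhong1')."""
    # Assumed numeric notation: letters followed by a single digit 1-5.
    if re.fullmatch(r"[a-zü:v]+[1-5]", syllable.lower()):
        return True
    # Decompose accented letters so the tone mark becomes a separate code point.
    decomposed = unicodedata.normalize("NFD", syllable)
    return any(ch in TONE_MARKS for ch in decomposed)


def split_by_tone_info(entries):
    """Split (phrase, pinyin) pairs into toned and untoned lists.

    An entry counts as toned if at least one syllable carries tone
    information, because neutral-tone syllables are legitimately unmarked.
    This is a heuristic, not a correctness check of the pinyin itself.
    """
    toned, untoned = [], []
    for phrase, pinyin in entries:
        if any(has_tone_info(s) for s in pinyin.split()):
            toned.append((phrase, pinyin))
        else:
            untoned.append((phrase, pinyin))
    return toned, untoned
```

Entries that land in the untoned list would still need tones added and verified by hand before they could be useful for a phrase dictionary.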
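A pinyin library depends mainly on its pinyin character dictionary and phrase dictionary (hereafter "the dictionary"). This dictionary is highly reusable across projects, but because it is large, the chance of it containing errors is also high.
I suggest we build and maintain this dictionary together. What do you all think?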
#41 #42
cc @mozillazg