Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

共建为拼音服务的字典、词典库 #43

Open
hotoo opened this issue May 11, 2016 · 9 comments
Open

共建为拼音服务的字典、词典库 #43

hotoo opened this issue May 11, 2016 · 9 comments

Comments

@hotoo
Copy link

hotoo commented May 11, 2016

拼音库主要依赖的是拼音字典、词典(后面简称“词典”),这个词典共用性很高,但由于词典库较大,出现问题的概率的也高。

建议大家一起共建、共同维护这个词典,你们觉得怎么样?
#41 #42

cc @mozillazg

@mozillazg
Copy link
Owner

mozillazg commented May 11, 2016

@hotoo 支持。可以考虑将词典库独立为一个单独的仓库,方便维护和反馈。

BTW, 我前段时间将字典独立成了一个单独的仓库(改为使用来自 Unihan Database 的数据): https://github.com/mozillazg/pinyin-data

@gumblex
Copy link
Contributor

gumblex commented Sep 22, 2016

@mozillazg
Copy link
Owner

@gumblex 感谢你提供的资料。我也要尽快把词库库建起来 😂

@mozillazg
Copy link
Owner

初始版本已经出来了:https://github.com/mozillazg/phrase-pinyin-data
@gumblex 地球拼音中都是繁体字 😂

@gumblex
Copy link
Contributor

gumblex commented Mar 6, 2017

可以转啊

@zgdlime

This comment has been minimized.

@mozillazg mozillazg pinned this issue Sep 22, 2020
@yaoruyi
Copy link

yaoruyi commented Nov 13, 2020

@mozillazg 您好,我想问一下,我手头有一个中文分词分好的词库,如何转成有带声调拼音的,这样就可以合并到拼音词库里了。有类似工具吗。

@mozillazg
Copy link
Owner

mozillazg commented Nov 13, 2020

@yaoruyi 请问词库中每个词语有对应的拼音数据不(哪种格式的都行,只要能标明正确的声调信息就好)?如果没有词语对应的拼音数据的话,就跟我们的需求不相符,我们的需求是一起维护不同词语准确的拼音数据,而不是单纯的汉语分词库。

@lanyanguang32
Copy link

@yaoruyi 请问词库中每个词语有对应的拼音数据不(哪种格式的都行,只要能标明正确的声调信息就好)?如果没有词语对应的拼音数据的话,就跟我们的需求不相符,我们的需求是一起维护不同词语准确的拼音数据,而不是单纯的汉语分词库。

拼音有就是不带声调的,哈哈。

@mozillazg mozillazg unpinned this issue Nov 12, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants