This project clusters Chinese documents and can be applied to English with small changes. The clustering results are not always good, for a few reasons. The approach used in the project favors finding identical documents over merely similar ones, and it cannot recognize synonyms, so the project may be better suited to English clustering. The code is not optimized, so on large texts it runs slowly and needs a lot of memory.
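To illustrate why a method based on exact term matching groups identical documents well but misses synonyms, here is a minimal sketch. It assumes Jaccard overlap on token sets as the similarity measure, which may not be what this project actually uses; the `jaccard` function and the sample sentences are hypothetical and are not taken from the repository.

```python
def jaccard(tokens_a, tokens_b):
    """Jaccard similarity of two token lists, counting exact string matches only."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    if not set_a and not set_b:
        return 1.0  # two empty documents are treated as identical
    return len(set_a & set_b) / len(set_a | set_b)


# Whitespace splitting is enough for this English example (assumption:
# Chinese text would first need a word segmenter to produce tokens).
doc_a = "the car is fast".split()
doc_b = "the car is fast".split()          # identical wording
doc_c = "the automobile is quick".split()  # same meaning, different words

same_score = jaccard(doc_a, doc_b)     # every token matches
synonym_score = jaccard(doc_a, doc_c)  # only "the" and "is" match

print(same_score, synonym_score)
```

The identical pair scores far higher than the synonym pair, even though all three sentences mean the same thing, because "car"/"automobile" and "fast"/"quick" are compared as unrelated strings.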
code-learner/ChineseDocCluster