Arabic Models #117
Replies: 2 comments 1 reply
-
Hello @Anas-Abdelhadi, currently the core tokenizer supports only Latin and Devanagari scripts, so it is not yet possible to build models for Arabic. If you are interested in contributing, we can explore together how to enhance the tokenizer and work toward building such a model. Best,
-
Thanks @Anas-Abdelhadi... sounds great. The winkNLP tokenizer is inspired by wink-tokenizer; please take a look at it first, as it is a simpler version. We will have to figure out how to add the capability to process Arabic script. You may also drop a line at [email protected]. Best,
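As a starting point for that exploration, here is a rough sketch of script-aware tokenization. It is not the winkNLP or wink-tokenizer API: `tokenizeByScript`, `scriptOf`, `Token`, and `TOKEN_PATTERN` are hypothetical names, and the Arabic range used (U+0620 to U+065F, the basic letters and diacritics) is an assumption that leaves out the extended letters, Arabic digits, and Arabic punctuation a real tokenizer would need to handle.

```typescript
// Hypothetical sketch; none of these names come from winkNLP or wink-tokenizer.
type Script = "arabic" | "devanagari" | "latin" | "other";

interface Token {
  value: string;
  script: Script;
}

// One alternative per script run, then a fallback for any single non-space character.
// Assumed ranges: U+0620-U+065F (basic Arabic letters and diacritics),
// U+0900-U+097F (Devanagari block), A-Z and a-z (basic Latin letters only).
const TOKEN_PATTERN = /[\u0620-\u065F]+|[\u0900-\u097F]+|[A-Za-z]+|[^\s]/g;

// Classify a token by the code point of its first character.
function scriptOf(text: string): Script {
  const code = text.charCodeAt(0);
  if (code >= 0x0620 && code <= 0x065f) return "arabic";
  if (code >= 0x0900 && code <= 0x097f) return "devanagari";
  if ((code >= 0x41 && code <= 0x5a) || (code >= 0x61 && code <= 0x7a)) return "latin";
  return "other";
}

// Split text into runs of a single script; whitespace is dropped.
function tokenizeByScript(text: string): Token[] {
  const matches: string[] = text.match(TOKEN_PATTERN) || [];
  return matches.map((value) => ({ value, script: scriptOf(value) }));
}
```

Calling `tokenizeByScript("مرحبا world!")` should yield three tokens: an Arabic word, a Latin word, and the `!` tagged as `other`. Splitting on script runs is only the first step; Arabic also needs decisions about diacritics, clitics attached to words, and right-to-left punctuation, which this sketch does not attempt.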
-
Hi there, I've checked your documentation and it appears you only support two models (en). Following up on a prior question regarding Spanish, is there a tutorial that would help with developing a model for Arabic? I would appreciate it if you could point me to where to start and how to do it.
Thanks