Skip to content
#

tatoeba

Here are 32 public repositories matching this topic...

This repository presents an approach to predict the language in which a document is written. In particular, the proposed approach transforms a text into character n-gram features and uses them to support the predictive power of a machine-learned classifier. Experimental results show that it is capable of identifying 14 languages with high accura…

  • Updated Aug 12, 2020
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the tatoeba topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tatoeba topic, visit your repo's landing page and select "manage topics."

Learn more