GerHobbelt/many-stop-words

Many Stop Words

Simple Python package that provides a single function for loading sets of stop words for different languages.
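
As a rough illustration of what a single loading function can look like, the sketch below reads one word per line from a per-language text file and returns the words as a set. The function name load_stop_words, the "<lang>.txt" file layout, and the sample words are assumptions made for this example; they are not taken from this package's source.

```python
import tempfile
from pathlib import Path


def load_stop_words(data_dir, lang):
    """Return the stop words for `lang` as a set (hypothetical helper)."""
    path = Path(data_dir) / f"{lang}.txt"
    if not path.is_file():
        raise ValueError(f"no stop word list for language {lang!r}")
    with path.open(encoding="utf-8") as handle:
        # One word per line; blank lines and surrounding whitespace are skipped.
        return {line.strip() for line in handle if line.strip()}


# Demo with a throwaway directory and a tiny, made-up English list.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "en.txt").write_text("the\nand\n\nof\n", encoding="utf-8")
    english = load_stop_words(tmp, "en")

# Typical use: drop stop words from a tokenized sentence.
tokens = "the history of stop words and search".split()
content_words = [t for t in tokens if t not in english]
```

Returning a set keeps membership checks cheap, which matters when filtering every token of a large corpus.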

Stop words in English, French, German, Finnish, Hungarian, Turkish, Russian, Czech, Greek, Arabic, Chinese, Japanese, Korean, Catalan, Polish, Hebrew, Norwegian, Swedish, Italian, Portuguese and Spanish were retrieved from the following sources:

The directory called orig contains the original files used to compile the stop word lists. The directory called not_used contains raw data for creating stop word lists for languages that are not yet available in many_stop_words.available_languages.
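
One way a list of available languages could be derived is by scanning a data directory for compiled list files. The sketch below shows that idea only; the helper name list_available_languages, the ".txt" naming scheme, and the ".raw" extension for unprocessed data are assumptions for illustration, and the package may well build many_stop_words.available_languages differently.

```python
import tempfile
from pathlib import Path


def list_available_languages(data_dir):
    """Return sorted language codes that have a list file (hypothetical helper)."""
    return sorted(p.stem for p in Path(data_dir).glob("*.txt"))


with tempfile.TemporaryDirectory() as tmp:
    # Made-up layout: one compiled list per language code.
    for code in ("en", "fr", "de"):
        (Path(tmp) / f"{code}.txt").write_text("", encoding="utf-8")
    # Raw data with another extension is ignored, much like unprocessed material.
    (Path(tmp) / "xx.raw").write_text("", encoding="utf-8")
    languages = list_available_languages(tmp)
```

Checking a language code against such a list before loading gives callers a clear error for unsupported languages.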
