Scripts create lists of English nouns with plural forms using Wiktionary dump. It can be used to transform noun to singular or plural form. WordNet lacks this information.
Files:
noun.csv
- countable nounsnoun_countable_and_uncountable.csv
- e.g. http://en.wiktionary.org/wiki/beersnoun_uncountable.csv
- nouns that cannot be used freely with numbers or the indefinite article, and which therefore takes no plural form, e.g. http://en.wiktionary.org/wiki/lycranoun_usually_uncountable.csv
- e.g. http://en.wiktionary.org/wiki/informationnoun_unknown.csv
- nouns with unknown or uncertain pluralnoun_pluralia_tantum.csv
- nouns that has no singular form, e.g. http://en.wiktionary.org/wiki/scissorsnoun_not_attested.csv
- nouns with plural not attested
Two columns (singular and plural form) have files: noun.csv
, noun_countable_and_uncountable.csv
, noun_usually_uncountable.csv
.
Usage:
ruby parser.rb enwiktionary-20140206-pages-meta-current.xml
ruby process_templates.rb