GitHub - LuminosoInsight/code-sample-brown-corpus

Singular and plural nouns: a coding challenge

The Brown Corpus contains approximately a million words of English text, tagged with symbols indicating their parts of speech. You can download the corpus here.

Your task is to write a program that determines which nouns in this corpus appear particularly often in the plural form relative to the singular form. Write the results to a file, including whatever information and using whatever format seems best. Choose a reasonable criterion to determine which words to output.

Document your design and your decisions (in comments and/or a readme file), including how to run your program. Explain any problems or errors in your results, and how they could be fixed.

Feel free to make simplifying assumptions; while we expect your code to recognize the most common patterns for pluralizing words, for example, we do not necessarily expect it to know that "corpora" is the plural of "corpus". You may use any external libraries or data you wish.

Submitting your code

When your Python code is ready, package it up and send it to [email protected], along with any necessary instructions on how to run it. Please send it only to us, and don't make your code publicly available.

We review code samples anonymously, so please do not put your name in the code itself.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Singular and plural nouns: a coding challenge

Submitting your code

About

Releases

Packages

Contributors 2

LuminosoInsight/code-sample-brown-corpus

Folders and files

Latest commit

History

Repository files navigation

Singular and plural nouns: a coding challenge

Submitting your code

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages