This is a day project in which we will analyze various aspects of Reddit. Of course, it will be necessary to get the relevant data first. There are a few steps to do this, described below.
Make sure you have Python 2.7+ installed. It should also be useful to have the scientific Python stack (numpy, scipy, matplotlib) installed. Then install the Python wrapper for the Reddit API as described here: http://mellort.github.com/reddit_api/
pip install reddit
Great. Now make sure you can run a simple script.
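One possible first script is an environment check that reports which of the modules mentioned above can be imported. This is only a sketch: the function name check_modules is made up for illustration, and the module name "reddit" is assumed from the pip command above and may differ depending on the version of the wrapper that gets installed.

```python
def check_modules(names):
    """Return a dict mapping each module name to True if it can be imported."""
    status = {}
    for name in names:
        try:
            __import__(name)
            status[name] = True
        except ImportError:
            status[name] = False
    return status


if __name__ == "__main__":
    # "reddit" is an assumed module name, taken from `pip install reddit`.
    wanted = ["reddit", "numpy", "scipy", "matplotlib"]
    for name, ok in sorted(check_modules(wanted).items()):
        print("%-12s %s" % (name, "ok" if ok else "MISSING"))
```

Save it under any file name and run it with python. Any module reported as MISSING needs to be installed before continuing.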
Nota bene: the following 1-gram frequency lists: http://en.wiktionary.org/wiki/Wiktionary:Frequency_lists
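A 1-gram frequency list is simply a table of single-word counts, so the same kind of table can be built from any text and set next to the reference lists. A minimal sketch, assuming a very simple tokenizer (lowercase letters and apostrophes only) and a made-up sample string in place of real Reddit data; the function name unigram_counts is hypothetical:

```python
import re
from collections import Counter


def unigram_counts(text):
    """Count lowercase word tokens (1-grams) in a string."""
    # Assumed tokenizer: runs of letters and apostrophes; everything else splits words.
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)


# Hypothetical sample text standing in for text fetched from Reddit.
sample = "The cat sat on the mat. The mat was flat."
counts = unigram_counts(sample)

# Print the most frequent words with their counts.
for word, n in counts.most_common(3):
    print("%s %d" % (word, n))
```

Counter is used here because it already provides most_common for ranking words. A more careful tokenizer would be needed for URLs, numbers, and non-English text.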