Spam classification and sentiment analysis.
- Spam Classification: Partition emails into 2 categories depending on whether they contain spam or not.
- Sentiment Analysis: Partition movie reviews into 2 categories depending on whether they are positive or negative reviews.
- emails: text documents
- movie reviews: text documents
The goals above are each accomplished by training a Naive Bayes classifier on a set of training data, and then testing our classifier on a set of test data. We hope to have a high success rate in figuring out which emails contain spam, and whether an unseen movie review is positive or negative.