IMDBReview_NaiveBayes

This is an experimental Python project that extracts IMDB reviews for a movie, classifies them, and generates a result HTML file.
The project is based on the programming assignments and what I've learned in **Udacity CSC 101 class** and **Stanford NLP class**.
The script is also used as a plugin in my term project for my Distributed System class this semester.

To Train:

To Classify:

The files in the lists directory are the movie lists I used to train the NaiveBayes classifier. They come from random titles in:
	* IMDB Top 250 (
	* IMDB Bottom 100 (
	* The Best 1,000 Movies Ever Made (

To simplify the problem, the training process flags reviews with more than 6 stars as positive and reviews with fewer than 4 stars as negative.
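
The labeling rule above can be sketched roughly as follows. This is only an illustration, not the project's actual code: the function name `label_for_rating` and the "pos"/"neg" label strings are hypothetical, and it assumes that reviews with 4 to 6 stars are simply left out of training, which the rule implies but does not state.

```python
def label_for_rating(stars):
    """Map an IMDB star rating (1-10) to a training label.

    Returns "pos" for more than 6 stars, "neg" for fewer than 4 stars,
    and None otherwise.
    """
    if stars > 6:
        return "pos"
    if stars < 4:
        return "neg"
    # Assumption: ratings from 4 to 6 are treated as neutral and skipped.
    return None


# Hypothetical sample data: (stars, review text) pairs.
reviews = [
    (9, "A masterpiece"),
    (2, "Dull and far too long"),
    (5, "It was fine"),
]

# Keep only the reviews that received a label.
labeled = [
    (label_for_rating(stars), text)
    for stars, text in reviews
    if label_for_rating(stars) is not None
]
```

With this sample data, only the 9-star and 2-star reviews end up in `labeled`; the 5-star review is dropped.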

Trained data is stored in trained.raw as a plain text file.
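
For readers unfamiliar with the technique, here is a minimal sketch of a multinomial Naive Bayes classifier with add-one (Laplace) smoothing. It is not taken from this project: the names `tokenize`, `train`, and `classify` are hypothetical, the tokenizer is deliberately naive, and the counts are kept in memory because the layout of trained.raw is not described here.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(labeled_reviews):
    """Count words per label and documents per label."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    for label, text in labeled_reviews:
        doc_counts[label] += 1
        word_counts[label].update(tokenize(text))
    return word_counts, doc_counts


def classify(text, word_counts, doc_counts):
    """Return the label with the highest log-probability for the text."""
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    total_docs = sum(doc_counts.values())

    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: fraction of training reviews carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            # Add-one smoothing so unseen words do not zero out the score.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy training set, for illustration only.
training = [
    ("pos", "great wonderful film"),
    ("pos", "loved it great acting"),
    ("neg", "boring terrible film"),
    ("neg", "awful waste terrible"),
]
word_counts, doc_counts = train(training)
```

Calling `classify("great acting", word_counts, doc_counts)` on this toy data returns "pos", while `classify("terrible boring", word_counts, doc_counts)` returns "neg". Working in log space avoids numeric underflow when reviews are long.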

* Future Work:
	Try other algorithms to improve the accuracy
	Make an online version