Commits

Robert Bu committed 222f95c

add readme

  • Participants
  • Parent commits 9129934

Comments (0)

Files changed (1)

 To Classify:
 	python imdb.py -c OUTPUT_HTML_PATH MOVIE_TITLE [MAX_COMMENT_COUNT]
 
+
 The files in the lists directory is the movie lists I used to train the NaiveBayes classifier, they come from random titles in
 	* IMDB Top 250 (http://www.imdb.com/chart/top)
 	* IMDB Bottom 100 (http://www.imdb.com/chart/bottom)
 	* New York Times The Best 1,000 Movies Ever Made (http://www.nytimes.com/ref/movies/1000best.html)
 
+To simplify the problem, the train process will flag reviews with more than 6 stars as a positive review, reviews with less than 4 stars as a negative review
+
 Trained data is stored in trained.raw as a plain text file
 
 * Future Works: