Commits

izazi committed 23c3481

Added an appropriate readme

Files changed (2)

 # tfidf-cascalog
 
-A Clojure library designed to ... well, that part is up to you.
+Implements part of the TF-IDF algorithm. Takes an Avro-based file as input, calculates the TF, DF and D components in batch mode, and writes the results to a Cassandra table. A storm-trident DRPC query will later read these results and combine them with the real-time data to form a complete view of the world.
 
 ## Usage
 
-FIXME
+Ensure that both Hadoop and Cassandra are running, then:
 
-## License
+	lein deps
+	lein compile
+	lein uberjar
+	
+	copydata.sh
+	hadoop jar ./target/tfidf-cascalog-0.1.0-SNAPSHOT-standalone.jar data/document.avro data/en.stop 127.0.0.1
+	
+Replace `127.0.0.1` with the IP address of your Cassandra node.
 
-Copyright © 2013 FIXME
-
-Distributed under the Eclipse Public License, the same as Clojure.
   :source-paths ["src/clj"]
   :test-paths   ["test/clj"]
   :resource-paths ["src/resources"]
+  :main tfidf-cascalog.core
   :license {:name "Eclipse Public License"
             :url "http://www.eclipse.org/legal/epl-v10.html"}
   :dependencies [[org.clojure/clojure "1.4.0"]