muzny committed 6ce1207

further readme updates

Files changed (2)

 WiktionaryIdioms/data/*
 WiktionaryIdioms/paperOutputModels/*
 WiktionaryIdioms/bin
+WiktionaryIdiomDataSets/*
+WiktionaryIdiomDataSets.zip
 
 Grace Muzny and Luke Zettlemoyer. [Automatic Idiom Identification in Wiktionary](http://homes.cs.washington.edu/~lsz/papers/mz-emnlp13.pdf). In *Proceedings of the Conference on Empirical Methods in Natural Language Processing* (EMNLP), 2013.
 
-## The Classifier
-
-## The Detector
+The design of the classifier and detector, and how they interact, is described in the paper above. Please cite it if you use the code or data in your research.
 
 ## Building
 
 Example:
 
 ```
-$ java -Xmx4g -jar dist/RunDetectorExperimentFromFiles-1.0.jar dummy lesk ./config/nodbconfig.xml ./models-dir/basicperceptron.model 
+$ java -Xmx4g -jar dist/RunDetectorExperimentFromFiles-1.0.jar dummy lesk ./config/nodbconfig.xml ./models/filename.model
 ```
 
 ## Data
 
 * `[test|dev]_[un]annotated_nofeatures.txt` - The same as the corresponding files, but with no computed feature values.
 
+## Reproducing Paper Results
+
+The configuration files are set up so that you can reproduce the paper results for the Annotated Lexical+Graph classifier by running the following commands from the WiktionaryIdioms directory:
+
+```
+$ mkdir data   # move the data sets you get from the download into this directory
+$ ant runnables
+$ mkdir models
+$ java -Xmx4g -jar dist/RunClassifierExperimentFromFiles-1.0.jar basic ./config/classifierconfig.xml  # will produce two files: "filename" and "./models/filename.model"
+$ java -Xmx4g -jar dist/RunDetectorExperimentFromFiles-1.0.jar dummy lesk ./config/nodbconfig.xml ./models/filename.model
+```
 
 ### MySQL