Wiktionary Idiom Classifier & Detector
Grace Muzny and Luke Zettlemoyer. Automatic Idiom Identification in Wiktionary. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2013.
There is an accompanying ant file. From the WiktionaryIdioms directory, simply type ant dist to build the distribution jar.
If you would like the runnable jars corresponding to detector.experiments.RunDetectorExperimentsFromFiles, these can be built by running the command ant runnables, and will be made in your
Alternatively, the jar files are all available in the Downloads section.
Most of the classes with main methods come with descriptions of all the parameters they require. Here is a description of two key classes and how to run them.
All main classes will work with 4g of memory allocated (-Xmx4g). Most require at least this much memory.
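As a rough sketch of how the memory setting is passed to the JVM, the snippet below assembles a java command line that includes the -Xmx4g flag. The jar path dist/WiktionaryIdioms.jar is a placeholder assumption, since the file name that ant dist produces is not stated here. The main class is the one named earlier in this README, and its own arguments are left out. The snippet only prints the command; it does not run it.

```shell
#!/bin/sh
# Placeholder jar path (assumption): substitute the jar that `ant dist` actually produces.
JAR="dist/WiktionaryIdioms.jar"

# Main class named in this README; its required arguments are not shown here.
MAIN="detector.experiments.RunDetectorExperimentsFromFiles"

# -Xmx4g sets the JVM maximum heap size to 4 gigabytes.
CMD="java -Xmx4g -cp $JAR $MAIN"

# Print the assembled command for inspection before running it by hand.
echo "$CMD"
```

Once the jar path is corrected, the printed line can be run directly with the class's arguments appended.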
To work on the project in Eclipse, simply download and import the project WiktionaryIdioms into it.