nltk-trainer /

Filename Size Date modified Message
46 B
initial docs
11.1 KB
apache license, initial, shebang in scripts
1.0 KB
python versions, link to data install
3.0 KB
initial analyzed chunked corpus tests
3.4 KB
fix fractions, initial tests for analyze tagger coverage
5.7 KB
sklearn classification algorithms with scikitlearn.SklearnClassifier training
3.2 KB
dynamic formatting for tag columns
5.1 KB
bottom metrics column 13, support fileids without metrics
2.2 KB
initial script to combine classifiers into AvgProbClassifier, make other scripts executable
6.2 KB
tag_phrases basically working with ChunkedCorpusWriter
2.0 KB
default hierarchy to empty list for combining classifiers
52 B
adding >= to in requirements.txt as this package version does not exist in
1.2 KB fixed
2.7 KB
tag_phrases basically working with ChunkedCorpusWriter
7.0 KB
better corpus loading
14.2 KB
value type float for tfidf
10.3 KB
better corpus loading
3.1 KB
initial working

NLTK Trainer

NLTK Trainer exists to make training and evaluating NLTK objects as easy as possible.


You must have Python >=2.6 (but not 3.x) with argparse and NLTK 2.0 installed. NumPy, SciPy, and megam are recommended for training Maxent classifiers. To use the sklearn classifiers, you must also install scikit-learn.

If you want to use any of the corpora that come with NLTK, you should install the NLTK data.


Documentation can be found at (you can also find these documents in the docs directory. Every script also provides a --help option that describes all available parameters.

Tip: Filter by directory path e.g. /media app.js to search for public/media/app.js.
Tip: Use camelCasing e.g. ProjME to search for
Tip: Filter by extension type e.g. /repo .js to search for all .js files in the /repo directory.
Tip: Separate your search with spaces e.g. /ssh pom.xml to search for src/ssh/pom.xml.
Tip: Use ↑ and ↓ arrow keys to navigate and return to view the file.
Tip: You can also navigate files with Ctrl+j (next) and Ctrl+k (previous) and view the file with Ctrl+o.
Tip: You can also navigate files with Alt+j (next) and Alt+k (previous) and view the file with Alt+o.