Commits

Author Commit Message Labels Comments Date
Stanisław Pitucha
simple refact
Stanisław Pitucha
find_... reformatted
Stanisław Pitucha
refactored make_verification_dictionary + fixed a bug with 'izer' words
Stanisław Pitucha
verify_lemmatiser corrected
Stanisław Pitucha
simplify lemmatize_word
Stanisław Pitucha
tagger test
Stanisław Pitucha
deobfuscation step 1
Stanisław Pitucha
reformatted
Stanisław Pitucha
reformatted tagger
Stanisław Pitucha
refactor the init a bit
Stanisław Pitucha
list versions of some functions
Stanisław Pitucha
more shit removed
Stanisław Pitucha
more unobfuscation
Stanisław Pitucha
unobfuscated more functions
Stanisław Pitucha
lemmetiser/init reformatted
Stanisław Pitucha
modeline in lemmatiser
Stanisław Pitucha
tokenizer fixed, refactored, simple test added
Stanisław Pitucha
sane tokenize
Stanisław Pitucha
paragraph handling
Stanisław Pitucha
split_sentences
Stanisław Pitucha
add some sanity and unittests
cnu
Added proper license files and removed __author__ vars from source
cnu
Added .hgignore and __init__.py file
cnu
Fixed decimal number problem in tokenizing
cnu
Added common contractions to tokenizer
cnu
Initial Commit forking MontyLingua Python code from http://web.media.mit.edu/~hugo/montylingua/