Commits

Author Commit Message Labels Comments Date
spirit
requires-dist = pyenchant
Tags
0.4a6
spirit
Improve identify by PyEnchant testing. Also reorder assertEqual arguments (computed_result, expected_result).
spirit
Some optimizations to the identify by PyEnchant method
spirit
Unused import
spirit
Minor
spirit
Use PyEnchant if trigram method fails (useful for short texts).
spirit
README: too short text example
spirit
README: installation instructions
spirit
Minor
spirit
Minor
spirit
Remove workarounds in default_hook().
spirit
Added tag 0.4a5 for changeset 1d6fdb18c208
spirit
Workaround for “pysetup run sdist” under Python2
Tags
0.4a5
spirit
setup.py: Use setup_requires "3to2" if running from Python 2.
spirit
Sanity checks when generating blocks
spirit
.hgignore
spirit
Added tag 0.4a4 for changeset b351a325720f
spirit
data subpackage
Tags
0.4a4
spirit
trigrams
spirit
My changes
spirit
Rename main module.
spirit
License
spirit
Pull from original author's Subversion repository. Initial moving and removing of unused files.
kent...@b878c2a8-e649-0410-a97a-412969e4d25c
Add README and setup.py
kent...@b878c2a8-e649-0410-a97a-412969e4d25c
Import all guessLanguage* functions into the root package
kent...@b878c2a8-e649-0410-a97a-412969e4d25c
Folded Hiragana, Katakana and Katakana Phonetic Extensions into Katakana for better recognition of Japanese (see issue 3)
kent...@b878c2a8-e649-0410-a97a-412969e4d25c
Add more language names
kent...@b878c2a8-e649-0410-a97a-412969e4d25c
Somewhat hacky fix for Vietnamese (issue 1)
kent...@b878c2a8-e649-0410-a97a-412969e4d25c
Merge some changes from technorati branch: fix _load_models() to handle directories and mixed-case filenames; change unknown token to 'UNKNOWN'; add accessors for language name and number
kent...@b878c2a8-e649-0410-a97a-412969e4d25c
Initial import