Commits

Author Commit Message Labels Comments Date
david_walker
proper spacing on output text generation
david_walker
bring the bulk of regular expressions from old regex_process.py into a rule/transform
david_walker
refactor get_neighbors
david_walker
iso currency expansion
david_walker
add thousands separators to currency numbers
david_walker
obsolete code dumping ground
david_walker
Started on transform of single digits to spelled out numbers. Next checkin will get rid of classifier.py and put that logic as properties (computed attributes) of Token class.
david_walker
fix URL-detecting regex and its flags
david_walker
Fix bugs in abbreviation splitting
david_walker
a bit of refactoring
david_walker
support splitting tokens on abbreviations and periods
david_walker
refactor base.py and ruleset.py from kea2.py change to-do list comment in kea.py refactoring in kea2.py
david_walker
more initial implementation of kea2, still not running yet
david_walker
Mostly writing broad description of algorithm in comments, and starting to tentatively define some classes.
david_walker
changes to internal to-do list comment
david_walker
initial version
david_walker
add entries to currency and abbreviation lists consider '%' to be a punctuation character in Token.is_punc() use NamedTemporaryFile for logging to fix access-denied when attempting to create hard-coded 'kiva.log' file in a r/o directory add more items to internal TODO list
david_walker
Do propert command-line parsing with kea.py wrap stdin and stdout in unicode reader and writer
david_walker
wscript: add creation of __init__.py
david_walker
changed .hgignore: more junk to ignore changed wscript: add build rule
david_walker
changed .hgignore: added more things that are not part of project but that live in the directory on my dev machine
david_walker
added gkea.py: pyqt4 based graphical front end added kivaui.ui: qt designer layout file added wscript: waf build framework control file changed .hgignore: ignore waf cruft and generated file kivaui.py
david_walker
make single-backslash regex strings raw
david_walker
handle us and euro currency styles
david_walker
needed to increment index to account for inserting $ behind it
david_walker
begin work on US currency symbol
david_walker
further work on alphanumerics
david_walker
fix regexes for 11th 101st etc
david_walker
fix bug in test.py, should have been using stringio
david_walker
handle asterisk as a footnote character
  1. Prev
  2. Next