Author Commit Message Labels Comments Date
david_walker
add only single space after sentence final punctuation
david_walker
don't add space before closing quote
david_walker
merge
david_walker
Append an s to currency names unless the next word is "loan"
david_walker
don't split at apostrophes because the simple approach of having a list of words which can contain them ("you'll" etc.) fails to account for names, some of which can contain multiple apostrophes (e.g. "Ng'ang'a").
da...@david-office.Bubka
add ll to list of apostrophe endings
david_walker
prevent pycountry logging complaint by adding null handler
david_walker
put log file in temp directory instead of current dir
da...@david-office
add two spaces after sentence-final period
da...@david-office.Bubka
add a.m. and p.m. as abbreviations
david_walker
one acre fund template cleanup
david_walker
new regexes for One Acre Fund
david_walker
initial progress report
Branches
parse
david_walker
completed draft of first progress report
Branches
parse
david_walker
BibTeX bibliography file
Branches
parse
david_walker
update token.cend for merging tokens in "years old" type expressions
Branches
parse
david_walker
checkpoint
Branches
parse
david_walker
handle quoted parenthesis
Branches
parse
david_walker
implementation progress report, initial (incomplete) version
Branches
parse
david_walker
rename parser and token modules
Branches
parse
david_walker
converting pos from simple string to PosContainer object
Branches
parse
david_walker
Token.pos was a single Penn Treebank token type, such as 'NN'. With this checkin, it becomes a list of PosTag namedtuple objects, each of which has a token type and a probability value. In most cases there will be only a single entry in the list, but there can be three or more. This change is necessary because the parser fails to parse some sentences given only the highest-probability part-of-speech tag for each token, but succeeds if lower-probability alternatives are present.
Branches
parse
david_walker
rename parser.py to myparser.py to avoid conflict with system module
Branches
parse
david_walker
improve handling of punctuation characters
Branches
parse
david_walker
add test for "nn-year-old" type expressions
Branches
parse
david_walker
changes needed to support YearOldRule, which depends on parse trees
Branches
parse
david_walker
launch cheap as xml-rpc server if not already running
Branches
parse
david_walker
code migrated into rules.py
Branches
parse
david_walker
passing unit tests, except improve/expand. that can be re-enabled once it is possible to search for a noun phrase
Branches
parse
david_walker
work in progress: removing transforms and making rules directly change tokens
Branches
parse
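Note on the Token.pos change logged above: the commit message describes Token.pos becoming a list of PosTag namedtuple objects, each holding a token type and a probability. The following is only a rough sketch of what that shape might look like, not the project's actual code. The field names `tag` and `prob`, the `Token` stand-in class, its helper methods, and the probability values are all assumptions made for illustration.

```python
from collections import namedtuple

# Hypothetical field names: the commit message only says that each PosTag
# carries a token type and a probability value.
PosTag = namedtuple("PosTag", ["tag", "prob"])


class Token:
    """Minimal stand-in for the project's Token class (assumed shape)."""

    def __init__(self, text, pos):
        self.text = text
        # Keep the alternatives sorted so the most probable tag comes first.
        self.pos = sorted(pos, key=lambda p: p.prob, reverse=True)

    def best_tag(self):
        """Return the highest-probability Penn Treebank tag."""
        return self.pos[0].tag

    def candidate_tags(self):
        """Return every tag, most probable first, for a parser to try."""
        return [p.tag for p in self.pos]


# Illustrative probabilities only; they are not taken from the project.
token = Token("loan", [PosTag("VB", 0.12), PosTag("NN", 0.88)])
print(token.best_tag())
print(token.candidate_tags())
```

Keeping the lower-probability alternatives on the token is what would let a parser retry with a different tag when the top-ranked one yields no parse, which is the motivation the commit message gives.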