WordCountStep needs to be simplified
Issue #693
new
The current WordCountStep
is rather slow and it is the counting part that is slow (not the annotations). We should try to speed it up, possibly getting a slightly less accurate count.
Important: Any change should be through an option or a new step so we do not change the count output from existing applications.
We already have an existing step called "SimpleWordCountStep" that does most of what we want. This step uses only ICU4J word counts.