Commits

Author Commit Message Labels Comments Date
Joerg Tiedemann
fixed problem witgh raw files (space after newline!)
Joerg Tiedemann
added non-empty/empty alignment ratio output
Joerg Tiedemann
now with sorted time frames
tiedeman
additional string cleanup before parsing XML and before converting to XML
tiedeman
time-oberlap ratio added
tiedeman
fixed module name
tiedeman
added use lib in initial test
tiedeman
added srt-dir in test
tiedeman
language flags in srtalign
tiedeman
srt2xml now in utf8
tiedeman
added tests
tiedeman
better handling of contractions
tiedeman
added dependency on proper version of Locale::Codes::Language
tiedeman
use nonbreaking prefixes from Uplug
tiedeman
fixed deocding issues
tiedeman
better splitter
tiedeman
handle inline XML markup in Chinese
tiedeman
additional regex for fixing English OCR errors and fixed search path for non-breaking prefixes
tiedeman
fixed tokenization with line breaks in raw file output
tiedeman
make package structure
tiedeman
inital import
tiedeman
inital import