Commits

Author Commit Message Labels Comments Date
tiedeman
new release version
tiedeman
added language detection
tiedeman
added maerg-paragraph heuristics
tiedeman
pdfxtk source with my little modification
Branches
pdfxtk
tiedeman
test 1 is fixed
tiedeman
no unneccesary de-hyphenation
tiedeman
do not add ligatures tosingle letters
tiedeman
new string segmenter approach
tiedeman
fixed test suite
tiedeman
added yet another heuristics for ligature handling
tiedeman
added yet another heuristics for ligature handling
tiedeman
added ligature handling
tiedeman
more post-processing for pdfxtk
tiedeman
bugfix in pdfxtk-based conversion
tiedeman
better pdfxtk conversion
tiedeman
fixed test suite
tiedeman
fixed test suite documents
tiedeman
post-processing of pdfxtk output
tiedeman
post-processing of pdfxtk output
tiedeman
post-processing of pdfxtk output
tiedeman
fix in test suite
Joerg Tiedemann
skip poppler-test if no pdftotext is found
tiedeman
fixed test failures on Linux (?)
tiedeman
more efficient find_words
tiedeman
correct sharedir
Jörg Tiedemann
test pdftotext version
tiedeman
pdfXtk added
tiedeman
lowercasing added
tiedeman
autofill in README
tiedeman
more options and better merging heuristics
  1. Prev
  2. Next