1. Jörg Tiedemann
  2. pdf2xml

Commits

tiedeman  committed 01f3e8d

bugfix in pdfxtk-based conversion

  • Participants
  • Parent commits 93bd9ff
  • Branches master

Comments (0)

Files changed (1)

File pdf2xml

View file
 
 sub find_words_pdfxtk{
     my $string = shift;
+    $string=~s/^\s*//;
 
     unless ($string=~/\s/){
 	return find_words_charlevel($string);
     }
 
-    $string=~s/^\s*//;
     my @words = ();
     my @tokens = split(/\s+/,$string);