Incorrect segmentation with inline code near break

Original issue 169 created by lukas.sta... on 2011-03-09T09:45:01.000Z:

tikal of okapi version 0.10 does not segment correctly docx files.
The attached srx file contains rule for dividing sentences to separated segments. But tikal does not separate sentences in some cases, e.g. if both are underlined and the second one is bold. The example documents are attached.
Using command (on Linux):
tikal.sh -x sentence.docx -seg test.srx
tikal.sh -x tst-alter.docx -seg test.srx

Comments (5)