Text missing when using Moses InlineText leveraging with plain text
Original [issue 178](https://code.google.com/p/okapi/issues/detail?id=178) created by Achi... on 2011-07-14T21:35:44.000Z:
Environment: Okapi M12 on Windows 7 (reproduces on Ubuntu Linux too) 1. Create a text file test.txt with this content: This is line 1. This is line 2 consisting of multiple sentences. This is the second sentence. 2. Run "tikal.bat -xm test.txt -sl en -fc okf\_plaintext -seg". This creates a file test.txt.en with this content: This is line 1. This is line 2 consisting of multiple sentences. This is the second sentence. 3. Pseudo-localize test.txt.en into test.txt.fr with this content: That is line 1. That is line 2 consisting of multiple sentences. That is the second sentence. 4. Run "tikal.bat -lm test.txt -sl en -tl fr -overtrg". This produces a file test.out.txt with this content: That is line 1. That is line 2 consisting of multiple sentences.
Result: The last sentence "That is the second sentence." is missing from the leveraged file.
Comments (4)
-
Account Deleted -
Account Deleted Comment [2.](https://code.google.com/p/okapi/issues/detail?id=178#c2) originally posted by Achi... on 2011-07-16T13:32:51.000Z:
Incorrect statement in previous comment: adding -seg to step 4. \*does\* add the second sentence.
This is not a bug - please close.
-
Account Deleted Comment [3.](https://code.google.com/p/okapi/issues/detail?id=178#c3) originally posted by Achi... on 2011-07-16T22:03:44.000Z:
The error behavior reappears if -fc okf\_plaintext is added to the options.
-
Account Deleted - changed status to resolved
Comment [4.](https://code.google.com/p/okapi/issues/detail?id=178#c4) originally posted by @ysavourel on 2011-07-19T09:28:24.000Z:
The example works for me. Basically, whenever you are using segmentation or a specific filter when extracting, you need to use the same for leveraging.
using the almost same command-line is best, just replacing -xm by -lm and adding any merging-specific options like -overtrg.
So if the extraction is:
tikal.bat -xm test.txt -sl en -fc okf\_plaintext -seg
The following should work for leveraging:
tikal.bat -lm test.txt -sl en -fc okf\_plaintext -seg -overtrg
- Log in to comment
Comment [1.](https://code.google.com/p/okapi/issues/detail?id=178#c1) originally posted by Achi... on 2011-07-15T17:43:00.000Z:
Additional investigation: This issue is reproduceible with multi-line paragraphs in HTML documents (okf\_html). It doesn't matter if in step 4. the option "-seg" is set or not.