- changed status to open
Adding text before/after first/last inline code cause invalid merged file in PPTX extraction
Original [issue 103](https://code.google.com/p/okapi/issues/detail?id=103) created by @ysavourel on 2009-08-06T20:11:53.000Z:
In the attached PPTX, the only text is extracted as
<source xml:lang="EN-US"><bpt id="1">[\#$dp2]</bpt>An Introduction to the Okapi Tools <ept id="1"></a:t></a:r></ept><ph id="2">[\#$dp3] </ph></source>
If we add text like here:
<target xml:lang="FR-FR">[<bpt id="1">[\#$dp2]</bpt>An Introduction to the Okapi Tools <ept id="1"></a:t></a:r></ept><ph id="2">[\#$dp3]</ph>] </target>
(see [ and ]) before and after the first/last inline codes, the merged PPTX is invalid. I'm guessing this is because some of the inline codes are maybe structiral codes, maybe.
Comments (4)
-
Account Deleted -
Account Deleted - changed status to resolved
Comment [2.](https://code.google.com/p/okapi/issues/detail?id=103#c2) originally posted by @ysavourel on 2009-08-17T22:04:01.000Z:
-
Account Deleted - changed status to new
- attached TestBadRewrite.pptx
Comment [3.](https://code.google.com/p/okapi/issues/detail?id=103#c3) originally posted by @ysavourel on 2009-08-19T20:19:25.000Z:
The first example provided passes now. But this new one does not. It seems the last slide has something that cause to corrupt the code when re-writting.
-
Account Deleted - changed status to resolved
Comment [4.](https://code.google.com/p/okapi/issues/detail?id=103#c4) originally posted by @ysavourel on 2009-08-31T13:44:32.000Z:
The second file uploaded now works.
- Log in to comment
Comment [1.](https://code.google.com/p/okapi/issues/detail?id=103#c1) originally posted by @ysavourel on 2009-08-12T19:28:12.000Z: