Adding text before/after first/last inline code cause invalid merged file in PPTX extraction

Issue #103 resolved
Former user created an issue

Original [issue 103](https://code.google.com/p/okapi/issues/detail?id=103) created by @ysavourel on 2009-08-06T20:11:53.000Z:

In the attached PPTX, the only text is extracted as

<source xml:lang="EN-US"><bpt id="1">[\#$dp2]</bpt>An Introduction to the Okapi Tools <ept id="1">&lt;/a:t>&lt;/a:r></ept><ph id="2">[\#$dp3] </ph></source>

If we add text like here:

<target xml:lang="FR-FR">[<bpt id="1">[\#$dp2]</bpt>An Introduction to the Okapi Tools <ept id="1">&lt;/a:t>&lt;/a:r></ept><ph id="2">[\#$dp3]</ph>] </target>

(see [ and ]) before and after the first/last inline codes, the merged PPTX is invalid. I'm guessing this is because some of the inline codes are maybe structiral codes, maybe.

Comments (4)

  1. Log in to comment