XMLStreamFilter while handling HTML CDATA produces spurious segments.
Original issue 332 created by 143.ravik... on 2013-04-26T09:38:08.000Z:
With Okapi M20, the XMLStream filter, when parsing CDATA sections with the HTML sub-filter, creates an extra spurious placeholder-only text unit for each CDATA section in the file:
<trans-unit id="tu1" restype="x-cdata">
<source xml:lang="en-us"><ph id="1">[#$tu1_ssf1]</ph></source>
<seg-source><mrk mid="0" mtype="seg"><ph id="1">[#$tu1_ssf1]</ph></mrk></seg-source>
<target xml:lang="fr-fr"><mrk mid="0" mtype="seg"><ph id="1">[#$tu1_ssf1]</ph></mrk></target>
</trans-unit>
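To check an extracted XLIFF file for this symptom, a small script along the following lines may help. It is only a sketch, not part of Okapi: the function name `placeholder_only_units` and the marker pattern `[#$...]` are assumptions inferred from the fragment above, and the elements are assumed to carry no namespace (a full XLIFF 1.2 document would need the namespace handled).

```python
import re
import xml.etree.ElementTree as ET

# Matches text that is nothing but a sub-filter reference marker such as
# "[#$tu1_ssf1]". The pattern is inferred from the fragment in this report.
REF_ONLY = re.compile(r"^\[#\$[^\]]+\]$")


def placeholder_only_units(xliff_fragment: str) -> list[str]:
    """Return ids of trans-units whose <source> holds only a reference marker."""
    # Wrap the fragment so several sibling trans-units parse as one document.
    root = ET.fromstring(f"<body>{xliff_fragment}</body>")
    hits = []
    for tu in root.iter("trans-unit"):
        source = tu.find("source")
        if source is None:
            continue
        # Concatenate all text inside <source>, including text inside <ph>.
        text = "".join(source.itertext()).strip()
        if REF_ONLY.match(text):
            hits.append(tu.get("id"))
    return hits


# Hypothetical sample: tu1 mirrors the reported unit, tu2 has real text.
SAMPLE = """
<trans-unit id="tu1" restype="x-cdata">
<source xml:lang="en-us"><ph id="1">[#$tu1_ssf1]</ph></source>
</trans-unit>
<trans-unit id="tu2">
<source xml:lang="en-us">Hello <ph id="1">[#$tu2_ssf1]</ph></source>
</trans-unit>
"""

if __name__ == "__main__":
    print(placeholder_only_units(SAMPLE))
```

Units reported by this check are candidates for the spurious wrapper described here; a unit that mixes translatable text with a marker, like `tu2` in the sample, is not flagged.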
This issue was discussed in the following ticket:
http://code.google.com/p/okapi/issues/detail?id=320
Further, the fix was marked as dependent on ticket #303, which was similar but related to PCDATA parsing instead of CDATA:
http://code.google.com/p/okapi/issues/detail?id=303
The fix for ticket #303 doesn't seem to work for ticket #320. I was unable to reopen ticket #320, hence I am opening a new one.
Comments (2)
Account Deleted - changed status to resolved
Comment 2. originally posted by @ysavourel on 2013-04-26T20:50:26.000Z:
Fixed on dev. Commit is 596b2a84ac4ed1c3f451ae8361843258c73afbf6
For some reason I decided to call this bug "Issue #328" in the commit message instead of #332. Oops. Regardless, it is the fix for this problem.
Comment 1. originally posted by @ysavourel on 2013-04-26T16:14:01.000Z:
Taking this