PPTX slides not extracted in order
Issue #325
resolved
Original issue 325 created by @ysavourel on 2013-04-09T14:58:10.000Z:
The slides of PPTX files are not extracted in the order they are used.
Comments (3)
-
Account Deleted -
Account Deleted - attached Okapi-325.pptx
Comment 2. originally posted by @ysavourel on 2013-04-20T04:26:41.000Z:
Attaching fresh copy of that PPTX since it looks like I corrupted it by mistake when I was messing with it.
-
Account Deleted - changed status to resolved
Comment 3. originally posted by @ysavourel on 2013-04-24T22:22:18.000Z:
Fixed on dev, commit 1283327f7a31d9def3c7427ab7f6e4acc52fff70
- Log in to comment
Comment 1. originally posted by @ysavourel on 2013-04-10T16:54:08.000Z:
Very simple testcase attached. It looks like the filter is depending on the ordering of the slides in the ZIP archive. The correct way to do this is to parse ppt/_rels/presentation.xml.rels and look for all relationships of type http://schemas.openxmlformats.org/officeDocument/2006/relationships/slide, and use that data to map IDs to the slide XML files in the archive. Then parse ppt/presentation.xml and use the IDs to figure out the correct ordering of the slide data.