DOCX rewritten as recoverable file
Some DOCX files that open OK in Word, and get processed without problem with the Okapi filter, have a problem when they get re-written: Word sees them as recoverable files. You get prompted to let Word recover them. If you answer Yes, the file get read fine. For example, the attached test1.docx file is such file. If I process it to change its text with Rainbow (Text Rewriting) we get back the attached test1.out.docx: that opens fine, but only after a prompt. This is with latest 1.46-snapshot, but this is true for any previous version I have tried too. If I open the test1.docx in Word and save it without any change, the resulting file processes fine and opens fine. A possible difference I can see is that the original test1.docx seems to have some mac word namespaces.
Comments (4)
-
-
@YvesS a solution is available as a 2nd commit within pull request #802.
-
reporter Thanks a bunch Dennis. It does fix the issue.
-
reporter - changed status to resolved
Fixed.
- Log in to comment
@YvesS thank you for documenting this! The issue can be recreated with
translatePowerpointDocProperties.b=true
parameter. ThedocProps/core.xml
is corrupted after the merge. Original:Merged: