Commits

Author Commit Message Labels Comments Date
Virgil Dupras
v1.3.0
Virgil Dupras
Instead of trying to optimize grouping (which broke a test), skip it when there's too many boxes to group.
Virgil Dupras
Virgil Dupras
Optimized layout.group_textboxes() to fix a problem where many text elements would cause the function to stall for eons.
Virgil Dupras
Support object references as filters in streams.
Virgil Dupras
Parse everything as soon as an objectid can't be found.
Virgil Dupras
Fixed a crash preventing redeaing of PDFs for which stream IDs hosting page objects weren't in the xrefs.
Virgil Dupras
Improved pdfexplore.
Virgil Dupras
Improved pdfexplore.
Virgil Dupras
Began the development of pdfexplore, a command prompt utility to explore PDFs (to debug them).
Virgil Dupras
Replaced all "if STRICT: raise stuff" idioms by handle_error(), which logs a warning message.
Virgil Dupras
v1.2.4
Tags
1.2.4
Virgil Dupras
When xref tables are corrupt, read all objects (including object streams) and build a xref from that.
Virgil Dupras
Fixed a bogus assertion error in layout code.
Virgil Dupras
v1.2.3
Tags
1.2.3
Virgil Dupras
Fixed meta crash on buggy PSParser repr.
Virgil Dupras
Fixed a crash on uneven cmap codes.
Virgil Dupras
v1.2.2
Tags
1.2.2
Virgil Dupras
Don't crash on invalid dictionary constructs when parsing postscript.
Virgil Dupras
Ignore lines with no text for textbox grouping.
Virgil Dupras
Fixed crash on trying to read invalid LZW-encoded data.
Virgil Dupras
Added tag 1.2.1 for changeset cc3e9edc9934
Virgil Dupras
v1.2.1
Tags
1.2.1
Virgil Dupras
Adjusted the charheight text line grouping algo again.
Virgil Dupras
Adjusted heuristic word margin multiplier.
Virgil Dupras
Use median charheight instead of avg chargeight for text line grouping. It prevents lines starting with a very big letter to mess things up.
Virgil Dupras
Work around inline image corruption in some PDFs.
Virgil Dupras
v1.2.0
Virgil Dupras
Fixed str/bytes crashes on decipher.
Virgil Dupras
Added the heuristic_word_margin param.
  1. Prev
  2. Next