Commits

Author Commit Message Labels Comments Date
Virgil Dupras
v1.3.0
Virgil Dupras
Instead of trying to optimize grouping (which broke a test), skip it when there are too many boxes to group.
Virgil Dupras
Optimized layout.group_textboxes() to fix a problem where many text elements would cause the function to stall for eons.
Virgil Dupras
Support object references as filters in streams.
Virgil Dupras
Parse everything as soon as an objectid can't be found.
Virgil Dupras
Fixed a crash preventing reading of PDFs for which stream IDs hosting page objects weren't in the xrefs.
Virgil Dupras
Improved pdfexplore.
Virgil Dupras
Improved pdfexplore.
Virgil Dupras
Began the development of pdfexplore, a command prompt utility to explore PDFs (to debug them).
Virgil Dupras
Replaced all "if STRICT: raise stuff" idioms with handle_error(), which logs a warning message.
Virgil Dupras
Added tag 1.2.4 for changeset 7a88da3be62f
Virgil Dupras
v1.2.4
Tags
1.2.4
Virgil Dupras
When xref tables are corrupt, read all objects (including object streams) and build an xref from that.
Virgil Dupras
Fixed a bogus assertion error in layout code.
Virgil Dupras
Added tag 1.2.3 for changeset a2a60a0dbbb7
Virgil Dupras
v1.2.3
Tags
1.2.3
Virgil Dupras
Fixed meta crash on buggy PSParser repr.
Virgil Dupras
Fixed a crash on uneven cmap codes.
Virgil Dupras
Added tag 1.2.2 for changeset e18d00a4199b
Virgil Dupras
v1.2.2
Tags
1.2.2
Virgil Dupras
Don't crash on invalid dictionary constructs when parsing PostScript.
Virgil Dupras
Ignore lines with no text for textbox grouping.
Virgil Dupras
Fixed crash on trying to read invalid LZW-encoded data.
Virgil Dupras
Added tag 1.2.1 for changeset cc3e9edc9934
Virgil Dupras
v1.2.1
Tags
1.2.1
Virgil Dupras
Adjusted the charheight text line grouping algo again.
Virgil Dupras
Adjusted heuristic word margin multiplier.
Virgil Dupras
Use median charheight instead of avg charheight for text line grouping. This prevents lines starting with a very big letter from messing things up.
Virgil Dupras
Work around inline image corruption in some PDFs.