Commits

Show all
Author Commit Message Labels Comments Date
Virgil Dupras
v1.2.1
Tags
1.2.1
Virgil Dupras
Adjusted the charheight text line grouping algo again.
Virgil Dupras
Adjusted heuristic word margin multiplier.
Virgil Dupras
Use median charheight instead of avg chargeight for text line grouping. It prevents lines starting with a very big letter to mess things up.
Virgil Dupras
Work around inline image corruption in some PDFs.
Virgil Dupras
Added tag 1.2.0 for changeset ad9b4a381375
Virgil Dupras
Fixed setup.
Tags
1.2.0
Virgil Dupras
v1.2.0
Virgil Dupras
Fixed str/bytes crashes on decipher.
Virgil Dupras
Added the heuristic_word_margin param.
Virgil Dupras
Tweaked paragraph detection in layout.
Virgil Dupras
Fixed a "textlining" bug happening when the last processed char was alone: the previous char would end up forming the last line.
Virgil Dupras
Don't try to extract paragraphs from vertical textboxes.
Virgil Dupras
Fixed a crash in LTLayoutContainer.get_textlines() when called with only one object.
Virgil Dupras
Fixed crash related octal escapes in strings.
Virgil Dupras
Merged with ply-lexer branch.
Virgil Dupras
Fixed bugs with xref finding algo and hexstrings lexing.
Branches
ply-lexer
Virgil Dupras
Removed the old buffering system.
Branches
ply-lexer
Virgil Dupras
Fixed typo.
Virgil Dupras
Modernized debug logging.
Virgil Dupras
Fixed a str/byte crash.
Virgil Dupras
Added tag 1.1.0 for changeset db5df969f48c
Virgil Dupras
v1.1.0
Tags
1.1.0
Virgil Dupras
Disabled broken sample test and adjusted simple1.
Virgil Dupras
Improved layout detection for mixes of titles and paragraphs.
Virgil Dupras
Forgot to add a sample pdf in previous commit.
Virgil Dupras
Added paragraph_indent param to allow auto-detection of paragraphs in textboxes.
Virgil Dupras
Fixed a layout bug where text lines that have a slightly different height end up in a weird order.
Virgil Dupras
I didn't realize I broke the textbox layout sort function in [17ed9c7aecac]. Fixed it.
Virgil Dupras
Modernized iterations in pdfminer.layout.
  1. Prev
  2. Next