1. Brian Kerr
  2. Ann Arbor Government Documents
Issue #30 new

Extract PDF text

Matt Hampel
created an issue

The system should extract PDF text -> HTML wherever possible.

Perhaps use http://www.unixuser.org/~euske/python/pdfminer/index.html

Comments (0)

  1. Log in to comment