A simple app that indexes the full text of all PDFs within one or more directories and lets you search that index for terms. It uses the great PDF extraction library PDFTextStream from Snowtide.


  • Build with: lein uberjar
  • Create an index with: java -jar pdf-index.jar --index <index directory> <pdf directories>+
  • Search for one or more terms with: java -jar pdf-index.jar --search <index directory> <search term>+

Beware: You MUST NOT redistribute the resulting jar file (a restriction of the PDF library, see License).


Copyright © 2012 Steffen Dienst

Distributed under the Eclipse Public License, the same as Clojure.
