Disable pdftotext

Issue #587 resolved
supporthq created an issue

Hi,

Can I disable pdftotext? It consuming alot of CPU and I dont have a need to search inside PDF's. Most of the PDF files are merly scanned documents.

Thanks and let me know if this is a bad idea.

Comments (7)

  1. supporthq reporter

    thanks for the awesomely quick reply....any suggestion on how to disable? Cant seem to find a reference on how to do that

  2. Janos SUTO repo owner

    Currently you can't disable it. If the configure script can find it, then it's enabled. So remove or rename the pdftotext utility (poppler package), then recompile piler.

  3. Janos SUTO repo owner

    If you can skip indexing attachments at all, then set the following in piler.conf:

    extract_attachments=0
    
  4. supporthq reporter

    Hi,

    Have added 'extract_attachments=0' to '/usr/local/etc/piler.conf', but I am still seeing pdftotext running after a reboot, am I missing something here?

    Output of piler -v below, maybe my vewrsion is too old?

    piler 0.1.24-master-branch, build 836, Janos SUTO sj@acts.hu Build Date: Sun Sep 1 21:35:24 CEST 2013 ldd version: ldd (Debian EGLIBC 2.13-38) 2.13 gcc version: gcc version 4.7.2 (Debian 4.7.2-5) Configure command: ./configure --localstatedir=/var --enable-starttls --enable-tcpwrappers --with-database=mysql

  5. Janos SUTO repo owner

    Yes, your version is too old, look at the build date, almost 2 years old, and doesn't support this keyword.

  6. Log in to comment