Currently, the BibTeX scraper also returns results when a binary file, e.g., a PDF is scraped (test this URL). This needs to be changed: the scraper should check, e.g., the MIME type of the returned document and only try to extract information from text, HTML, etc. files. Another option would be to check the extracted BibTeX for valid characters.
Issue #2102 resolved