Improving biorxiv scraper and fix HTTP client generation

Merged
#40 · Created  · Last updated

Merged pull request

Merged in fix-2872 (pull request #40)

5e075fd·Author: ·Closed by: ·2021-12-23

Description

https://bitbucket.org/bibsonomy/bibsonomy/issues/2872/no-connection-timeouts-configured-when

  • added new tests

  • fixed BIBTEX_PATTERN regex of BioRxivScraper

  • HTTP client was never built with defaultconfig in the first place

tests randomly succeed and randomly timeout as biorxiv sometimes needs over one minute to reply to our requests. the tests for which this happens seem to be randomly affected.

Nevertheless mergeworthy IMHO as the HTTP client construction gets fixed, the biorxiv-scraper works faster for some link-heavy pages and tests have been updated

0 attachments

0 comments

Loading commits...