- changed status to open
DBLP Scraper should support all DBLP URLs
The DBLP scraper currently supports URLs of the form
Since DBLP supports export to other formats, the scraper should also support these pages. For instance:
- http://dblp.uni-trier.de/rec/bibtex/books/sp/stdesign14/AtzmuellerBHKM0SSS14
- http://dblp.uni-trier.de/rec/xml/books/sp/stdesign14/AtzmuellerBHKM0SSS14.xml
- http://dblp.uni-trier.de/rec/rdf/books/sp/stdesign14/AtzmuellerBHKM0SSS14.rdf
- http://dblp.uni-trier.de/rec/ris/books/sp/stdesign14/AtzmuellerBHKM0SSS14.ris
This is easy to accomplish as a subclass of the GenericBibTeXURLScraper
, since the DBLP ID can be extracted from all of the above paths with a simple Regexp like /rec/(bibtex|xml|rdf|ris|html)/(.+)(\.xml|\.rdf|\.ris)?
Comments (6)
-
reporter -
reporter - edited description
-
reporter Please also ensure that the three different hosts
are supported.
-
reporter Additionally, the
bib1
andbib2
formats should also be supported: -
- changed status to resolved
resolving
#2562; closing#2562Now the DBLPSraper can extract Data from the following hosts http://dblp.uni-trier.de/<path>; http://dblp.dagstuhl.de/<path>; http://dblp.org/<path>; in the extensions .xml, .rdf and .ris and the path /html, /bibtex, /bib1 and /bib2→ <<cset c538e0a09dde>>
-
- changed status to closed
resolving
#2562; closing#2562Now the DBLPSraper can extract Data from the following hosts http://dblp.uni-trier.de/<path>; http://dblp.dagstuhl.de/<path>; http://dblp.org/<path>; in the extensions .xml, .rdf and .ris and the path /html, /bibtex, /bib1 and /bib2→ <<cset c538e0a09dde>>
- Log in to comment