add scraper for J-Stage

Issue #14 resolved
Robert Jäschke created an issue

Please implement a scraper for J-Stage. It should be possible to get BibTeX by just modifying the URL. E.g. for the URL

https://www.jstage.jst.go.jp/article/dsj/9/0/9_CRIS4/_article

the BibTeX is available at

https://www.jstage.jst.go.jp/AF06S010ShoshJkuDld?sryCd=dsj&noVol=9&noIssue=0&kijiCd=9_CRIS4&kijiLangKrke=en&kijiToolIdHkwtsh=AT0073

which means that the different parts of the URL https://www.jstage.jst.go.jp/article/dsj/9/0/9_CRIS4/_article must be split and assigned to the corresponding parameters for the BibTeX URL in the following way:

https://www.jstage.jst.go.jp/article/sryCd/noVol/noIssue/kijiCd/_article

Please note that the returned BibTeX looks like this:

@article{2010CRIS4,
  title={CERIF: The Common European Research Information Format Model},
  author={Brigitte Jörg},
  journal={Data Science Journal},
  volume={9},
  number={ },
  pages={CRIS24-CRIS31},
  year={2010}
}

Hence, it must improved with respect to the following:

  1. all HTML special characters must be replaced (using the method StringEscapeUtils.unescapeHtml())
  2. the URL must be added (using the method BibTexUtils.addFieldIfNotContained())

Comments (1)

  1. Log in to comment