Add scraper for J-Stage
Issue #14
resolved
Please implement a scraper for J-Stage. It should be possible to get BibTeX by just modifying the URL. E.g. for the URL
https://www.jstage.jst.go.jp/article/dsj/9/0/9_CRIS4/_article
the BibTeX is available at
https://www.jstage.jst.go.jp/AF06S010ShoshJkuDld?sryCd=dsj&noVol=9&noIssue=0&kijiCd=9_CRIS4&kijiLangKrke=en&kijiToolIdHkwtsh=AT0073
which means that the different parts of the URL https://www.jstage.jst.go.jp/article/dsj/9/0/9_CRIS4/_article must be split and assigned to the corresponding parameters for the BibTeX URL in the following way:
https://www.jstage.jst.go.jp/article/sryCd/noVol/noIssue/kijiCd/_article
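The mapping above could be sketched in Java roughly as follows. This is only an illustration, not the actual scraper code: the class name `JStageUrlSketch` and the method `toBibtexUrl` are hypothetical, and the fixed parameters `kijiLangKrke=en` and `kijiToolIdHkwtsh=AT0073` are simply copied from the example BibTeX URL, on the assumption that they are the same for every article.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of the existing code base.
final class JStageUrlSketch {

    // Matches .../article/<sryCd>/<noVol>/<noIssue>/<kijiCd>/_article
    private static final Pattern ARTICLE_URL = Pattern.compile(
        "^https?://www\\.jstage\\.jst\\.go\\.jp/article/([^/]+)/([^/]+)/([^/]+)/([^/]+)/_article.*$");

    // Returns the BibTeX download URL, or null if the input is not a J-Stage article URL.
    static String toBibtexUrl(String articleUrl) {
        Matcher m = ARTICLE_URL.matcher(articleUrl);
        if (!m.matches()) {
            return null;
        }
        return "https://www.jstage.jst.go.jp/AF06S010ShoshJkuDld"
            + "?sryCd=" + m.group(1)
            + "&noVol=" + m.group(2)
            + "&noIssue=" + m.group(3)
            + "&kijiCd=" + m.group(4)
            // Assumption: these two values are constant, as in the example URL.
            + "&kijiLangKrke=en&kijiToolIdHkwtsh=AT0073";
    }

    public static void main(String[] args) {
        System.out.println(toBibtexUrl(
            "https://www.jstage.jst.go.jp/article/dsj/9/0/9_CRIS4/_article"));
    }
}
```

Returning null for non-matching URLs lets a caller treat "not a J-Stage article page" as "this scraper does not apply" without catching an exception.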
Please note that the returned BibTeX looks like this:
@article{2010CRIS4,
title={CERIF: The Common European Research Information Format Model},
author={Brigitte Jörg},
journal={Data Science Journal},
volume={9},
number={ },
pages={CRIS24-CRIS31},
year={2010}
}
Hence, it must be improved in the following respects:
- all HTML special characters must be replaced (using the method StringEscapeUtils.unescapeHtml())
- the URL must be added (using the method BibTexUtils.addFieldIfNotContained())
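To make the two post-processing steps concrete, here is a self-contained sketch in plain Java. It does not call StringEscapeUtils.unescapeHtml() or BibTexUtils.addFieldIfNotContained(); the methods `unescapeBasicHtml` and `addFieldIfAbsent` are hypothetical stand-ins that only approximate what those methods are expected to do, and the stand-in for unescaping covers just a handful of entities. The escaped input in `main` is made up for illustration.

```java
import java.util.regex.Pattern;

// Hypothetical stand-ins, written only to illustrate the two steps.
final class BibtexPostProcessSketch {

    // Stand-in for StringEscapeUtils.unescapeHtml(): handles only a few entities.
    static String unescapeBasicHtml(String s) {
        return s.replace("&ouml;", "\u00f6")
                .replace("&lt;", "<")
                .replace("&gt;", ">")
                .replace("&quot;", "\"")
                .replace("&amp;", "&"); // last, so "&amp;lt;" becomes "&lt;", not "<"
    }

    // Stand-in for BibTexUtils.addFieldIfNotContained(): appends name={value}
    // before the final closing brace unless such a field seems to be present.
    static String addFieldIfAbsent(String bibtex, String name, String value) {
        // Naive presence check: a comma, optional whitespace, the field name, then '='.
        Pattern field = Pattern.compile("(?i),\\s*" + Pattern.quote(name) + "\\s*=");
        if (field.matcher(bibtex).find()) {
            return bibtex;
        }
        int end = bibtex.lastIndexOf('}');
        if (end < 0) {
            return bibtex; // not a complete entry, leave untouched
        }
        // Drop trailing whitespace and commas before appending the new field.
        String head = bibtex.substring(0, end).replaceFirst("[\\s,]+$", "");
        return head + ",\n" + name + "={" + value + "}\n}";
    }

    public static void main(String[] args) {
        String raw = "@article{2010CRIS4,\nauthor={Brigitte J&ouml;rg},\nyear={2010}\n}";
        String url = "https://www.jstage.jst.go.jp/article/dsj/9/0/9_CRIS4/_article";
        System.out.println(addFieldIfAbsent(unescapeBasicHtml(raw), "url", url));
    }
}
```

Unescaping before adding the URL keeps the URL itself out of the entity replacement, so a literal `&` in a query string is not altered.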
Comments (1)
- Account Deleted: JStageScraper constructs the new URL and gets the BibTeX file.