BaGLAMa: add sv.wiki Lsjbot creations

Issue #12 new
Federico Leva created an issue

If possible without too much effort, it would be nice to see the effects of mass imports of data from external institutions.

Specifically, I'd like to see page view stats for the sv.wiki articles on forms of life created by Lsjbot. They don't have a single category, but they all contain the template {{robotskapad}} (used on 994,238 pages); if a category is easier to work with, one can probably be added. I tried TreeViews but that's just too much for it. :)
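
For illustration, here is a rough sketch of how the pages carrying the template could be enumerated through the MediaWiki Action API (`list=embeddedin`), instead of relying on a category. The helper names `embeddedin_params` and `iter_transclusions` are hypothetical, the template title `Template:Robotskapad` assumes the canonical namespace prefix resolves on sv.wiki, and the `fetch` callable (which performs the HTTP request and returns parsed JSON) is left to the caller. This is not necessarily how BaGLAMa collects its page lists.

```python
from typing import Callable, Dict, Iterator, Optional

# Assumed endpoint for Swedish Wikipedia's Action API.
API_URL = "https://sv.wikipedia.org/w/api.php"


def embeddedin_params(template: str,
                      cont: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Build query parameters listing main-namespace pages that transclude `template`."""
    params = {
        "action": "query",
        "list": "embeddedin",
        "eititle": template,
        "einamespace": "0",   # articles only
        "eilimit": "500",     # usual per-request maximum for non-bot clients
        "format": "json",
    }
    if cont:
        # Merge the continuation values returned by the previous response.
        params.update(cont)
    return params


def iter_transclusions(template: str,
                       fetch: Callable[[Dict[str, str]], dict]) -> Iterator[str]:
    """Yield page titles, following `continue` until the list is exhausted.

    `fetch` is a caller-supplied (hypothetical) function that sends the
    parameters to API_URL and returns the decoded JSON response.
    """
    cont = None
    while True:
        data = fetch(embeddedin_params(template, cont))
        for page in data["query"]["embeddedin"]:
            yield page["title"]
        cont = data.get("continue")
        if not cont:
            break
```

At 500 titles per request, a template used on 994,238 pages would take roughly 2,000 requests to enumerate.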

Comments (1)

  1. Federico Leva reporter

    Another attempt with TreeViews: I tried https://sv.wikipedia.org/wiki/Kategori:Robotskapade_artiklar and Chromium ran fast, using almost two cores, but the tab eventually crashed (~5 minutes, 1+ GB RAM, 8398 requests and 40 MB later).

    With a single subcategory containing 50k articles I think it will finish: pageview data for one month is fetched in my browser at a rate of about 10k articles every 5 minutes, with about 17 % of requests failing on the first attempt (500 error from Cassandra) and about 10 % of the pages having some data.
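
The figures above can be turned into a rough time estimate, and the first-attempt 500 errors suggest retrying. The following is a minimal sketch under stated assumptions: the URL follows the public Wikimedia Pageviews REST API per-article route, while `estimated_minutes`, `pageviews_url`, `fetch_with_retry`, `ServerError` and the `fetch` callable are hypothetical names introduced here, not part of TreeViews or BaGLAMa.

```python
import time
from typing import Callable
from urllib.parse import quote


def estimated_minutes(n_articles: int,
                      batch_size: int = 10_000,
                      minutes_per_batch: float = 5.0) -> float:
    """Scale the observed rate (about 10k articles per 5 minutes) linearly."""
    return n_articles / batch_size * minutes_per_batch


def pageviews_url(project: str, article: str, start: str, end: str) -> str:
    """Build a per-article daily pageviews URL; dates are YYYYMMDD strings."""
    title = quote(article.replace(" ", "_"), safe="")
    return ("https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/"
            f"{project}/all-access/user/{title}/daily/{start}/{end}")


class ServerError(Exception):
    """Raised by the caller-supplied fetch function on an HTTP 5xx response."""


def fetch_with_retry(fetch: Callable[[str], dict], url: str,
                     attempts: int = 3, base_delay: float = 1.0,
                     sleep: Callable[[float], None] = time.sleep) -> dict:
    """Call fetch(url), retrying on ServerError with exponential backoff."""
    for attempt in range(attempts):
        try:
            return fetch(url)
        except ServerError:
            if attempt == attempts - 1:
                raise  # give up after the last attempt
            sleep(base_delay * 2 ** attempt)
    raise ValueError("attempts must be at least 1")
```

At the reported rate, `estimated_minutes(50_000)` gives 25 minutes for the 50k-article subcategory, and the same linear scaling puts all 994,238 pages at a bit over 8 hours, ignoring retries.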
