Commits

Author Commit Message Labels Comments Date
Frederic De Groef
temporarily deactivated article download for lavenir.net (need to check the article parsing first)
Frederic De Groef
updated frontpage parsing for lavenir.net, to reflect CMS changes
Frederic De Groef
load blogpost list correctly
Frederic De Groef
sudinfo back in business
Frederic De Groef
fixed get_frontpage_toc()
Frederic De Groef
disabled old stat files, disabled make_figures
Frederic De Groef
updated version
Tags: v0.4.99-20120401-dev
Frederic De Groef
updated summary func
Frederic De Groef
changed the filters for link getters
Frederic De Groef
fixed netlocs
Frederic De Groef
detect and tag internal sites using domain
Frederic De Groef
better tags
Frederic De Groef
added utility to detect if urls belong to a specified domain
Frederic De Groef
removed unused file
Frederic De Groef
updated unit tests
Frederic De Groef
lavenir.net back in the download queue
Frederic De Groef
better tags
Frederic De Groef
bumped version
Tags: v0.4.99-20120319-dev
Frederic De Groef
write errors from reprocessed sessions into ERRORS_FILENAME
Frederic De Groef
loads errors for both types of error files if available
Frederic De Groef
temporarily disabled lavenir.net (waiting for article scraper update)
Frederic De Groef
updated version
Tags: v0.4.99-20120311-dev
Frederic De Groef
updated frontpage scraper for lavenir.net
Frederic De Groef
updated version
Frederic De Groef
better stats about last update. Commented out the deprecated functions.
Frederic De Groef
using all the frontpage scrapers
Frederic De Groef
added frontpage scraper for 7sur7
Frederic De Groef
updated readme and version
Frederic De Groef
added frontpage items extractor for levif.be
Frederic De Groef
updated url classification unittest
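
Several commits above mention a utility that detects whether URLs belong to a specified domain, used to tag internal sites. The log does not show that code, so the following is only a minimal sketch of how such a check might look in Python 3. The function name `belongs_to_domain` and its signature are hypothetical and are not taken from the repository.

```python
from urllib.parse import urlparse


def belongs_to_domain(url, domain):
    """Return True if the URL's host is `domain` or one of its subdomains.

    Hypothetical helper for illustration; not the project's actual code.
    """
    host = urlparse(url).hostname  # None for relative URLs such as "/a/b"
    if host is None:
        return False
    host = host.lower()
    domain = domain.lower().lstrip(".")
    # Match on a label boundary so "notlavenir.net" is not treated
    # as part of "lavenir.net".
    return host == domain or host.endswith("." + domain)
```

Comparing on a dot boundary rather than with a plain substring test avoids false positives for hosts that merely end with the same characters, and relative links (which have no host) are reported as not belonging to any domain.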