Commits

Author Commit Message Labels Comments Date
Frederic De Groef
updated version
Tags
v0.4.99-20120401-dev
Frederic De Groef
updated summary func
Frederic De Groef
changed the filters for link getters
Frederic De Groef
fixed netlocs
Frederic De Groef
detect and tag internal sites using domain
Frederic De Groef
better tags
Frederic De Groef
added utility to detect if urls belong to a specified domain
Frederic De Groef
removed unused file
Frederic De Groef
updated unit tests
Frederic De Groef
lavenir.net back in the download queue
Frederic De Groef
better tags
Frederic De Groef
bumped version
Tags
v0.4.99-20120319-dev
Frederic De Groef
write errors from reprocessed sessions into ERRORS_FILENAME
Frederic De Groef
loads errors for both types of error files if available
Frederic De Groef
temporarily disabled lavenir.net (waiting for article scrapper update)
Frederic De Groef
updated version
Tags
v0.4.99-20120311-dev
Frederic De Groef
updated frontpage scrapper for lavenir.net
Frederic De Groef
updated version
Frederic De Groef
better stats about last update. Commented out the deprecated functions.
Frederic De Groef
using all the frontpage scrappers
Frederic De Groef
added frontpage scrapper for 7sur7
Frederic De Groef
updated readme and version
Frederic De Groef
added frontpage items extractor for levif.be
Frederic De Groef
updated url classification unittest
Frederic De Groef
return empty list for blogposts
Frederic De Groef
added frontpage items extractor for rtbfinfo
Frederic De Groef
extracted locale setup to utils, should be used everywhere.
Frederic De Groef
updated imports
Frederic De Groef
process new errors
Frederic De Groef
new date extraction