Commits

Author Commit Message Labels Comments Date
Frederic De Groef
converted queries.py into an actual figure generator script
Tags: v0.2.0
Frederic De Groef
save raw data to disk so we can reprocess them if needed
Frederic De Groef
data extractors also yield the raw data
Frederic De Groef
save figures as image
Frederic De Groef
detect and tag when a url is hosted on same domain (non referenced internal blog)
Frederic De Groef
datetime is not date
Frederic De Groef
removed useless print
Frederic De Groef
[La Libre] updated for underlying changes
Frederic De Groef
[DHNet] updated for underlying changes
Frederic De Groef
[Le Soir] adding 'in text' tag
Frederic De Groef
Added tag before-big-changes for changeset fe8b79ef2ed4
Frederic De Groef
[Le Soir] updated to reflect api changes
Frederic De Groef
no more external/internal attrs. Added properties for convenience. Rebuilds TaggegURL objects when deserializing from json
Frederic De Groef
extracted the html markup cleanup function
Frederic De Groef
new test case for url detection : no url
Frederic De Groef
Merge w/ new url detection
Frederic De Groef
updated hgignore
Frederic De Groef
added <strong> as a text formatting tag to cleanup
Frederic De Groef
added 'in text' tags
Frederic De Groef
new test
iadanja
Changed the plaintext url detection to use this solution instead: http://www.codinghorror.com/blog/2008/10/the-problem-with-urls.html
Frederic De Groef
added a way to detect urls in plaintext + associated testcases
Frederic De Groef
stupid readme update
Frederic De Groef
[ArticleData] extracted the datetime string conversion funcs
Frederic De Groef
[dhnet] more dyslexia
Frederic De Groef
tinkering with plotting
Frederic De Groef
added a function to classify a url with tags.
Frederic De Groef
[La Libre] fixed tag name
Frederic De Groef
[La Libre] fuck yeah dyslexia. Also, new tag.
Frederic De Groef
[Le Soir] Sometimes there is no title for a link. Brilliant.