Commits

Author Commit Message Labels Comments Date
Frederic De Groef
more pep8 stuff
Frederic De Groef
pep8 stuff
Juliette De Maeyer
Merge
Juliette De Maeyer
[7sur7] improved embedded media detection to extract spottily player
Juliette De Maeyer
[7sur7] improved twitter widget extraction to extract more than one embedded tweet
Juliette De Maeyer
[7sur7] improved extract_links_from_readmore_box (sometimes there's no read more box)
Juliette De Maeyer
[7sur7] improved title extraction for in text links
Juliette De Maeyer
[7sur7] embedded media in multimedia box : added poll detection (polls are ignored)
Juliette De Maeyer
[7sur7] improved embedded media detection, new type of media previously unknown (liveleaks video)
Juliette De Maeyer
[7sur7] improved embedded media detection: fixed the duplication of embedded iframe detection by reducing the scope of generic iframe mining (previously looked for generic iframe in the whole article container, now only in article text — other parts of the function look for iframe in side or bottom media containers)
Juliette De Maeyer
[7sur7] moved things around in the extract_intro function (no need to look for links in the intro text if there's no intro text…)
Juliette De Maeyer
[7sur7]
Juliette De Maeyer
[7sur7] fixed intro extraction (now sanitizes intro text)
Juliette De Maeyer
[lesoir_new] extract_article_data now returns ArticleData
Frederic De Groef
real list flattening with itertools.
Frederic De Groef
Added tag v0.4.99-20120311-dev for changeset c49e9dd17764
Frederic De Groef
Added tag v0.4.99-20120319-dev for changeset 885c56ff09a4
Frederic De Groef
Added tag v0.4.99-20120401-dev for changeset 97450611def0
Frederic De Groef
Added tag v0.4.99-20120610-dev for changeset a6b202d95f2d
Frederic De Groef
Added tag v0.4.99-20120626-dev for changeset ecf37ca297b4
Frederic De Groef
Added tag v0.4.99-20120812-dev for changeset b93ed36ce1bd
Frederic De Groef
Added tag v0.4.99-20120915-dev for changeset 22168f635d32
Frederic De Groef
Added tag v0.4.99-20120916-dev for changeset 4c92d2b064ae
Frederic De Groef
Added tag v0.4.99-20121002-dev for changeset fe583ecc0c4c
Frederic De Groef
Added tag v0.4.99-20121226-dev for changeset 89c26ee5578e
Frederic De Groef
Added tag v0.4.99-20121124-dev for changeset 5706bca2b831
Frederic De Groef
Removed tag 0.4.99-20121021-dev
Frederic De Groef
Removed tag 0.4.99-20121124-dev
Frederic De Groef
Removed tag 0.4.99-20121226-dev
Frederic De Groef
Added tag v0.4.99-20121021-dev for changeset a3d411b88215
  1. Prev
  2. Next