Commits

Author Commit Message Labels Comments Date
Juliette De Maeyer
[tests] sudinfo test data
Juliette De Maeyer
[test] [sudinfo] added test for embedded video extraction
Juliette De Maeyer
[tests] improved test tools to handle unicode errors in urls
Juliette De Maeyer
[septsursept, sudpresse, sudinfo] tagging conventions : "embedded" instead of "embedded media"
Juliette De Maeyer
[sudpresse] added embedded media (iframe) extraction
Juliette De Maeyer
[test] [sudpresse] added test for 'sidebar box' tagging
Juliette De Maeyer
[sudpresse] added 'sidebar box' tag
Frederic De Groef
[scripts] added an option to filter the sources of which errors have to be listed
Frederic De Groef
[scripts] added an option to filter the sources of which errors have to be listed
Branches
sudpresse_media_detection_improved
Frederic De Groef
[sudpresse] post merge bs
Branches
sudpresse_media_detection_improved
Frederic De Groef
Merge
Branches
sudpresse_media_detection_improved
Frederic De Groef
[tests/lalibre] added a test for the rendered tweet extraction
Frederic De Groef
[tests] saving html for a specific generated test is optional
Frederic De Groef
[lalibre] fixed index.json for test data, reorganised the html files by test type
Frederic De Groef
[lalibre] when extracting text fragments located outside a <p> element, process all the children that match a text markup tag
Frederic De Groef
[dhnet] added support to extract embedded rendered tweets
Frederic De Groef
[tests] added a helper function to generate a test and update the test_data at once
Juliette De Maeyer
[lesoir_new] added test for 'same owner' tagging
Juliette De Maeyer
[rossel, ipm] better 'same owner' lists
Juliette De Maeyer
[lesoir_new] improved plaintext link detection to avoid extraction of true links (with url as title) as plaintext + added 'same owner' tagging
Juliette De Maeyer
[lesoir_new] added multiple sidebar boxes detection
Juliette De Maeyer
[tests, lavenir] updated test to reflect change in tagging ('sidebar' --> 'sidebar box')
Juliette De Maeyer
[lavenir] tagging: 'sidebar box' instead of 'sidebar'
Juliette De Maeyer
[tests, lavenir] added test for 'same owner' tagging
Juliette De Maeyer
[tests, sudpresse] finalized test for 'same owner' tagging
Juliette De Maeyer
[tests, lalibre, dhnet] 'same owner' tagging tests now says what they do
Juliette De Maeyer
[tests, lesoir] added a test for 'same owner' tagging
Juliette De Maeyer
Merge
Juliette De Maeyer
[tests, sudpresse] added a test for 'same owner' tags
Juliette De Maeyer
[lavenir, lesoir, sudinfo, sudpresse] added 'same owner' tagging
  1. Prev
  2. Next