Commits

Show all
Author Commit Message Labels Comments Date
Frederic De Groef
[sudinfo] added a stub test module
Branches
sudinfo_unittests
Frederic De Groef
[sudinfo] updated selectors
Branches
sudinfo_unittests
Frederic De Groef
[dhnet] embedded tweet detection
Frederic De Groef
clamp lengths in print_TaggedURLs()
Frederic De Groef
[tests] show both TaggedURL lists when they don't match
Frederic De Groef
[tests/dhnet] converted index.py to index.json
Frederic De Groef
[lalibre] handle text paragraphs outside <p></p> tags
Frederic De Groef
[lalibre] improved content and intro extraction
Frederic De Groef
when processing items in the queue, save the raw data even if there was an error.
Frederic De Groef
renamed a parameter in test generator
Frederic De Groef
Merge
Juliette De Maeyer
[lalibre] list of files with embedded tweets
Juliette De Maeyer
[lalibre] added extraction of embedded tweets (rendered tweets)
Juliette De Maeyer
[lalibre] when processing paragraphs and fragments, we now ignore embedded tweets
Frederic De Groef
os.exists -> os.path.exists
Frederic De Groef
removed a useless param in the test generator
Frederic De Groef
[tests] minor cleanups and renames
Frederic De Groef
[tests] added a test for the test function generator. this is so meta
Frederic De Groef
renamed a file so it's not collected by nose
Frederic De Groef
removed format string based test generator for general suckiness
Frederic De Groef
[tests] added a helper function to generate a link extraction unit test.
Frederic De Groef
reorganised tests to match package structure
Frederic De Groef
rename csxj.datasources.common to csxj.datasources.parser_tools
Frederic De Groef
[test_lalibre] added link extraction test suite for lalibre.py
Frederic De Groef
[ipm_utils] strip titles
Frederic De Groef
[csxj_test_tools] compare TaggedURL lists equality using sets before doing per-item comparison in sorted lists
Frederic De Groef
[lalibre] added audio files extraction. import cleanup
Frederic De Groef
[dhnet] import cleanup
Frederic De Groef
doctoring are nice
Frederic De Groef
don't run nose in verbose mode by default
  1. Prev
  2. Next