Commits

Author Commit Message Labels Comments Date
Stephen Jones
Merged
Stephen Jones
Added script to make dpkgs.
Stephen Jones
Added some more output files to .hgignore.
Thomas Figg
fix for broken arc files
Thomas Figg
unicode strip
Thomas Figg
fix
Thomas Figg
unicode fix?
Thomas Figg
strip weirdo chars
Thomas Figg
using lxml
Thomas Figg
skip junk html
Thomas Figg
adding warclinks.py
Stephen Jones
Make tests ready for jenkins
Stephen Jones
Added tag 4.7 for changeset 6f9a6c0a1fc7
Branches
4.7
Stephen Jones
Added a test_suite to setup.py
Tags
4.7
Stephen Jones
Added tag 4.4-rc0 for changeset 91c98d2df6ae
Branches
4.4
Thomas Figg
include check for unused data
Thomas Figg
read until we have something to avoid reading \r without following \n
Thomas Figg
removing skip_newline as it means content-length is wrong
Thomas Figg
reporting collisions in names generated
Tags
4.4-rc0
Thomas Figg
move to IA, cleaning up IA options
Thomas Figg
logging support in warcunpack
Thomas Figg
adding logging, cleaning up bits
Thomas Figg
adding filename collision handling, default name as command line option
Thomas Figg
warcunpack - making clean filename
Thomas Figg
warcunpack - making clean filename
Thomas Figg
skeleton of warcunpack
Thomas Figg
fixing \r\n problem on chunk_size boundary when reading gzipd warcs - issue #4
Stephen Jones
Added tag 4.3-rc0 for changeset 17110fd860a8
Branches
4.3
Thomas Figg
Added tag hanzo-4.1-rc4 for changeset f54be58d0d8b
Tags
4.3-rc0
Thomas Figg
Added tag hanzo-4.1-rc2 for changeset 8ceff9fcde58
Tags
hanzo-4.1-rc4
  1. Prev
  2. Next