sanitize.py - consider only a subset of reads
Some transcriptome data sets are far larger than are needed (or is even beneficial) for a good transcriptome assembly.
Add an option to
million that allows the user to specify a float. If not specified, all reads are considered. If it is specified, just the first X reads are considered (as specified by