Issue #99 resolved

Improve `load` behavior for protein data

Casey Dunn
repo owner created an issue

At present, load assumes that when a protein file is provided, the headers in the protein and nucleotide file are the same. The handling of protein data with --generic is not explicitly described.

It should be possible to load protein data with generic headers, with or without simultaneously loading nucleotide data.

If --generic is specified and both nucleotide and protein data are being loaded, it should be assumed that the sequences are in the same order in the two files. This will require using ordered dictionaries to store protein and nucleotide data.

Comments (1)

  1. Mark Howison

    overhaul of load: proteins can be load independently of nucleotides (fixes #99) the --generic flag is no longer needed, and the type is automatically detected * genome and molecule type is inferred for sequences with BLAST hits

    → <<cset d820e5c58f62>>

  2. Log in to comment