Adding indel support to buildPhylipLineage
It'd be super helpful to add indel support to the buildPhylipLineage function (maybe using the SEQUENCE_INPUT column?) Our data has a ton of indels (both real and erroneous) and we're really interested in seeing how indels increase diversity in our lineages of interest; would be happy to talk more about this by email or whatever.
Comments (5)
-
-
Oh, I forgot to mention. I recently revived the
AlignRecords
script in changeo (the version on bitbucket), which will allow you to perform a multiple alignment between the germline column andSEQUENCE_VDJ
orSEQUENCE_INPUT
columns, which should let you preserve indels for lineage building:http://changeo.readthedocs.io/en/latest/tools/AlignRecords.html
-
reporter Hey Jason, thanks for all your help. I tried AlignRecords just now but it didn't work; can I send you my data and the error message to see if its something you can fix?
Thanks!
-
Sure, please email me the data, error message and command used and I'll see if I can figure out what's wrong.
-
- changed status to resolved
No news is good news?
- Log in to comment
Hey @emukherj,
I added a
mask_char
option tomakeChangeoClone
and adist_mat
option tobuildPhylipLineage
, which will, hopefully, let you count indels as mismatches.For example:
This:
gap=-1
). If you instead setgap=1
, then this will count each individual gap character as a mismatch (if it's not a gap in both sequences).I didn't test extensively, but let me know if this accomplishes what you wanted.