Run iphop only on host lineages of interest

Issue #108 closed
Anneliek ter Horst created an issue

Hi Simon,

Really like this tool, but the database is really big. I was wondering if there is a way that I can only run iphop on host genomes of interest, for example only on hosts with a specific taxonomic lineage (only E. coli species, or only Methanomicrobiales).

Would there be a way to subset the database in such a way?

Comments (3)

  1. Simon Roux repo owner

    Hi !
    Right, the database is big and that makes the compute that much longer. Unfortunately, iPHoP relies a lot of these “non-host” similarities to identify potential hosts, i.e. potential hosts are detected based on “microbial genomes that stand out compared to everything else”, so I have not found a good way to reduce / subset the host database at this point :-(

    Best,

    Simon

  2. Log in to comment