EnteroBase Backend Pipeline: refMapper_matrix
refMapper_matrix
Overview
refMapper_matrix generates a SNP matrix file (not a distance matrix) from GFF files. These GFF files are usually produced by refMapper, which annotates SNPs after aligning the sequences of assemblies against a reference assembly.
The refMapper_matrix pipeline is usually invoked as part of a workflow to create a SNP tree.
refMapper_matrix is currently in version 2.0.
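To illustrate the distinction between a SNP matrix and a distance matrix mentioned above, here is a minimal sketch. The data layout, site labels and assembly names are invented for illustration and do not reflect the actual refMapper_matrix file format. A SNP matrix records the base call at each variable site for each assembly, whereas a distance matrix only records how many sites differ between each pair of assemblies, and can be derived from the SNP matrix.

```python
from itertools import combinations

# Hypothetical SNP matrix: one entry per variable site, one base call per assembly.
# Site labels and assembly names are invented for illustration only.
snp_matrix = {
    ("contig1", 105): {"asmA": "A", "asmB": "G", "asmC": "G"},
    ("contig1", 342): {"asmA": "C", "asmB": "C", "asmC": "T"},
    ("contig2", 17): {"asmA": "T", "asmB": "A", "asmC": "A"},
}


def pairwise_distances(matrix):
    """Count the sites at which each pair of assemblies carries different bases."""
    names = sorted(next(iter(matrix.values())))
    dist = {}
    for a, b in combinations(names, 2):
        dist[(a, b)] = sum(1 for calls in matrix.values() if calls[a] != calls[b])
    return dist


distances = pairwise_distances(snp_matrix)
for pair, count in sorted(distances.items()):
    print(pair, count)
```

The SNP matrix keeps per-site information (which position changed and to what), which is lost once it is collapsed into pairwise counts.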
Summary
refMapper_matrix reads one GFF file per genome assembly, generally expected in the specific format written by refMapper. Each GFF file annotates the mutations in that assembly, along with the regions that align successfully with the reference, repetitive regions, and regions with uncertain base calling or ambiguous alignment. The output is a single SNP matrix file covering all of the assemblies. It documents the mutations and their type (including identity, synonymous and non-synonymous SNPs, stops and frameshift mutations) at the positions in the genome assemblies that show variation.
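The general idea of combining per-assembly GFF annotations into one matrix can be sketched as follows. This is not the refMapper_matrix implementation: the feature type `variation`, the `alt` attribute, the `.` placeholder and the assembly names are all assumptions made for illustration, and the real refMapper GFF format may differ. The sketch only assumes the standard nine tab-separated GFF columns.

```python
# Hypothetical GFF text per assembly. The "variation" feature type and the
# "alt" attribute are assumptions, not the real refMapper output format.
GFF_BY_ASSEMBLY = {
    "asmA": "ref1\trefMapper\tvariation\t105\t105\t.\t+\t.\talt=G\n",
    "asmB": (
        "ref1\trefMapper\tvariation\t105\t105\t.\t+\t.\talt=G\n"
        "ref1\trefMapper\tvariation\t342\t342\t.\t+\t.\talt=T\n"
    ),
    "asmC": "ref1\trefMapper\tmisc_feature\t1\t500\t.\t+\t.\tnote=aligned\n",
}


def parse_variants(gff_text):
    """Return {(seqid, position): alt_base} for the assumed 'variation' features."""
    variants = {}
    for line in gff_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and GFF comments/directives
        cols = line.split("\t")
        if len(cols) != 9 or cols[2] != "variation":
            continue  # ignore malformed lines and non-variant features
        attrs = dict(item.split("=", 1) for item in cols[8].split(";") if "=" in item)
        variants[(cols[0], int(cols[3]))] = attrs.get("alt", "?")
    return variants


def build_snp_matrix(gff_by_assembly, ref_symbol="."):
    """Build rows (variable sites) by columns (assemblies) from parsed variants."""
    per_assembly = {name: parse_variants(text) for name, text in gff_by_assembly.items()}
    sites = sorted({site for calls in per_assembly.values() for site in calls})
    names = sorted(per_assembly)
    rows = [[per_assembly[n].get(site, ref_symbol) for n in names] for site in sites]
    return names, sites, rows


names, sites, rows = build_snp_matrix(GFF_BY_ASSEMBLY)
print("site", *names, sep="\t")
for site, row in zip(sites, rows):
    print(f"{site[0]}:{site[1]}", *row, sep="\t")
```

This sketch treats every site without a variant as matching the reference. A fuller version would also use the aligned, repetitive and ambiguous regions annotated in each GFF file to tell a reference-matching base apart from missing or unreliable data, and would carry the mutation type alongside each call.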