Parvovirus‐GLUE Alignment Tree - giffordlabcvr/Parvovirus-GLUE GitHub Wiki

Alignments Included in Parvovirus-GLUE

Multiple sequence alignments (MSAs) in the Parvovirus-GLUE core project include:

  1. A root alignment constructed to represent homology between the two largest subgroupings in the Parvoviridae.
  2. subfamily alignments constructed to represent proposed homologies between representative members of Parvoviridae subfamilies.
  3. cross-genus alignments constructed to represent proposed homologies between representative members of 'minor' Parvoviridae lineages.

Please note that the repository also contains genus-level alignments constructed to represent proposed homologies between the genomes of representative members of specific parvovirus genera. These are imported when genus-level extensions are added to the core project.


The Alignment Tree in Parvovirus-GLUE

Parvovirus-GLUE makes use of GLUE's constrained alignment tree data structure.

For the highest taxonomic levels (i.e. at the root) we aligned only the most conserved regions of the NS gene, whereas for the lower taxonomic levels (i.e. within and below genus level) we aligned complete genomes.

The root alignment contains reference sequences for major clades, whereas all children of the root inherit at least one reference from their immediate parent. Thus, all alignments are linked to one another via our chosen set of master reference sequences.

Parvovirus alignment tree

The Alignment Tree in Parvovirus-GLUE The schematic figure above shows the 'alignment tree' data structure currently implemented in Parvovirus-GLUE. For the highest taxonomic levels (i.e. at the root) we aligned only the most conserved regions of the genome, whereas for the lower taxonomic levels (i.e. within and below genus level) we aligned complete coding sequences. We used an alignment tree data structure to link these alignments, via a set of common reference sequences. The root alignment contains reference sequences for major clades, whereas all children of the root inherit at least one reference from their immediate parent. Thus, all alignments are linked to one another via our chosen set of master reference sequences.