Trees and alignments for Ribovirus orders and families - ababaian/serratus GitHub Wiki
Download
Methods
Palmprint sequences were obtained from PalmDB release 2021-03-14. All palmprints were aligned to the named subset by the usearch_global command of usearch v11.0.667 and assigned to an order or family if identity was at least 50% or 60%, respectively. Multiple alignments were created by Clustal-omega v1.2.4. Approximate maximum-likelihood trees were created by FastTree v2.1.11 SSE3. Rectangular cladograms were generated in svg format for each tree. Branches in these figures are colored green for known sequences and magenta for novel sequences, where novel is defined as <90% palmprint sequence identity to the pre-Serratus release of PalmDB (2021-03-02).
Files in the tarball
tree/
FastTree trees in Newick format.
msa/
Clustal-omega alignments.
svg/
Tree figures for each family and order, plus combined figures.
tsv/tree_master_table.tsv
TSV format 1. rank (order or family), 2. clade name (e.g. Amarillovirales), 3. PalmDB unique sequence identifier (e.g. u21583), 4. known or novel.
svg/families.svg
SVG figure with the 60 largest families.
svg/orders.svg
SVG figure with the 40 largest orders.