Choice of assembler - aechchiki/SIB_LongReadsWorkshop_Zurich17 GitHub Wiki
The most important aspect of genome assembly is the algorithm that is used. The genomic features that will affect the choice are genome size, heterozygosity, and the proportion of low-complexity sequences.
assembler | genome_size | heterozygosity | pros | cons |
---|---|---|---|---|
Falcon | any | any | handling variable levels of haplotype divergence | hard to install |
Canu | any | low or high | elegant read correction, nice assembly reports | |
HGAP | bacterial size | | usually single contig assembly of bacteria | slow, hard to install |
Miniasm | any | low | easy to install, super fast to run | not that accurate |
Note that for some genomes you may need to replace one of the Overlap-Layout-Consensus steps with an alternative approach that better suits your genome.
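
To make the table easier to apply, here is a minimal Python sketch that encodes its rows and filters them by genome size and heterozygosity. The names `Assembler`, `ASSEMBLERS` and `candidates`, and the size categories `"bacterial"` and `"large"`, are hypothetical and chosen only for illustration. The table gives no heterozygosity value for HGAP, so the sketch assumes that HGAP is not filtered on heterozygosity at all. Treat the result as a shortlist to start from, not a recommendation.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Assembler:
    name: str
    genome_size: str                          # "any" or "bacterial"
    heterozygosity: Optional[Tuple[str, ...]]  # None = not stated in the table
    pros: str
    cons: str


# Rows transcribed from the table above.
# "any" heterozygosity is expanded to both levels ("low", "high").
ASSEMBLERS = [
    Assembler("Falcon", "any", ("low", "high"),
              "handling variable levels of haplotype divergence",
              "hard to install"),
    Assembler("Canu", "any", ("low", "high"),
              "elegant read correction, nice assembly reports", ""),
    # Assumption: no heterozygosity value is listed for HGAP, so it is left as None.
    Assembler("HGAP", "bacterial", None,
              "usually single contig assembly of bacteria",
              "slow, hard to install"),
    Assembler("Miniasm", "any", ("low",),
              "easy to install, super fast to run", "not that accurate"),
]


def candidates(genome_size: str, heterozygosity: str) -> List[str]:
    """Return the names of assemblers whose table row matches the genome.

    genome_size: "bacterial" or "large" (hypothetical categories).
    heterozygosity: "low" or "high".
    """
    matches = []
    for asm in ASSEMBLERS:
        size_ok = asm.genome_size in ("any", genome_size)
        het_ok = asm.heterozygosity is None or heterozygosity in asm.heterozygosity
        if size_ok and het_ok:
            matches.append(asm.name)
    return matches


if __name__ == "__main__":
    print(candidates("large", "high"))
    print(candidates("bacterial", "low"))
```

The filter only reflects the genome size and heterozygosity columns; the pros and cons (installation effort, speed, accuracy) still have to be weighed by hand.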
Next
Go to the assembly tutorial using Canu.
Go to the assembly tutorial using Miniasm.
Go back to the introduction to Genome Assembly.
Go back to the Table of contents.