02_READ_BASED_ANNOTATION - eolesin/AMOR_Indiv_Assembly_Protocol GitHub Wiki

PhyloFlash is a method to identify and assemble full-length SSU sequences from reads. Please see the sub-coassemblies protocol for details on that

1. Kraken2 profiling of contigs

# Using a file containing all of the contigs from all samples as input:

kraken2 merged.contigs.fa --db standardKraken2 --threads 40 > merged.contigs.kraken2.output

# When this finishes, you have your taxa assignments for each contig. 
# Now we can recruit reads from all samples to our merged contigs database we made for mapping.


2. Kraken2 profiling of reads

# Perhaps more importantly, kraken2 takes reads as input. With reads info we can 
# easily figure out how many of each taxa are present. The former method takes a bit more matching.