Home ‐ Microbiome Helper 2 - LangilleLab/microbiome_helper GitHub Wiki

We have been continually been updating our SOPs and workflows, and as they are now substantially different than they were at the time of publication and release of Microbiome Helper (2017), we will be working on updating our scripts, SOPs, workflows and tutorials for a second release of Microbiome Helper (i.e., Microbiome Helper 2).

The idea is that there will be a lot of different workflows for analyses that we often carry out. These cover both marker gene (e.g. 16S, 18S, ITS as well as others) and metagenomic sequencing. Some parts of this are "core", but some parts will depend on what you want to get from your data and what your questions are. This is not a comprehensive list of all of the options that are out there!! These are some tools that we use the most, but there are many others out there - we encourage you to do your own research and decide which is the best for your purposes.

For example, for metagenome analyses, you may want to do:

  • Initial pre-processing and QC (core)
  • Taxonomic annotation with Kraken
  • Checking coverage of obtained annotations

Or you may want to do:

  • Initial pre-processing and QC (core)
  • Annotation of reads with CARD RGI
  • MAG assembly, binning, and curation with Anvi'o
  • Annotation of contigs with CARD RGI

We will update this page as we work on these different aspects.

We also plan to make available both an "image" that has all current versions of programs pre-installed and is more-or-less ready to use, as well as instructions on setting up a computing environment for yourself.

In this, we plan to have:

General notes on analysis

Introduction to Unix

Using AWS instances

  • Introduction to Amazon instances and what they are
  • Setting up an AWS instance for your analysis

Tutorial data

Amplicon workflows

Metagenomics initial pre-processing and QC

Metagenomic taxonomic annotation

Metagenomic functional annotation

MAGs

Analysis workflow

Full workflow:

  • Importing data:
    • 16S
    • Metagenomics taxonomic data
    • Functional data (predictions or metagenomic annotations)
    • Stratified data
  • Overview of samples:
    • Basic stacked bar plots
    • Heatmaps
    • Phylogenetic trees
    • Including fundamentals of adding plots together? (e.g. ordering by taxa so that a heatmap will appear in the same order as the tree and can be linked?)
  • Alpha/beta diversity:
    • Alpha diversity metrics and visualisation
    • Beta diversity metrics and visualisation - ordination vs others e.g. heatmap or dendrogram
    • Look at taxonomic contributions to function
  • Differential abundance:
    • ANCOM
    • ALDEx
    • MaAsLin
    • Corncob
    • radEmu
  • Longitudinal analysis
  • Output R Notebook
  • Combining taxonomic/functional visualisations with JarrVis

Other useful things

  • Downloading reads from SRA
  • Uploading reads to SRA (or ENA)