Datasets - NBISweden/workshop-genome_assembly GitHub Wiki

Datasets

Special tags:

  • [WGA]: Whole Genome Amplified.

Multiple platforms for a species:

E. coli K-12 substrain MG1655

Platform Location
Illumina https://www.ebi.ac.uk/ena/data/view/ERX008638
PacBio https://github.com/PacificBiosciences/DevNet/wiki/E.-coli-Bacterial-Assembly
ONT Nanopore https://s3-eu-west-1.amazonaws.com/ont-research/medaka_walkthrough.tar.gz

Plasmodium knowlesi (primate malaria parasite)

Platform Location
PacBio https://www.ebi.ac.uk/ena/data/view/PRJNA377737
Hi-C https://www.ebi.ac.uk/ena/data/view/PRJNA377737

Solanum verrucosum (wild potato) - 722 Mbp - diploid

Platform Location
Illumina PCR-free (two types) https://www.ebi.ac.uk/ena/data/view/PRJEB20860
Illumina Mate-pair https://www.ebi.ac.uk/ena/data/view/PRJEB20860
PacBio https://www.ebi.ac.uk/ena/data/view/PRJEB20860
Hi-C (Dovetail Chicago) https://www.ebi.ac.uk/ena/data/view/PRJEB20860

Illumina

Description Location
E. coli K-12 substrain MG1655 https://www.ebi.ac.uk/ena/data/view/ERX008638
[WGA] Cryptosporidium parvum https://www.ebi.ac.uk/ena/data/view/SRX1522147

PacBio

Description Location
E. coli K-12 substrain MG1655 https://github.com/PacificBiosciences/DevNet/wiki/E.-coli-Bacterial-Assembly
[WGA] Arabidopsis thaliana https://www.ebi.ac.uk/ena/data/view/ERX2095150

Nanopore (Oxford Nanopore Technologies - ONT)

Description Location
E. coli K-12 substrain MG1655 https://s3-eu-west-1.amazonaws.com/ont-research/medaka_walkthrough.tar.gz
Zymo Mock Community https://github.com/LomanLab/mockcommunity

10X Genomics

Description Location
Tiny test data http://cf.10xgenomics.com/supp/assembly/tiny-bcl-2.0.0.tar.gz
Many species https://support.10xgenomics.com/de-novo-assembly/datasets

Hi-C

Description Location
Human https://www.ebi.ac.uk/ena/data/view/SRR6675327
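
Extracting accessions from the ENA links

Several entries above point to ENA pages of the form .../ena/data/view/<accession>. If only the accession is needed (for example, to paste into a search box or pass to a download tool), it can be taken from the last path segment of the link. The following is a minimal sketch, not part of the original wiki; the helper name `ena_accession` and the `ENA_LINKS` list are illustrative assumptions, and the URLs are copied verbatim from the tables on this page.

```python
from urllib.parse import urlparse

# ENA links copied from the tables on this page.
ENA_LINKS = [
    "https://www.ebi.ac.uk/ena/data/view/ERX008638",
    "https://www.ebi.ac.uk/ena/data/view/PRJNA377737",
    "https://www.ebi.ac.uk/ena/data/view/PRJEB20860",
    "https://www.ebi.ac.uk/ena/data/view/SRX1522147",
    "https://www.ebi.ac.uk/ena/data/view/ERX2095150",
    "https://www.ebi.ac.uk/ena/data/view/SRR6675327",
]


def ena_accession(url):
    """Return the accession at the end of an ENA 'data/view' link.

    Hypothetical helper: it only inspects the URL text and does not
    contact ENA or check that the accession exists.
    """
    parsed = urlparse(url)
    if parsed.netloc != "www.ebi.ac.uk" or "/ena/data/view/" not in parsed.path:
        raise ValueError(f"not an ENA view link: {url}")
    # The accession is the final path segment.
    return parsed.path.rstrip("/").rsplit("/", 1)[-1]


accessions = [ena_accession(u) for u in ENA_LINKS]
print(accessions)
```

Links that are not ENA view pages (the GitHub, Amazon S3, and 10X Genomics entries) raise a ValueError, so they are not mistaken for accessions.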