Datasets - BenLangmead/jhu-compute GitHub Wiki

There are a few, big, relevant datasets we would like to have available locally (on HHPC). Note that our usage of these datasets may be restricted depending on whether the data is derived from humans and/or subject to HIPAA.

Generally, these datasets are located in:

/scratch0/langmead-fs1/data
/scratch1/langmead-fs1/data

(Those are two different partitions.)

Here are some of the datasets available and their locations:

GEUVADIS

The GEUVADIS study discovered eQTLs in data from 1000-Genomes-Project individuals. They also examined RNA-seq study design and bias/reprodicubility. 100 x 100 nt paired-end reads.

/scratch0/langmead-fs1/data/big_public_datasets/geuvadis

Depression Genes Network

This study. 50 nt unpaired reads, TruSeq unstranded protocol.