Datasets - BenLangmead/jhu-compute GitHub Wiki
There are a few big, relevant datasets we would like to have available locally (on HHPC). Note that our use of these datasets may be restricted, depending on whether the data are derived from humans and/or subject to HIPAA.
Generally, these datasets are located in:
/scratch0/langmead-fs1/data
/scratch1/langmead-fs1/data
(Those are two different partitions.)
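To check which of the two locations is visible from the machine you are logged into, something like the following minimal sketch may help. The helper names `existing_roots` and `list_entries` are hypothetical and not part of any lab tooling; the only things taken from this page are the two paths.

```python
from pathlib import Path

# The two data roots listed above; they are expected to exist only on the cluster.
DATA_ROOTS = [
    Path("/scratch0/langmead-fs1/data"),
    Path("/scratch1/langmead-fs1/data"),
]


def existing_roots(roots):
    """Return the subset of `roots` that are directories on this machine."""
    return [Path(r) for r in roots if Path(r).is_dir()]


def list_entries(root):
    """Return the sorted names of the immediate children of `root`."""
    return sorted(p.name for p in Path(root).iterdir())


if __name__ == "__main__":
    # Print each visible root followed by its top-level entries.
    for root in existing_roots(DATA_ROOTS):
        print(root)
        for name in list_entries(root):
            print("  " + name)
```

On a machine where neither partition is mounted, the script simply prints nothing.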
Here are some of the datasets available and their locations:
GEUVADIS
The GEUVADIS study discovered eQTLs in data from 1000 Genomes Project individuals. It also examined RNA-seq study design and bias/reproducibility. 100 x 100 nt paired-end reads.
/scratch0/langmead-fs1/data/big_public_datasets/geuvadis
Depression Genes Network
This study. 50 nt unpaired reads, TruSeq unstranded protocol.