Download data from GDC to AWS EC2 - OXPHOS/GeneMiner GitHub Wiki
1. Data source:
GDC repository open access files
To download data, apply filters in the repository, then download:
- JSON file: contains the case ID and other metadata for every file that matches the filters
- manifest file: lists the ID, file name, md5 checksum, and size of each file; gdc-client uses it to download the datasets
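Before moving anything to EC2, it can be useful to check how much data the manifest covers. The following is a minimal sketch, assuming the manifest is a tab-separated file with a header row and the columns id, filename, md5, size, and state; the function name `summarize_manifest` and the path `path/to/manifest/file` are placeholders, not part of the GDC tooling.

```shell
# Print the number of files and the total size listed in a GDC manifest.
# Assumption: tab-separated columns id, filename, md5, size, state,
# with one header row.
summarize_manifest() {
    awk -F '\t' '
        NR > 1 && $1 != "" { n++; bytes += $4 }   # skip header and blank lines
        END { printf "%d files, %.0f bytes\n", n, bytes }
    ' "$1"
}

# Usage (placeholder path):
# summarize_manifest path/to/manifest/file
```

Comparing the total against the free space on the EC2 volume is a quick way to catch an undersized disk before starting the download.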
Then download the GDC Data Transfer Tool (gdc-client).
Reference genome:
Description and Download
2. Information transfer
Set up a connection to the AWS control machine via FileZilla (Tutorial)
Transfer gdc-client to the control machine
Transfer the manifest and JSON files to the control machine
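The transfers above use FileZilla. As an alternative for readers who prefer the command line, the same files could be copied with `scp`; this is a sketch of that substitute, not the procedure the page describes. The helper name `copy_to_control`, the `DRY_RUN` switch, the key file, the user and host, and the file names in the usage line are all hypothetical.

```shell
# Copy local files to the control machine with scp.
# Arguments: <ssh key file> <user@host:remote/dir/> <local file>...
# All values passed in are placeholders chosen by the caller.
copy_to_control() {
    key="$1"
    remote="$2"
    shift 2
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the command instead of running it.
        echo scp -i "$key" "$@" "$remote"
    else
        scp -i "$key" "$@" "$remote"
    fi
}

# Usage (hypothetical key, host, and file names):
# copy_to_control my-key.pem user@control-host:gdc/ gdc-client manifest.txt metadata.json
```

Setting `DRY_RUN=1` first shows the exact command, which helps confirm the key and destination before any data moves.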
3. Dataset download
path/to/gdc-client download -m path/to/manifest/files .
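Once the download finishes, the md5 values in the manifest can be used to check the files on disk. The sketch below makes two assumptions: the manifest is tab-separated with a header row and the columns id, filename, md5, size, and state, and each file was saved as `<id>/<filename>` under the download directory. The function name `verify_gdc_download` is a placeholder, and `md5sum` is assumed to be available (as it is on typical Linux EC2 images).

```shell
# Check downloaded files against the md5 column of the manifest.
# Arguments: <manifest file> [download directory, default: current directory]
# Assumptions: tab-separated manifest with header id, filename, md5, size,
# state; files stored as <id>/<filename> inside the download directory.
verify_gdc_download() {
    manifest="$1"
    download_dir="${2:-.}"

    # Emit "md5  relative/path" lines, the format md5sum -c reads from stdin.
    awk -F '\t' 'NR > 1 && $1 != "" { print $3 "  " $1 "/" $2 }' "$manifest" |
        (cd "$download_dir" && md5sum -c -)
}

# Usage (placeholder path):
# verify_gdc_download path/to/manifest/file .
```

The function returns a non-zero status if any file is missing or has a different checksum, so it can gate later pipeline steps.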