CRDC - NIH-NCPI/ncpi-model-forge GitHub Wiki

Cancer Research Data Commons (CRDC)

https://datacommons.cancer.gov

Overview

The goal of the National Cancer Institute’s Cancer Research Data Commons (CRDC) is to empower researchers to accelerate data-driven scientific discovery by connecting diverse datasets with analytical tools in the cloud. The CRDC is built upon an expandable data science infrastructure that provides secure access to many different data types across scientific domains via the Data Commons Framework.

The CRDC enables users to search and aggregate data across repositories via the Cancer Data Aggregator, using a common data model developed by the Center for Cancer Data Harmonization. Users can access CRDC data using NCI Cloud Resources (Broad FireCloud, Seven Bridges Cancer Genomics Cloud, and the Institute for Systems Biology Cancer Genomics Cloud), which bring data and computational power together to enable cancer research and discovery.

NCI Cloud Resources eliminate the need for researchers to download and store extremely large datasets by allowing them to bring analysis tools to the data in the cloud. The platforms also provide on-demand computational capacity to analyze these data.

The ability to combine diverse data types and perform cross-domain analysis of large cancer datasets can lead to new discoveries in cancer prevention, treatment and diagnosis, further supporting the goals of precision medicine and the Cancer Moonshot℠.

The CRDC will encompass and connect multiple cloud-based data repositories and serve as a central location to support public data sharing for NCI-funded programs.

Linked issues

https://github.com/ncpi-fhir/solid-octo-fortnight/labels/CRDC