Home - fhircat/CORD-19-on-FHIR GitHub Wiki
SPARQL Query Examples
COVID-19 PICO Ontology
CORD-19 dataset content
- Commercial use subset (includes PMC content) -- 9000 papers, 186Mb
- Non-commercial use subset (includes PMC content) -- 1973 papers, 36Mb
- PMC custom license subset -- 1426 papers, 19Mb
- bioRxiv/medRxiv subset (pre-prints that are not peer reviewed) -- 803 papers, 13Mb
Proposed approach:
- Run documents through the NLP2FHIR pipeline, producing FHIR R4 resources descriptions.
- Convert FHIR R4 resources to RDF using the FHIR to RDF converter
- Load resulting RDF into into a SPARQL Endpoint (target host at the moment: https://graph.fhircat.org/graphdb)