PubMed - sporedata/researchdesigneR GitHub Wiki

General description

PubMed comprises more than 29 million citations for biomedical literature from MEDLINE, life science journals, and online books. Several papers pre-train word embeddings on PubMed text, and others use PubMed articles for long-document summarization.

The PubMed Central (PMC) Open Access Subset is the part of PubMed Central whose articles are made available under Creative Commons or similar licenses, which generally allow more liberal redistribution and reuse than traditional copyright.

Data access

PMC Open Access Subset

PubMed Dataset
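
Besides the bulk downloads listed above, PubMed records can also be looked up programmatically. The sketch below is an illustration only and is not part of the original wiki: it assumes the NCBI E-utilities ESearch endpoint and its `db`, `term`, `retmode`, and `retmax` parameters, and the helper names `build_esearch_url` and `search_ids` are hypothetical. NCBI usage limits (request rate, API keys) are not handled here.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: base URL of the NCBI E-utilities service.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def build_esearch_url(term, db="pubmed", retmax=20):
    """Build an ESearch URL for a free-text query (hypothetical helper)."""
    params = {"db": db, "term": term, "retmode": "json", "retmax": retmax}
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


def search_ids(term, db="pubmed", retmax=20):
    """Return the list of record IDs matching `term` (needs network access)."""
    with urlopen(build_esearch_url(term, db=db, retmax=retmax), timeout=30) as resp:
        payload = json.load(resp)
    # ESearch JSON responses carry the matching IDs under esearchresult.idlist.
    return payload["esearchresult"]["idlist"]
```

For example, `search_ids("long document summarization", retmax=5)` would return up to five PubMed IDs as strings, and passing `db="pmc"` would search PubMed Central instead. Building the URL in a separate function keeps the query inspectable without making a network call.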