GENIA - sporedata/researchdesigneR GitHub Wiki

General description

The GENIA Bio-medical event dataset is a simplified version of the event-annotated GENIA dataset. Overall, it comprises three sets of data train (8k+ sentences), devel (about 3k sentences), and test (about 3k sentences)), each set containing four columns: "Sentence," "TriggerWord," "TriggerWordLoc," and "EventType." Each column represents the original biomedical text, labeled trigger words, the location of the trigger word in the text, and the type of event associated with the trigger word, respectively.

Bio-medical events may show the effects of drugs on a person and can be used to identify specific medical conditions in a person.

Data access

GENIA Bio-medical event dataset