ctakes preprocessor - apache/ctakes GitHub Wiki
Transforms CDA text into plain text, inserts hyphens into words, stores the resulting text in a new View and creates Sections.
Source class: CdaCasInitializer
Source package: org.apache.ctakes.preprocessor.ae
Parent class: org.apache.uima.fit.component.JCasAnnotator_ImplBase
Dependencies: Document Id
Products: Section
Parameter | Description | Class | Required | Default |
---|---|---|---|---|
DtdFile | Path to File that contains the DTD file | String | No | org/apache/ctakes/preprocessor/cda/ NotesIIST_RTF.DTD |
HyphenFile | Path to File that contains the hypenated file | String | No | org/apache/ctakes/preprocessor/tokenizer/ hyphenated.txt |