Home - hcts-hra/ecpo-fulltext-experiments GitHub Wiki
This wiki documents the process of extracting full text from the 1919–1940 issues of the Republican Chinese entertainment newspaper 晶報 Jīngbào.
Page Segmentation:
-
Rule-based Approaches
-
ML-driven Approaches
Character Segmentation Using HRCenterNet
Building an OCR Classifier: