Data Files Contributions - kana112233/tesseract GitHub Wiki
此页面列出了Tesseract社区使用Tesseract4兼容tessdata(for --oem 1 - LSTM)的存储库.
这样的tessdata贡献应该理想地记录重现训练过程所需的一切(字体,图像,基本事实,文本,脚本,文档......).
Language Code | Language | Data File | Contributor | Info |
---|---|---|---|---|
khmLimon | Khmer | best | OpenInstituteCambodia/phyrumsk | PR in tessdata_best |
cop | Coptic | best | shreeshrii/tessdata_coptic | tesseract-ocr forum post |
jpn_vert | Japanese Vertical | best | zodiac3539/jpn_vert | tesseract-ocr forum post |
ocrb_plus | MRZ | best | shreeshrii/tessdata_ocrb | tesseract-ocr forum post |
jav_java | Aksara Jawa | best | Shreeshrii/tessdata_jav_java | tesseract-ocr forum post |
Lang_Code | Language | best | User_Repo | tesseract-ocr forum post |
---|