tesseract - Vermont-Complex-Systems/pdf-zoo GitHub Wiki

tesseract

tags: #trad
inst: Google (originally HP)
limitations: Struggles with complex layouts and handwriting

Mother of all OCR tools. It was there when the world wide web was invented. More recently runs LSTM under the hood.