ocr - ghdrako/doc_snipets GitHub Wiki
The problem with traditional OCR providers (Azure, Tesseract, AWS Textract, etc.) is that they're only ~85% accurate.
New LLM
Gemini 2.0 changes everything because it combines an incredibly long context window (2M tokens on some models) with strong reasoning capabilities.