MinerU - Vermont-Complex-Systems/pdf-zoo GitHub Wiki

MinerU

tags: #layoutAnalysis, #pdf2markdown
inst: OpenDataLab
deps: PDF-Extract-Kit, LayoutLMv3, YOLOv8, UniMERNet, StructEqTable, PaddleOCR
paper: https://arxiv.org/abs/2409.18839
limitations: Complex setup; requires a specific environment configuration

# MinerU needs its own environment
conda create -n MinerU python=3.10
conda activate MinerU
# quote the extras so shells such as zsh do not expand the square brackets
pip install -U "magic-pdf[full]" --extra-index-url https://wheels.myhloli.com

Comprehensive PDF processing pipeline with OCR, layout analysis, and markdown conversion.
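
A minimal sketch of running the pipeline on a single PDF from Python, assuming the `magic-pdf` command-line tool installed by the package above and its `-p` (input), `-o` (output directory), and `-m` (method: `auto`, `ocr`, or `txt`) flags. The helper names `build_command` and `convert` are hypothetical and not part of MinerU, and the flags may differ between releases, so check `magic-pdf --help` in your environment.

```python
import shutil
import subprocess
from pathlib import Path

# Parsing methods assumed to be accepted by the magic-pdf CLI.
METHODS = {"auto", "ocr", "txt"}


def build_command(pdf_path, output_dir, method="auto"):
    """Return the argument list for one magic-pdf run (hypothetical helper)."""
    if method not in METHODS:
        raise ValueError(f"unknown method: {method!r}")
    return ["magic-pdf", "-p", str(pdf_path), "-o", str(output_dir), "-m", method]


def convert(pdf_path, output_dir, method="auto"):
    """Run magic-pdf on one PDF; fails if the CLI is missing or exits non-zero."""
    if shutil.which("magic-pdf") is None:
        raise RuntimeError(
            "magic-pdf not found; activate the MinerU conda environment first"
        )
    # Create the output directory up front so the run does not depend on it existing.
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_command(pdf_path, output_dir, method), check=True)
```

With the MinerU environment active, `convert("paper.pdf", "out")` would run the tool on `paper.pdf` (a placeholder file name) and leave the results under `out/`.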

The output might look something like this, with the different Markdown elements identified: