MinerU - Vermont-Complex-Systems/pdf-zoo GitHub Wiki
MinerU
tags: #layoutAnalysis
, #pdf2markdown
inst: OpenDataLab
deps: PDF-Extract-Kit, , LayoutLMv3, YOLOv8, UniMERNet, StructEqTable, PaddleOCR
paper: https://arxiv.org/abs/2409.18839
limitations: Complex setup and requires specific environment configuration
# minerU has its own environment
conda create -n MinerU python=3.10
conda activate MinerU
pip install -U magic-pdf[full] --extra-index-url https://wheels.myhloli.com
Comprehensive PDF processing pipeline with OCR, layout analysis, and markdown conversion.
The output might look something like that, identifying the different markdown objects: