Azure Video OCR

[About ]
- [Azure Video OCR utilizes the results produced by Azure Video Indexer, including the main indexer file and the video OCR artifacts, and generates the standard AMP Video OCR JSON.]
- [It has been added as a tool on AMP's Galaxy and performs video OCR on the input videos.]
- [The output produced by this tool is a JSON consisting of the text and the corresponding bounding box information on each frame in the input.]
- [ If "dedupe" option is checked, will also generate AMP OCRR JSON with duplicate frames removed, i.e. consecutive frames with same texts within the specified period.]

[Source Code]
- [galaxy/tools/amp_vocr/azure_video_ocr.xml : This is the configuration file that details the tools usage, its inputs, outputs, version, and other things.]
- [galaxy/tools/amp_vocr/azure_video_ocr.py : This is a python wrapper to generate AMP Video OCR JSON based on Azure Video indexer JSON and Video OCR Artifacts JSON output from Azure Video Indexer.]
- [galaxy/tools/amp_schema/video_ocr.py : Classes used to construct AMP Video OCR json output.]

[Running ]
- [The tool can be invoked from Galaxy UI as other tools, it needs be used as the next MGM taking outputs from Azure Video Indexer with the include_ocr flag set to true. ]

[Outputs][
]
- [amp_vocr: The standardized AMP Video OCR JSON]
- [amp_vocr_dedupe: The AMP OCRR JSON with duplicate frames removed]

Document generated by Confluence on Feb 25, 2025 10:39

Azure Video OCR - AudiovisualMetadataPlatform/amp_documentation GitHub Wiki