Some useful commands - GitBruno/docxicml GitHub Wiki

Output the AST (Abstract Syntax Tree) representation of the source document as JSON:

pandoc -f docx input.docx -t json -o input_ast.json
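To get a quick overview of what the exported AST contains, the node types can be tallied straight from the JSON file. This is a minimal sketch, not part of the original workflow: `ast_node_types` is a hypothetical helper name, and it assumes every AST node carries its type in a `"t"` field, as pandoc's JSON output does. It relies only on `grep -o`, `sed`, `sort` and `uniq`.

```shell
#!/bin/sh
# Hypothetical helper: count how often each node type ("Para", "Str", ...)
# occurs in a pandoc JSON AST file. Assumes node types are stored as "t":"Name".
ast_node_types() {
    grep -o '"t": *"[A-Za-z]*"' "$1" |
        sed 's/.*"\([A-Za-z]*\)"$/\1/' |   # keep only the type name
        sort | uniq -c | sort -rn           # tally, most frequent first
}
```

Calling `ast_node_types input_ast.json` prints one line per node type with its count, which helps spot unexpected structures before writing a transform.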

Transform XHTML to ICML using an XSLT stylesheet:

saxon -s:myxhtml.html -xsl:transform.xslt -o:myicml.icml
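When several XHTML files need converting, the same call can be wrapped in a loop. The following is a sketch under assumptions: `icml_name` and `convert_all` are hypothetical helper names, a `saxon` wrapper is assumed to be on the `PATH`, and the stylesheet is assumed to be `transform.xslt` in the current directory, with the file roles matching the command above.

```shell
#!/bin/sh
# Hypothetical helper: derive the ICML output name from an input file name
# by swapping the last extension for .icml.
icml_name() {
    printf '%s\n' "${1%.*}.icml"
}

# Hypothetical helper: convert every .html file in the current directory.
# Assumes `saxon` is on PATH and transform.xslt sits next to the inputs.
convert_all() {
    for src in *.html; do
        [ -e "$src" ] || continue   # nothing matched the pattern
        saxon -s:"$src" -xsl:transform.xslt -o:"$(icml_name "$src")"
    done
}
```

Sourcing the script and running `convert_all` in a folder of XHTML files should leave one `.icml` file next to each input.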