Some useful commands - GitBruno/docxicml GitHub Wiki
Output the AST (Abstract Syntax Tree) representation of the source document as JSON:
pandoc -f docx input.docx -t json -o input_ast.json
Transform XHTML to ICML
saxon -o:myicml.icml -s:myxhtml.html transform.xslt