Supported file formats - nuanxed/MateCat-Filters GitHub Wiki

Directly supported formats

  • Microsoft Office

    • DOCX
    • XLSX
    • PPTX
  • OpenOffice

    • ODT
    • OTT
    • ODS
    • OTS
    • ODP
    • OTP
  • Hypertext

    • HTML
    • XHTML
  • Localization

    • SDLXLIFF
    • XLIFF
    • PO
    • TTX
  • Desktop publishing

    • MIF
    • IDML
    • ICML
    • DITA
  • Interchange formats

    • CSV
    • TSV
    • XML
    • DTD
    • JSON
    • YAML
  • Others

    • TXT
    • PROPERTIES
    • RESX
    • STRINGS
    • SRT
    • WIX

Formats supported using MateCAT Win Converter

MateCAT Win Converter transforms some file types into formats directly supported by MateCat Filters, relying on external commercial dependencies. It uses Microsoft Office to convert legacy Office formats to Office Open XML, the Nuance OCR SDK to convert images to DOCX, and CloudConvert to convert PDFs to DOCX. See the dedicated repository for more information.

  • Microsoft Office

    • DOC
    • DOT
    • DOCM
    • DOTX
    • DOTM
    • XLS
    • XLT
    • XLSM
    • XLTX
    • XLTM
    • PPT
    • PPS
    • POT
    • PPTM
    • PPSX
    • PPSM
    • POTX
    • POTM
  • OCR

    • BMP
    • GIF
    • PNG
    • JPEG
    • TIFF
    • Scanned PDFs
  • Others

    • Regular PDFs
    • RTF
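
The two lists above amount to a lookup from file type to conversion path. The following is a minimal sketch of that lookup, not part of MateCat Filters or MateCAT Win Converter: the names `DIRECT`, `VIA_WIN_CONVERTER`, and `support_level` are hypothetical, and it assumes each listed format name, lowercased, is also the file extension. Common aliases such as `jpg`, `tif`, or `htm` are deliberately left out because the lists do not mention them. Scanned and regular PDFs share the `pdf` extension, so both map to the converter path.

```python
from pathlib import Path

# Formats listed under "Directly supported formats".
# Assumption: the lowercased format name doubles as the file extension.
DIRECT = {
    "docx", "xlsx", "pptx",                              # Microsoft Office
    "odt", "ott", "ods", "ots", "odp", "otp",            # OpenDocument files
    "html", "xhtml",                                     # Hypertext
    "sdlxliff", "xliff", "po", "ttx",                    # Localization
    "mif", "idml", "icml", "dita",                       # Desktop publishing
    "csv", "tsv", "xml", "dtd", "json", "yaml",          # Interchange formats
    "txt", "properties", "resx", "strings", "srt", "wix",  # Others
}

# Formats listed under "Formats supported using MateCAT Win Converter".
VIA_WIN_CONVERTER = {
    "doc", "dot", "docm", "dotx", "dotm",
    "xls", "xlt", "xlsm", "xltx", "xltm",
    "ppt", "pps", "pot", "pptm", "ppsx", "ppsm", "potx", "potm",
    "bmp", "gif", "png", "jpeg", "tiff",                 # OCR
    "pdf", "rtf",                                        # Others
}


def support_level(filename: str) -> str:
    """Return 'direct', 'win-converter', or 'unsupported' for a file name."""
    # Only the last suffix is considered, compared case-insensitively.
    ext = Path(filename).suffix.lower().lstrip(".")
    if ext in DIRECT:
        return "direct"
    if ext in VIA_WIN_CONVERTER:
        return "win-converter"
    return "unsupported"
```

For example, `support_level("manual.idml")` falls in the first set and `support_level("legacy.doc")` in the second. Note that `pot` appears in the lists only as a PowerPoint template, so this sketch would route a gettext `.pot` file to the converter path as well.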