ImageGear PDF v24.14 - Updated
Output Text Format List
User Guide > How to Work with... > OCR > Concepts > Technical Specifications > Output Text Format List

The following list contains all the selectable output formats of the converters.

Commonly Used Output Formats

Office Formats

The toolkit can generate output for Office file types DOCX, XLSX and PPTX. These files can be opened in Office 2007 and higher versions.

The DOCX file type specification can be downloaded from: http://www.ecma-international.org/news/TC45_current_work/TC45_available_docs.htm.

The DOCX / XLSX / PPTX file types conform with a Microsoft standard called "Open Packaging Conventions (OPC)" with specifications available for download at http://go.microsoft.com/fwlink/?linkID=71255.

PDF Formats

Text Formats

ML Formats

Ebook Formats

Direct Text Output Formats

This group of output formats allows you to convert recognized text simply and quickly. That is, you use the output of the recognition module as is (without reading order and paragraph detection). Therefore, Direct Outputs are faster to produce, because they do not include slow detection processes.

Legacy and Deprecated Output Formats