The ImageGear Recognition component provides access to document recognition technology.
This core software module provides recognition for almost any document, including those produced on typewriters, dot-matrix printers, ink-jet printers, laser printers, and phototypesetters, as well as photocopied and faxed versions of any document.
Using the ImageGear Recognition component, you can recognize a document image and export the text in one of the supported output formats. These formats include 8-bit ASCII and 16-bit Unicode text, Microsoft Word, HTML, and others.
The ImageGear Recognition component also allows a user to:
Portions of this document are excerpted from OmniPage Capture SDK 18.6 reference materials. Copyright © 2011 Nuance Communications, Inc. All Rights Reserved. Nuance and the Nuance logo are trademarks or registered trademarks of Nuance Communications, Inc. or its affiliates in the United States and/or other countries.
The figure below illustrates a simple application that takes a page image as input and produces recognized text in a variety of output formats.
ImageGear's Recognition component enables you to build OCR applications for the Windows development environment. An application using the Recognition API can:
Documents that contain multiple languages can be recognized accurately because the API lets the application specify the set of languages to use for recognition.
This section provides information about the following: