OCR

ImageGear offers a comprehensive full-page OCR solution. The ImageGear.Recognition namespace API provides a set of objects that give access to document recognition technology, enabling you to build OCR applications for the Windows .NET development environment that can do the following:

Accept as input any image file format supported by ImageGear. This image data can be binary, gray, or color.
Recognize a document image and export text in any format defined by the recognition engine. These formats include 8-bit text, HTML, XML, and word processor formats such as MS Word or WordPerfect.
Select the language or languages of documents to be recognized. The list includes English, Eastern and Western European languages, Asian languages, Cyrillic-based languages (Russian), the Baltic languages, Turkish, and Greek. Documents that contain multiple languages can be recognized accurately because the API lets the application specify the set of languages for recognition.
Enable end users to verify text during the recognition process.
Increase recognition accuracy with built-in and user-defined dictionaries.
Output confidence values for post-recognition processing.
Automatically segment the page to correctly recognize text on pages with complex or irregular layouts, including tables, reverse video, and line art.
Allow the user to delineate zones of a document page and then specify treatment for those zones. This includes the ability to correct the OCR engine's automatic segmentation between the segmentation phase and the recognition phase.
Process both text and graphics. The recognition software's ability to distinguish graphics from text can provide the basis of a compound document processing system.
Automatically detect fax, dot matrix, and other degraded documents and compensate accordingly.
Use a scalable voting architecture that provides developers with two pre-made voting interfaces (OmniPage and PLUS) and direct access to three leading OCR engines (MOR, MTX, and FireWorX).
Recognize handwritten text using a numbers-only module or an alphanumeric module.
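
As an illustration of how confidence values can feed post-recognition processing and end-user verification, the following is a minimal, language-neutral sketch in Python. It does not use the ImageGear API: the `RecognizedWord` record, the `flag_for_review` helper, the 0 to 100 confidence scale, and the threshold of 80 are all hypothetical and chosen only for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RecognizedWord:
    """Hypothetical record: a recognized word and its confidence.

    The 0-100 scale is an assumption for this sketch, not the scale
    used by any particular recognition engine.
    """
    text: str
    confidence: int


def flag_for_review(words: List[RecognizedWord],
                    threshold: int = 80) -> List[RecognizedWord]:
    """Return the words whose confidence falls below the threshold."""
    return [w for w in words if w.confidence < threshold]


# Made-up sample output standing in for a recognized line of text.
words = [
    RecognizedWord("Invoice", 97),
    RecognizedWord("N0.", 54),
    RecognizedWord("12345", 91),
]

# Words below the threshold would be shown to the user for verification.
suspects = flag_for_review(words)
print([w.text for w in suspects])
```

In a real application the threshold would likely be tuned per document type, since degraded input such as faxes tends to produce lower confidence overall.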

The ImageGear.Recognition.Forms namespace contains classes that offer recognition-related user interface functionality. For example, it provides a customizable toolbar for creating recognition zones, and it supports interactive, visual creation and editing of those zones, which are displayed directly over the image to be recognized.

See ImageGear.Recognition Namespace and ImageGear.Recognition.Forms Namespace for more information.

Portions of this document are excerpted from OmniPage Capture SDK 18.6 reference materials. Copyright © 2011 Nuance Communications, Inc. All Rights Reserved. Nuance and the Nuance logo are trademarks or registered trademarks of Nuance Communications, Inc. or its affiliates in the United States and/or other countries.

The core OCR software module, referred to hereafter as the recognition engine, provides unsurpassed recognition accuracy for almost any document, including those produced on typewriters, dot-matrix printers, ink-jet printers, laser printers, and phototypesetters, as well as photocopied and faxed versions of such documents.

The figure below illustrates a simple application that takes a page image as input and produces recognized text in a variety of output formats.

This section provides information about the following:

Concepts

Specifications

Technical Specifications

How to...

Pre-Process an Image

OCR an Image or Document

Access and Analyze OCR Output