OCR (Optical Character Recognition) is reading text data from an image. This process scans the image, character by character, then translates the image characters to character code which you have the option to output to a file.
The OCR section includes:
ImageGear OCR Functionality
ImageGear offers a comprehensive full-page OCR solution. The ImageGear.Recognition namespace API provides a set of objects that provide access to document recognition technology, enabling you to build OCR applications for the Windows .NET development environment that can do the following:
- Input any image file format supported by ImageGear. This image data can be binary, gray, or color.
- Recognize a document image and export text in any format defined by the recognition engine. These formats include 8-bit text, HTML, XML and Word processor formats such as MS Word or WordPerfect.
- Select the language(s) to be recognized. See SpellingLanguage property for the list of languages.
- Enable end users to verify text during the recognition process.
- Increase recognition accuracy with built-in and user-defined dictionaries.
- Output confidence values for post-recognition processing.
- Manage delineated zones of a document page and then specify treatment for those zones. This includes the ability to correct the OCR engine's automatic segmentation between the segmentation phase and the recognition phase.
- Automatically segment the page to correctly recognize text on pages with complex or irregular layouts (including tables, reverse video, and line art).
- Process both text and graphics.
- Automatically detect fax, dot matrix, and other degraded documents and compensate accordingly.
- Use a scalable voting architecture that provides developers with 2 pre-made voting interfaces (OmniPage and PLUS) and direct access to 3 leading OCR engines (MOR, MTX, and FireWorX).
OCR Samples
We have an entire set of OCR samples that illustrate common OCR functionalities and use cases.
Portions of this document are excerpted from OmniPage Capture SDK 19.20 reference materials. Copyright © 2015 Nuance Communications, Inc. All Rights Reserved. Nuance and the Nuance logo are trademarks or registered trademarks of Nuance Communications, Inc. or its affiliates in the United States and/or other countries.
See Also