OCR (Optical Character Recognition) is the process of converting machine-printed information into editable text. The SmartZone OCR component used by FormAssist performs this process on zones, which are the selected field areas on the form template, and analyzes each filled-in form field against a pre-defined character set.
Add OCR Fields
To add an OCR field to a form template, select the OCR Tool button by clicking it in the toolbar or pressing "Ctrl+Shift+O" (step 2 in the topic Steps to Define, Create and Modify Form Fields). Then define the OCR field, as with any other field type, by clicking one corner of the field on the form image and dragging to the diagonally opposite corner.
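The corner-to-corner drag described above amounts to defining a rectangle from two diagonal points. The following Python sketch only illustrates that geometry; `Zone` and `zone_from_drag` are hypothetical names and are not part of FormAssist or SmartZone OCR. It assumes pixel coordinates with the origin at the top-left of the form image.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Zone:
    """Hypothetical field area: top-left corner plus size, in pixels."""
    left: int
    top: int
    width: int
    height: int


def zone_from_drag(start, end):
    """Build a Zone from the two diagonal corners of a mouse drag.

    The drag may begin at any corner, so the coordinates are
    normalized to find the top-left corner and a positive size.
    """
    (x0, y0), (x1, y1) = start, end
    return Zone(
        left=min(x0, x1),
        top=min(y0, y1),
        width=abs(x1 - x0),
        height=abs(y1 - y0),
    )
```

Dragging from the bottom-right corner to the top-left corner yields the same zone as dragging in the opposite direction, which matches the behavior a user would expect from the Image View.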
Set OCR Field Settings
FormAssist window with OCR tab open and an OCR field highlighted
The image above shows an example OCR field (Your Name) highlighted in both the Tree View and the Image View.
Because the OCR engine analyzes images for specific text, it is recommended to remove, or 'dropout', the form from the image and to perform any image enhancements before recognition. Both steps improve OCR processing performance.
See the Image Enhancement topic in this section for more information on how to improve recognition processing performance.
To create OCR fields, see the OCR Fields topic below the Define and Create Fields section.
Properties View
The tabs of the Properties View are:
Tab | Description |
General | Contains the field area coordinates, which can be modified either by dragging the outlined field in the Image View or by editing the values on this tab. |
Dropout | Dropout is important because it helps the OCR engine accurately distinguish the machine-printed text from the parts of the original form. Dropout is recommended whenever possible. See the Dropout Properties topic for more information. |
ScanFix Xpress | The ScanFix Xpress settings are recommended because they improve the image, allowing the OCR engine to recognize the text with greater accuracy. See the ScanFix Xpress Properties topic for more details. |
OCR | The OCR property settings allow you to select the language, character set, confidence values, rejection character, spaces, and multiple text lines. Adjusting these settings for the OCR engine can improve performance and increase result accuracy. See the OCR Property Details below for more information. |
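To make the interaction between the character set, confidence value, and rejection character settings more concrete, here is a minimal Python sketch of the general idea. It is not the SmartZone OCR API: the function name `apply_ocr_settings`, the 0 to 100 confidence scale, and the "~" rejection character are all assumptions chosen for illustration.

```python
def apply_ocr_settings(candidates, allowed_chars, min_confidence, rejection_char="~"):
    """Filter per-character OCR candidates (illustrative only).

    candidates     -- list of (character, confidence) pairs; the
                      0-100 confidence scale is an assumption
    allowed_chars  -- the character set the field is restricted to
    min_confidence -- characters scoring below this are rejected
    rejection_char -- placeholder written in place of a rejected character
    """
    result = []
    for char, confidence in candidates:
        if char in allowed_chars and confidence >= min_confidence:
            result.append(char)
        else:
            # Either outside the character set or too uncertain.
            result.append(rejection_char)
    return "".join(result)


# Hypothetical numeric field: "2" is low confidence, "A" is not a digit.
digits = set("0123456789")
text = apply_ocr_settings(
    [("4", 95), ("2", 40), ("A", 90), ("7", 88)],
    allowed_chars=digits,
    min_confidence=60,
)
```

In this sketch, a narrower character set and a higher confidence threshold produce more rejection characters but fewer silent misreads, which is the trade-off these settings are meant to control.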