Best Practices for Preprocessing Images and ICR/OCR Recognition

For optimum recognition results when using SmartZone ICR/OCR, use the following steps as image preprocessing and recognition guidelines.

Step 1: Preprocessing

ImageGear (Available for purchase separately)

Use ImageGear to cleanup the images that are to be processed. You can also use any third party tool of your choice.

Step 2: Recognition

SmartZone ICR/OCR

Specify and identify your zone of interest for recognition. The specified zone is read by the Reader object's (ICR: Reader; OCR: Reader) Zone property (ICR: Zone; OCR: Zone).
Determine and select which character sets are to be used in the text recognition. Results returned include only characters within your specified character set, and recognition is improved by limiting the character sets to only values you expect to have returned. SmartZone ICR/OCR let you customize character sets by combining character sets provided by Accusoft, and/or omitting characters you don't expect in your data.
1. Field type (ICR: FieldType; OCR: FieldType) is required, and its default value is General Text. Change the field type when your data is expected to match any of these predefined formats: date, time, United States phone number, URL, email address, currency, currency plus, social security number, taxpayer ID.
Recognition results can be improved by using one of the two options for further specifying expected results.
1. Write a regular expression (ICR: SetRegularExpression; OCR: SetRegularExpression) to augment off-the-shelf field types, or to create your own masking format.
2. Use a data validation list (ICR: DataValidationListAddEntry; OCR: DataValidationListAddEntry) to provide a list of expected data contents, which will be used by the recognition system to choose among possible results.
Provide any necessary error and exception handling. See Debug Your Application for more information on errors and exceptions.
Set MinimumCharacterConfidence (ICR: MinimumCharacterConfidence; OCR: MinimumCharacterConfidence), RejectionCharacter (ICR: RejectionCharacter; OCR: RejectionCharacter) and Segmentation (ICR: Segmentation; OCR: Segmentation) properties for the Reader.
You can use the SmartZone ICR/OCR ReadFromStream (ICR: ReadFromStream; OCR: ReadFromStream) and WriteToStream (ICR: WriteToStream; OCR: WriteToStream) methods to read and write image clean-up instructions and perform further processing.
Use the AnalyzeField methods (ICR: AnalyzeField; OCR: AnalyzeField) to perform the text recognition.
Recognition results will be returned in a TextBlockResult (ICR: TextBlockResult; OCR: TextBlockResult) where you can get the text, area, and confidence for the text block, every text line, and character. See Determine Results for more information.

Step 1: Preprocessing

ImageGear (Available for purchase separately)

Step 2: Recognition

SmartZone ICR/OCR

See Also