ImageGear for .NET
Defining the Character Set

You can improve text recognition accuracy by narrowing the range of characters valid for recognition. This way the recognition engine doesn't always have to choose its solutions from all 500 characters in the recognition engine's Total Character Set. The multi-lingual omnifont MOR recognition module supports all of these characters; other recognition modules recognize fewer of them. Broadly, the Set is compiled as follows:

The following examples illustrate various techniques for limiting the Character Set:

To summarize, the Character Set for each zone was:

The IsCharEnabled Method can be used to inquire whether a given character is validated for the current page by its Language environment and FilterPlus.

 

 


©2014. Accusoft Corporation. All Rights Reserved.

Send Feedback