In addition to recognizing images with the default language settings, ImageGear lets you specify the language or languages to be recognized on a page. If the language of the page is known before recognition, specifying it makes recognition more precise, because the appropriate character sets are used in the recognition process and language-specific dictionaries are applied to the recognized character constructions.
Use the LanguageEnabled property of ImGearOCRSettings to define a specific language or languages.
- Set a specific language in this collection to true if the language is used on the page to be recognized.
- Set a specific language to false to exclude the language from the recognition process.
The supported languages are divided into several language groups. Languages from one group may be incompatible with languages from other groups, and recognition may return an error when languages from different groups are enabled together. To avoid combining incompatible languages, enable languages according to the following language groups:
- Greek language.
- Latin and Cyrillic language group, which combines the CentralEurope, Cyrillic, WesternEurope, Turkish, and Baltic languages. This set of languages includes: Afrikaans, Albanian, Andorra, Argentina, Australia, Austria, AzerbaijanCyrillic, AzerbaijanLatin, Baltic, Basque, Belarusian, Belgium, Bosnian, Brazil, Bulgarian, Canada, Catalan, CentralAmerica, CentralEurope, Chile, Colombia, Croatian, Cyrillic, Czech, Danish, Dutch, English, Estonian, Faroese, Finnish, French, Frisian, German, GreatBritain, Guarani, Hani, Hungarian, Icelandic, Indonesian, Irish, Italian, JapanLatinOnly, KazakhCyrillic, KazakhLatin, KirghizCyrillic, Kirundi, Latin, Latvian, Liechtenstein, Lithuanian, Luxembourgish, Macedonian, Malay, Mexico, Netherlands, NewZealand, Norwegian, Polish, Portuguese, Quechua, RhaetoRomanic, Romanian, Russian, Rwanda, Scandinavia, SerbianCyrillic, Shona, Slovak, Slovenian, Somali, Sorbian, SouthAfrica, SouthAmerica, Spanish, Swahili, Swedish, Switzerland, TajikCyrillic, Turkish, TurkmenCyrillic, TurkmenLatin, Ukrainian, USA, UzbekCyrillic, UzbekLatin, Venezuela, WesternEurope, Wolof, Xhosa, Zulu.
- ChineseSimplified and ChineseTraditional languages.
- ChineseHongKong language.
- Japanese language.
- Korean language.
- Thai language.
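The grouping rule above can be checked before any languages are enabled. The following is a minimal sketch in Python, independent of the ImageGear API: the names `LANGUAGE_GROUPS`, `groups_for`, and `is_compatible` are hypothetical helpers introduced here for illustration, and the group labels are arbitrary. Only the language names come from the list above.

```python
# Hypothetical pre-check for the language groups listed above.
# Not part of ImageGear; it only encodes the grouping described in this section.

LATIN_CYRILLIC = frozenset("""
Afrikaans Albanian Andorra Argentina Australia Austria AzerbaijanCyrillic
AzerbaijanLatin Baltic Basque Belarusian Belgium Bosnian Brazil Bulgarian
Canada Catalan CentralAmerica CentralEurope Chile Colombia Croatian Cyrillic
Czech Danish Dutch English Estonian Faroese Finnish French Frisian German
GreatBritain Guarani Hani Hungarian Icelandic Indonesian Irish Italian
JapanLatinOnly KazakhCyrillic KazakhLatin KirghizCyrillic Kirundi Latin
Latvian Liechtenstein Lithuanian Luxembourgish Macedonian Malay Mexico
Netherlands NewZealand Norwegian Polish Portuguese Quechua RhaetoRomanic
Romanian Russian Rwanda Scandinavia SerbianCyrillic Shona Slovak Slovenian
Somali Sorbian SouthAfrica SouthAmerica Spanish Swahili Swedish Switzerland
TajikCyrillic Turkish TurkmenCyrillic TurkmenLatin Ukrainian USA
UzbekCyrillic UzbekLatin Venezuela WesternEurope Wolof Xhosa Zulu
""".split())

# Group labels (the dictionary keys) are arbitrary names chosen for this sketch.
LANGUAGE_GROUPS = {
    "Greek": frozenset({"Greek"}),
    "LatinCyrillic": LATIN_CYRILLIC,
    "Chinese": frozenset({"ChineseSimplified", "ChineseTraditional"}),
    "ChineseHongKong": frozenset({"ChineseHongKong"}),
    "Japanese": frozenset({"Japanese"}),
    "Korean": frozenset({"Korean"}),
    "Thai": frozenset({"Thai"}),
}


def groups_for(languages):
    """Return the sorted labels of the groups that the given languages belong to."""
    found = set()
    for language in languages:
        matches = [label for label, members in LANGUAGE_GROUPS.items()
                   if language in members]
        if not matches:
            raise ValueError(f"unknown language: {language}")
        found.update(matches)
    return sorted(found)


def is_compatible(languages):
    """Return True when all given languages fall within a single group."""
    return len(groups_for(languages)) <= 1
```

With these definitions, `is_compatible(["French", "Russian"])` returns `True` because both names are in the Latin and Cyrillic group, while `is_compatible(["French", "Japanese"])` returns `False`. A check like this only mirrors the documented grouping; the recognition engine remains the authority on which combinations it accepts.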
Enabling any of the following languages requires an ImageGear license that includes support for Asian languages:
- ChineseSimplified
- ChineseTraditional
- ChineseHongKong
- Japanese
- Korean
- Thai
The following example illustrates how to recognize a page containing only French text.