A zone is a rectangular area in the image, up to full size, containing a feature of interest to the user. The image data covered by each zone is handled and processed (and typically recognized) separately, according to zone-specific parameters. The zones of an ImGearRecPage Class image form a list, the zone list, which is attached to the image. The zone list can be examined at any time with the Zones Property of ImGearRecPage Class.
The characterizing attributes of a zone are the following:
The zone's filling method (fm) can take one of the following ImGearRecFillingMethod Enumeration values:
There are two special filling methods worth mentioning separately:
Zones can be added to a given image (to the zone list of the image, to be precise) in three different ways:
In addition to, or instead of the automatic zone search, you can program your own zones by specifying the zone coordinates and zone attributes. For adding zones to the zone list manually, the application calls the Add Method of ImGearRecZoneCollection Class. Related methods are: Insert Method, Remove Method, RemoveAt Method, and Clear Method.
The third way of creating zones is by reading zones from a file (called a zone file) that contains the attributes of previously saved zones. An application can save the current zone definitions to a zone file any time with the ImGearRecZoneCollection.SaveToFile Method. The application can load them from a zone file with the LoadFromFile Method.
When a zone file is loaded, any previous zones on the image are removed.
If the application calls the ImGearRecPage.Recognize Method on an image with an empty zone list, the page-layout decomposition method is called automatically.
The automatic page-layout decomposition process (auto-zoning) can be activated directly by calling the ImGearRecZoneCollection.Locate Method for finding text blocks on the image. It creates an entire zone list for the given image.
The Locate Method has an overload that takes an ImGearRectangle as a parameter. This rectangle serves to limit the area of the image inside which zones will be searched and defined.
The recognition engine offers three different algorithms to be applied during auto-zoning: use the ImGearRecRecognitionSettings.DecompMethod Property to specify which Page parser algorithm setting should be applied during auto-zoning from among the following:
In addition, the integrating application can set the zoning method to AUTO. With this setting, the recognition engine determines the method to use. See ImGearRecDecompositionMethod Enumeration for details.
Applying the new STANDARD algorithm results in the best throughput in overall accuracy, though in some cases it might require significantly more time to complete.
When auto-zoning is used, each resulting zone is initialized with the following:
All zones created by this method will have the following:
Any zone can be locally overridden with the Zones Property of ImGearRecPage, which allows you to access the zone list and change the attributes of a zone.
Tables with visible grid lines (gridded tables) in an original page can usually be detected successfully by the auto-zoning method. However, tables without visible cell separators in the original are harder to identify as tables, because they could alternately be word lists or data arranged in columns. The recognition engine offers an algorithm for detecting such non-gridded tables more confidently. This feature can only be used in conjunction with an auto-zoning step. The algorithm to detect non-gridded tables is based on the result of character recognition. This algorithm will run only if all the following conditions are met:
The code examples below show how to:
The following example contains uppercase letters only for multi-lingual omnifont recognition:
To get information about any particular zone of the zone list of the image, the application can access the zone list via the Zones Property of ImGearRecPage Class. This can be useful in finding out more about the zones created by the auto-zoning method.
The program can detect cells within a table-zone if automatic page decomposition is used.