The automatic page-layout decomposition process (auto-zoning) can be activated directly by calling the IG_REC_zones_locate() function for finding text blocks on the image. It creates an entire zone list for the given image.
The IG_REC_zones_locate() function has a parameter that points to a structure containing the active area. This rectangle serves to limit the area inside which zones will be searched and defined.
The Capture SDK offers four different algorithms to be applied during auto-zoning: use the IG_REC_decomp_method_set() function to specify which Page parser algorithm setting should be applied during auto-zoning. Choose between the following enumIGRecDecompMethod constants:
- IG_REC_DCM_AUTO: The Engine will determine at runtime which algorithm will be applied. This depends on the trade-off and language settings (default setting).
- IG_REC_DCM_LEGACY: Simple page decomposition. This is faster but less accurate than Standard.
- IG_REC_DCM_STANDARD: The most accurate page parsing module.
- IG_REC_DCM_FAST: This algorithm can be treated as a simplified variant of DCM_LEGACY: this algorithm does not try to find tables on the page. It is the fastest (but least accurate) parsing algorithm available. Use it only for simple page parsing tasks.
When auto-zoning is used, each resulting zone is initialized with the following:
- The zone's detected bounding rectangle
- The zone type determined (IG_REC_WT_FLOW, IG_REC_WT_TABLE, IG_REC_WT_GRAPHIC, and so on)
All zones created by this function will have the following:
- The default filling method IG_REC_FM_DEFAULT
- The automatic recognition module selection IG_REC_RM_AUTO setting
- The IG_REC_FILTER_DEFAULT Character Set filter to specify the use of global filtering, if any
Any zone can be locally overridden with the IG_REC_zone_info_set() function, which allows you to change the attributes of a zone in the zone list.