Matches text from an ImageGear multi-page document.
Declaration:
```c
AT_ERRCOUNT ACCUAPI IG_REC_approx_regex_search_document(
    const HIG_REC_APPROX_REGEX higRecApproxRegex,
    const HMIGEAR higDocument,
    const AT_REC_MATCH_RESULT **const match_result_array,
    AT_INT *const match_result_size
);
```
Arguments:
Name | Description |
---|---|
higRecApproxRegex | The handle to an approximate regex instance. |
higDocument | The handle to an ImageGear multi-page document. |
match_result_array | A pointer to a vector of match result instances. The approximate regex owns this data. Callers that require access to this data after the approximate regex is deleted must duplicate this data and assume ownership of the copy. |
match_result_size | The number of match results in match_result_array. |
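
The ownership rule for match_result_array can be illustrated with a minimal sketch of a deep copy. The `DEMO_MATCH_RESULT` struct, its `page` and `text` fields, and the `demo_copy_results` and `demo_free_results` helpers are hypothetical stand-ins invented for this illustration; the actual layout of AT_REC_MATCH_RESULT is not shown in this section, so the copy logic would need to be adapted to the real fields.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a match result. The real AT_REC_MATCH_RESULT
   layout is not shown in this section and may differ. */
typedef struct {
    int   page;  /* assumed: index of the page containing the match */
    char *text;  /* assumed: matched text, owned by whoever produced it */
} DEMO_MATCH_RESULT;

/* Frees an array previously returned by demo_copy_results. */
void demo_free_results(DEMO_MATCH_RESULT *results, int count)
{
    int i;
    if (results == NULL)
        return;
    for (i = 0; i < count; ++i)
        free(results[i].text);
    free(results);
}

/* Deep-copies `count` results so the copy stays valid after the
   producer's storage is released. Returns NULL when count is 0 or
   when an allocation fails. The caller owns the returned array. */
DEMO_MATCH_RESULT *demo_copy_results(const DEMO_MATCH_RESULT *src, int count)
{
    DEMO_MATCH_RESULT *dst;
    int i;

    assert(src != NULL || count == 0);  /* precondition */
    if (count <= 0)
        return NULL;

    dst = calloc((size_t)count, sizeof *dst);
    if (dst == NULL)
        return NULL;

    for (i = 0; i < count; ++i) {
        dst[i].page = src[i].page;
        if (src[i].text != NULL) {
            size_t len = strlen(src[i].text) + 1;
            dst[i].text = malloc(len);
            if (dst[i].text == NULL) {
                demo_free_results(dst, i);  /* release what was copied so far */
                return NULL;
            }
            memcpy(dst[i].text, src[i].text, len);
        }
    }
    return dst;
}
```

A caller following this pattern would copy the results before deleting the approximate regex and later release the copy itself. A plain `memcpy` of the array would only be enough if the real structure holds no pointers into regex-owned memory.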
Return Value:
The number of ImageGear errors that occurred during this function call. If there are no errors, the return value is IGE_SUCCESS.
Supported Raster Image Formats:
This function does not process image pixels.
Remarks:
Text recognition is performed on each page. The resulting text from each page is searched for pattern matches.
Supports matching Asian language text.
The following installed callbacks are invoked during execution of this function:
- Recognized Word Callback. Invoked after a word boundary is recognized. Inspect the arguments to identify Unicode character codes, bounding rectangles, and confidence rating. Words are sorted in logical order.
- Match Callback. Invoked after a pattern match is detected. Matched text may be either accepted or rejected. Only accepted matches will be present in match_result_array. If a Match Callback is not installed, then all matches are accepted.
- Progress Callback. Invoked periodically so that the caller can abort a lengthy search operation. If the search is aborted, only those matches already accepted are returned in match_result_array, and a warning is reported in the error stack.
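
The accept/reject and abort behavior described above can be modeled with a small self-contained sketch. Nothing here is ImageGear API: the callback types, `demo_search`, `demo_reject_short`, and `demo_stop_after` are hypothetical names, and the model assumes at most one candidate match per page and a progress check once per page, which the real function may not do.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical callback types; the real ImageGear callback signatures
   are not shown in this section. */
typedef int (*demo_match_cb)(const char *matched_text, void *user);          /* nonzero = accept */
typedef int (*demo_progress_cb)(int pages_done, int page_count, void *user); /* zero = abort */

/* Models the search loop: page_match[i] is the candidate match on page i,
   or NULL when the page has none. Accepted matches are written to
   `accepted` (capacity must be at least page_count). Returns the number
   of accepted matches and sets *aborted to 1 if the search was stopped. */
int demo_search(const char *const *page_match, int page_count,
                demo_match_cb on_match, demo_progress_cb on_progress,
                void *user, const char **accepted, int *aborted)
{
    int page, n = 0;

    assert(accepted != NULL && aborted != NULL);
    *aborted = 0;

    for (page = 0; page < page_count; ++page) {
        /* Progress callback: a zero return requests an abort. Matches
           accepted before this point are still returned. */
        if (on_progress != NULL && !on_progress(page, page_count, user)) {
            *aborted = 1;
            break;
        }
        if (page_match[page] == NULL)
            continue;
        /* With no match callback installed, every match is accepted. */
        if (on_match == NULL || on_match(page_match[page], user))
            accepted[n++] = page_match[page];
    }
    return n;
}

/* Example match callback: rejects matches shorter than three characters. */
int demo_reject_short(const char *matched_text, void *user)
{
    (void)user;
    return strlen(matched_text) >= 3;
}

/* Example progress callback: aborts once *user pages have been processed. */
int demo_stop_after(int pages_done, int page_count, void *user)
{
    (void)page_count;
    return pages_done < *(const int *)user;
}
```

In this model, a rejected match never reaches the output array, and an aborted search returns a shorter but still valid result set, which mirrors the behavior described for match_result_array.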
Example:
```c
AT_ERRCOUNT errCount = 0;
HMIGEAR higDocument = 0;
HIG_REC_APPROX_REGEX higApproxRegex = 0;
LPAT_WCHAR patterns[] = { L".+" };
const AT_REC_MATCH_RESULT* match_result_array = 0;
AT_INT match_result_size = 0;

/* Open the multi-page document read-only. */
IG_mpi_create( &higDocument, 0 );
IG_mpi_file_open( "multi.tif", higDocument, 0, IG_MP_OPENMODE_READONLY );

/* Create the approximate regex and give it a single pattern. */
IG_REC_approx_regex_create( &higApproxRegex );
IG_REC_approx_regex_pattern_set( higApproxRegex, patterns, 1 );

/* Recognize text on every page and search it for pattern matches. */
errCount = IG_REC_approx_regex_search_document( higApproxRegex, higDocument, &match_result_array, &match_result_size );

/* match_result_array is owned by the regex: use or copy it before this point. */
IG_REC_approx_regex_delete( higApproxRegex );
IG_mpi_delete( higDocument );
```