an:07293401
Zbl 07293401
Konovalenko, Ivan Andreevich; Kokhan, Vladislav Vladimirovich; Nikolaev, Dmitri?? Petrovich
Maximal coordinate discrepancy as accuracy criterion of image projective normalization for optical recognition of documents
EN
Vestn. Yuzhno-Ural. Gos. Univ., Ser. Mat. Model. Program. 13, No. 3, 43-58 (2020).
00456768
2020
j
90C25 90C32 90C90
orthocorrection; perspective correction; image projective normalization; optical character recognition; accuracy criteria; coordinate discrepancy; nonlinear programming
Summary: Application of projective normalization (a special case of orthocorrection and perspective correction) to photographs of documents for their further optical recognition is generally accepted. In this case, inaccuracies of normalization can lead to recognition errors. To date, a number of normalization accuracy criteria are presented in the literature, but their conformity with recognition quality was not investigated. In this paper, for the case of a fixed structured document, we justify a uniform probabilistic model of recognition errors, according to which the probability of correct recognition of a character abruptly falls to zero with an increase in the coordinate discrepancy of this character. For this model, we prove that the image normalization accuracy criterion, which is equal to the maximal coordinate discrepancy in the text fields of a document, monotonously depends on the probability of correct recognition of the entire document. Also, we show that the problem on computing the maximal coordinate discrepancy is not reduced to the nearest known one, i.e. the linear-fractional programming problem. Finally, for the first time, we obtain an analytical solution to the problem on computing the maximal coordinate discrepancy on a union of polygons.