Liu, Tingting; Cheng, Tao; Jin, Gangzeng; Wang, Xikun; Gao, Ming Recognition of mathematical formulas based on support vector machines. (Chinese. English summary) Zbl 1438.68165 J. East China Norm. Univ., Nat. Sci. Ed. 2019, No. 3, 78-85 (2019). Summary: The recognition of mathematical formulas has been widely used in intelligent education applications, such as searching for answers to questions in image format, automatic marking, and constructing a database of questions. Mathematical formulas often exist in the form of images in many applications; hence, identifying the formulas in these images is an important research topic in the field of intelligent education. Given the complex structure of mathematical formulas, however, recognizing their presence within images is far more complicated than a general optical character recognition task. This paper decomposes formula recognition into three steps: character segmentation, character recognition, and formula reconstruction. First, the characters are separated from an image by using a combination of projection and connected-domain methods. Second, the features of characters are extracted based on the proportion of pixels in a single character relative to pixels in all characters, and a supervised learning model is established to identify each character. Finally, the mathematical formula is reconstructed based on the location of each character in the formula. Experimental results on a real data set show that the proposed mathematical formula recognition method can achieve an accuracy of up to 98.0%. MSC: 68T10 Pattern recognition, speech recognition 68T05 Learning and adaptive systems in artificial intelligence Keywords:mathematical formulae recognition; support vector machine; optical character recognition PDFBibTeX XMLCite \textit{T. Liu} et al., J. East China Norm. Univ., Nat. Sci. Ed. 2019, No. 3, 78--85 (2019; Zbl 1438.68165) Full Text: DOI