Řehůřek, Radim; Sojka, Petr Automated classification and categorization of mathematical knowledge. (English) Zbl 1166.68358 Autexier, Serge (ed.) et al., Intelligent computer mathematics. 9th international conference, AISC 2008, 15th symposium, Calculemus 2008, 7th international conference, MKM 2008, Birmingham, UK, July 28–August 1, 2008. Proceedings. Berlin: Springer (ISBN 978-3-540-85109-7/pbk). Lecture Notes in Computer Science 5144. Lecture Notes in Artificial Intelligence, 543-557 (2008). Summary: There is a common Mathematics Subject Classification (MSC) System used for categorizing mathematical papers and knowledge. We present results of machine learning of the MSC on full texts of papers in the mathematical digital libraries DML-CZ and NUMDAM. The F1-measure achieved on classification task of top-level MSC categories exceeds 89%. We describe and evaluate our methods for measuring the similarity of papers in the digital library based on paper full texts.For the entire collection see [Zbl 1154.68002]. MSC: 68T30 Knowledge representation 00A35 Methodology of mathematics 68T05 Learning and adaptive systems in artificial intelligence Software:MSC PDF BibTeX XML Cite \textit{R. Řehůřek} and \textit{P. Sojka}, Lect. Notes Comput. Sci. 5144, 543--557 (2008; Zbl 1166.68358) Full Text: DOI OpenURL