swMATH ID: 34214
Software Authors: A. Pesaranghader, S. Matwin, M. Sokolova, R. G. Beiko
Description: simDEF: definition-based semantic similarity measure of gene ontology terms for functional similarity analysis of genes. Motivation: Measures of protein functional similarity are essential tools for function prediction, evaluation of protein–protein interactions (PPIs) and other applications. Several existing methods perform comparisons between proteins based on the semantic similarity of their GO terms; however, these measures are highly sensitive to modifications in the topological structure of GO, tend to be focused on specific analytical tasks and concentrate on the GO terms themselves rather than considering their textual definitions. Results: We introduce simDEF, an efficient method for measuring semantic similarity of GO terms using their GO definitions, which is based on the Gloss Vector measure commonly used in natural language processing. The simDEF approach builds optimized definition vectors for all relevant GO terms, and expresses the similarity of a pair of proteins as the cosine of the angle between their definition vectors. Relative to existing similarity measures, when validated on a yeast reference database, simDEF improves correlation with sequence homology by up to 50
Homepage: https://academic.oup.com/bioinformatics/article/32/9/1380/1743954
Related Software: LONE
Cited in: 1 Publication

Citations by Year