Wang, Jason T. L.; Zaki, Mohammed J.; Toivonen, Hannu T. T.; Shasha, Dennis
Data mining in bioinformatics. (English) Zbl 1060.68026
Advanced Information and Knowledge Processing. London: Springer (ISBN 1-85233-671-4/hbk). xi, 340 p. (2005).

Publisher’s description: The goal of this book is to help readers understand state-of-the-art techniques in biological data mining and data management and includes topics such as: preprocessing tasks such as data cleaning and data integration as applied to biological data; classification and clustering techniques for microarrays; comparison of RNA structures based on string properties and energetics; discovery of the sequence characteristics of different parts of the genome; mining of haplotypes to find disease markers; sequencing of events leading to the folding of a protein; inference of the subcellular location of protein activity; classification of chemical compounds based on structure; special purpose metrics and index structures for phylogenetic applications; a new query language for protein searching based on the shape of proteins; very fast indexing schemes for sequences and pathways. Aimed at computer scientists, necessary biology is explained.

The articles of this volume will not be indexed individually.

Cited in 4 Documents

MSC:
68P05 Data structures
68U99 Computing methodologies and applications
68-01 Introductory exposition (textbooks, tutorial papers, etc.) pertaining to computer science
68T05 Learning and adaptive systems in artificial intelligence
92-04 Software, source code, etc. for problems pertaining to biology

Keywords: data management; data cleaning; data integration; biological data

Software: CLUTO; PHYLIP; Torch; E-CELL; RadCon; CAP3; MrBayes; Arbor3D; T-coffee; KEGG; PipMaker; AVID; REPuter; Warmr; REVEAL; CloSpan; CloseGraph; CLUSTAL; BLAT; gSpan; PatternHunter; TreeBASE; ClustalW; TreeWiz; TreeDT; CMAR