## Parameter advising for multiple sequence alignment.(English)Zbl 1388.92001

Computational Biology 26. Cham: Springer (ISBN 978-3-319-64917-7/hbk; 978-3-319-64918-4/ebook). xiv, 152 p. (2017).
Publisher’s description: This book develops a new approach called parameter advising for finding a parameter setting for a sequence aligner that yields a quality alignment of a given set of input sequences. In this framework, a parameter advisor is a procedure that automatically chooses a parameter setting for the input, and has two main ingredients:
$$\bullet$$
(a) the set of parameter choices considered by the advisor, and
$$\bullet$$
(b) an estimator of alignment accuracy used to rank alignments produced by the aligner.
On coupling a parameter advisor with an aligner, once the advisor is trained in a learning phase, the user simply inputs sequences to align, and receives an output alignment from the aligner, where the advisor has automatically selected the parameter setting.
The chapters first lay out the foundations of parameter advising, and then cover applications and extensions of advising. The content
$$\bullet$$
examines formulations of parameter advising and their computational complexity,
$$\bullet$$
develops methods for learning good accuracy estimators,
$$\bullet$$
presents approximation algorithms for finding good sets of parameter choices, and
$$\bullet$$
assesses software implementations of advising that perform well on real biological data.
Also explored are applications of parameter advising to
$$\bullet$$
adaptive local realignment, where advising is performed on local regions of the sequences to automatically adapt to varying mutation rates, and
$$\bullet$$
ensemble alignment, where advising is applied to an ensemble of aligners to effectively yield a new aligner of higher quality than the individual aligners in the ensemble.
The book concludes by offering future directions in advising research.

### MSC:

 92-02 Research exposition (monographs, survey articles) pertaining to biology 92-08 Computational methods for problems pertaining to biology 92D20 Protein sequences, DNA sequences
Full Text: