×

Quantitative trait loci mapping problem: an extinction-based multi-objective evolutionary algorithm approach. (English) Zbl 1461.92063

Summary: The Quantitative Trait Loci (QTL) mapping problem aims to identify regions in the genome that are linked to phenotypic features of the developed organism that vary in degree. It is a principle step in determining targets for further genetic analysis and is key in decoding the role of specific genes that control quantitative traits within species. Applications include identifying genetic causes of disease, optimization of cross-breeding for desired traits and understanding trait diversity in populations. In this paper a new multi-objective evolutionary algorithm (MOEA) method is introduced and is shown to increase the accuracy of QTL mapping identification for both independent and epistatic loci interactions. The MOEA method optimizes over the space of possible partial least squares (PLS) regression QTL models and considers the conflicting objectives of model simplicity versus model accuracy. By optimizing for minimal model complexity, MOEA has the advantage of solving the over-fitting problem of conventional PLS models. The effectiveness of the method is confirmed by comparing the new method with Bayesian Interval Mapping approaches over a series of test cases where the optimal solutions are known. This approach can be applied to many problems that arise in analysis of genomic data sets where the number of features far exceeds the number of observations and where features can be highly correlated.

MSC:

92D10 Genetics and epigenetics
68T05 Learning and adaptive systems in artificial intelligence

Software:

WinQTLCart; SimWalk2
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Devoto, M.; Falchi, M.; Genetic mapping of quantitative trait loci for disease-related phenotypes Quantitative Trait Loci (QTL); Methods Mol. Biol.: 2012; Volume 871 ,281-311.
[2] Diaz, A.; Fergany, M.; Formisano, G.; Ziarsolo, P.; Blanca, J.; Fei, Z.; Staub, J.; Zalapa, J.; Cuevas, H.; Dace, G.; A consensus linkage map for molecular markers and Quantitative Trait Loci associated with economically important traits in melon (Cucumis melo L.); BMC Plant Biol.: 2011; Volume 11 .
[3] Kompass, K.; Witte, J.; Co-regulatory expression quantitative trait loci mapping: Method and application to endometrial cancer; BMC Med. Genomics: 2011; Volume 4 .
[4] Broman, K.W.; Identifying Quantitative Trait Loci in Experimental Crosses; Ph.D. Dissertation: Berkeley, CA, USA 1997; .
[5] Dilda, C.L.; Mackay, T.F.C.; The genetic architecture of Drosophila sensory bristle number; Genetics: 2002; Volume 162 ,1655-1674.
[6] Mendel, G.; ; Experiments in Plant Hybridisation: Cambridge, MA, USA 1965; .
[7] Sax, K.; The association of size differences with seed-coat pattern and pigmentation in PHASEOLUS VULGARIS; Genetics: 1923; Volume 8 ,552-560.
[8] Thoday, J.M.; Location of polygenes; Nature: 1961; Volume 191 ,368-370.
[9] Edwards, M.D.; Stuber, C.W.; Wendel, J.F.; Molecular-marker-facilitated investigations of quantitative-trait loci in maize. I. Numbers, genomic distribution and types of gene action; Genetics: 1987; Volume 116 ,113-125.
[10] Beavis, W.D.; Grant, D.; Albertsen, M.; Fincher, R.; Quantitative trait loci for plant height in four maize populations and their associations with qualitative genetic loci; TAG Theor. Appl. Genet.: 1991; Volume 83 ,141-145.
[11] Stuber, C.W.; Lincoln, S.E.; Wolff, D.W.; Helentjaris, T.; Lander, E.S.; Identification of genetic factors contributing to heterosis in a hybrid from two elite maize inbred lines using molecular markers; Genetics: 1992; Volume 132 ,823-839.
[12] Paterson, A.; Tanksley, S.; Sorrells, M.; DNA Markers in Plant Improvement; Advances in Agronomy: Amsterdam, The Netherlands 1991; Volume Volume 46 ,39-90.
[13] DeVicente, M.C.; Tanksley, S.D.; QTL analysis of transgressive segregation in an interspecific tomato cross; Genetics: 1993; Volume 134 ,585-596.
[14] Huang, N.; Courtois, B.; Khush, G.S.; Lin, H.; Wang, G.; Wu, P.; Zheng, K.; Association of quantitative trait loci for plant height with major dwarfing genes in rice; Heredity: 1996; Volume 77 ,130-137.
[15] Gelado, P.; Kowalski, B.R.; Partial least-squares regression: A tutorial; Anal. Chim. Acta: 1985; Volume 185 ,1-17.
[16] Magidson, J.; Correlated Component Regression: Re-Thinking Regression in the Presence of Near Collinearity; New Perspectives in Partial Least Squares and Related Methods: Berlin/Heidelberg, Germany 2013; . · Zbl 1428.62254
[17] Bjørnstad, A.; Westad, F.; Martens, H.; Analysis of genetic marker-phenotype relationships by jack-knifed partial least squares regression (PLSR); Hereditas: 2004; Volume 141 ,149-165.
[18] Chun, H.; Keleş, S.; Expression quantitative trait loci mapping with multivariate sparse partial least squares regression; Genetics: 2009; Volume 182 ,79-90.
[19] Coster, A.; Calus, M.; Partial least square regression applied to the QTLMAS 2010 dataset; BMC Proc.: 2011; Volume 5 .
[20] Pirouz, D.M.; An Overview of Partial Least Squares; Social Science Research Network Working Paper Series: 2010; .
[21] Carlborg, O.; Andersson, L.; Kinghorn, B.; The use of a genetic algorithm for simultaneous mapping of multiple interacting quantitative trait loci; Genetics: 2000; Volume 155 ,2003-2010.
[22] Zhang, B.; Horvath, S.; Ridge regression based hybrid genetic algorithms for multi-locus quantitative trait mapping; Int. J. Bioinforma. Res. Appl.: 2005; Volume 1 ,261-272.
[23] Lee, S.; van der Werf, J.; Kinghorn, B.; Using an evolutionary algorithm and parallel computing for haplotyping in a general complex pedigree with multiple marker loci; BMC Bioinforma.: 2008; Volume 9 .
[24] Badzioch, M.D.; deFrance, H.B.; Jarvik, G.P.; An examination of the genotyping error detection function of SIMWALK2; BMC Genet.: 2003; Volume 4 .
[25] Fonseca, C.M.; Fleming, P.J.; Genetic Algorithms for Multiobjective Optimization: Formulation, Discussion and Generalization; Proceedings of the 5th International Conference on Genetic Algorithms: ; ,416-423.
[26] Ursem, R.K.; Models for Evolutionary Algorithms and Their Applications in System Identification and Control Optimization; Ph.D. Thesis: Aarhus, Danmark 2003; .
[27] Greenwood, G.W.; Fogel, G.B.; Ciobanu, M.; Emphasizing Extinction in Evolutionary Programming; Proceedings of the IEEE 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406): ; ,666-671.
[28] Grefenstette, J.; Genetic Algorithms for Changing Environments; Proceedings of the Parallel Problem Solving from Nature 2: ; ,137-144.
[29] Krink, T.; Thomsen, R.; Self-organized Criticality and Mass Extinction in Evolutionary Algorithms; Proceedings of the IEEE 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546): ; ,1155-1161.
[30] Raup, D.M.; Biological extinction in earth history; Science: 1986; Volume 231 ,1528-1533.
[31] Ghaffarizadeh, A.; Ahmadi, K.; Eftekhari, M.; Adding Crossover to Extinction-Based Evolutionary Algorithms; Proceedings of the ICCEE ’09, 2009 Second International Conference on Computer and Electrical Engineering: Washington, DC, USA 2009; ,43-48.
[32] Langdon, W.B.; Poli, R.; Fitness Causes Bloat; Soft Computing in Engineering Design and Manufacturing: Berlin/Heidelberg, Germany 1997; ,23-27.
[33] Ghaffarizadeh, A.; Ahmadi, K.; Flann, N.S.; Sorting Unsigned Permutations by Reversals Using Multi-objective Evolutionary Algorithms with Variable Size Individuals; Proceedings of the 2011 IEEE Congress on Evolutionary Computation (CEC): ; ,292-295.
[34] Kajitani, I.; Hoshino, T.; Iwata, M.; Higuchi, T.; Variable Length Chromosome GA for Evolvable Hardware; Proceedings of the IEEE International Conference on Evolutionary Computation: ; ,443-447.
[35] Broman, K.W.; Speed, T.P.; A model selection approach for the identification of quantitative trait loci in experimental crosses; J. R. Stat. Soc. Ser. B: 2002; Volume 64 ,641-656. · Zbl 1067.62108
[36] Wang, S.; Basten, C.J.; Zeng, Z.B.; ; Windows QTL Cartographer 2.5: Raleigh, NC, USA 2011; .
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.