Estimating the relative rate of recombination to mutation in bacteria from single-locus variants using composite likelihood methods. (English) Zbl 1454.62331

Summary: A number of studies have suggested using comparisons between DNA sequences of closely related bacterial isolates to estimate the relative rate of recombination to mutation for that bacterial species. We consider such an approach which uses single-locus variants: pairs of isolates whose DNA differ at a single gene locus. One way of deriving point estimates for the relative rate of recombination to mutation from such data is to use composite likelihood methods. We extend recent work in this area so as to be able to construct confidence intervals for our estimates, without needing to resort to computationally-intensive bootstrap procedures, and to develop a test for whether the relative rate varies across loci. Both our test and method for constructing confidence intervals are obtained by modeling the dependence structure in the data, and then applying asymptotic theory regarding the distribution of estimators obtained using a composite likelihood. We applied these methods to multi-locus sequence typing (MLST) data from eight bacteria, finding strong evidence for considerable rate variation in three of these: Bacillus cereus, Enterococcus faecium and Klebsiella pneumoniae.


62P10 Applications of statistics to biology and medical sciences; meta analysis


Full Text: DOI arXiv Euclid


[1] Didelot, X. and Falush, D. (2007). Inference of bacterial microevolution using multilocus sequence data. Genetics 175 1251-1266.
[2] Didelot, X., Lawson, D. and Falush, D. (2009). SimMLST: Simulation of multi-locus sequence typing data under a neutral model. Bioinformatics 25 1442-1444.
[3] Didelot, X. and Maiden, M. C. J. (2010). Impact of recombination on bacterial evolution. Trends in Microbiology 18 315-322.
[4] Didelot, X., Lawson, D., Darling, A. and Falush, D. (2010). Inference of homologous recombination in bacteria using whole-genome sequences. Genetics 186 1435-1449.
[5] Donnelly, P. and Tavaré, S. (1995). Coalescents and genealogical structure under neutrality. Annu. Rev. Genet. 29 401-421.
[6] Feil, E. J., Maiden, M. C. J., Achtman, M. and Spratt, B. G. (1999). The relative contributions of recombination and mutation to the divergence of clones of Neisseria meningitidis. Molecular Biology and Evolution 16 1496-1502.
[7] Feil, E. J., Smith, J. M., Enright, M. C. and Spratt, B. G. (2000). Estimating recombinational parameters in Streptococcus pneumoniae from multilocus sequence typing data. Genetics 154 1439-1450.
[8] Fraser, C., Hanage, W. P. and Spratt, B. G. (2007). Recombination and the nature of bacterial speciation. Science 315 476-480.
[9] Griffiths, R. C. and Marjoram, P. (1997). An ancestral recombination graph. In Progress in Population Genetics and Human Evolution ( Minneapolis , MN , 1994). IMA Vol. Math. Appl. 87 257-270. Springer, New York. · Zbl 0893.92020 · doi:10.1007/978-1-4757-2609-1_16
[10] Guy, L., Nystedt, B., Sun, Y., Naslund, K., Berglund, E. and Andersson, S. G. (2012). A genome-wide study of recombination rate variation in Bartonella henselae. BMC Evolutionary Biology 12 65.
[11] Hudson, R. R. (2001). Two-locus sampling distributions and their application. Genetics 159 1805-1817.
[12] Jolley, K. A. and Maiden, M. C. J. (2010). Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics 11 595 .
[13] Kent, J. T. (1982). Robust properties of likelihood ratio tests. Biometrika 69 19-27. · Zbl 0485.62031
[14] Larribe, F. and Fearnhead, P. (2011). On composite likelihoods in statistical genetics. Statist. Sinica 21 43-69. · Zbl 1206.62171
[15] Low, K. B. and Porter, D. D. (1978). Modes of gene transfer and recombination in bacteria. Annu. Rev. Genet. 12 249-287.
[16] Maiden, M. C. J., Bygraves, J. A., Feil, E., Morelli, G., Russell, J. E., Urwin, R., Zhang, Q., Zhou, J., Zurth, K., Caugant, D. A., Feavers, I. M., Achtman, M. and Spratt, B. G. (1998). Multilocus sequence typing: A portable approach to the identification of clones within populations of pathogenic microorganisms. Proc. Natl. Acad. Sci. USA 95 3140-3145.
[17] McVean, G. A. T., Awadalla, P. and Fearnhead, P. (2002). A coalescent method for detecting recombination from gene sequences. Genetics 160 1231-1241.
[18] Milkman, R. and Bridges, M. M. (1990). Molecular evolution of the Escherichia coli chromosome. III. Clonal frames. Genetics 126 505-517.
[19] Molenberghs, G. and Verbeke, G. (2005). Models for Discrete Longitudinal Data . Springer, New York. · Zbl 1093.62002 · doi:10.1007/0-387-28980-1
[20] Pérez-Losada, M., Browne, E. B., Madsen, A., Wirth, T., Viscidi, R. P. and Crandall, K. A. (2006). Population genetics of microbial pathogens estimated from multilocus sequence typing (MLST) data. Infection , Genetics and Evolution 6 97-112.
[21] Rotnitzky, A. and Jewell, N. P. (1990). Hypothesis testing of regression parameters in semiparametric generalized linear models for cluster correlated data. Biometrika 77 485-497. · Zbl 0734.62075 · doi:10.1093/biomet/77.3.485
[22] Sheppard, S. K., McCarthy, N. D., Falush, D. and Maiden, M. C. J. (2008). Convergence of Campylobacter species: Implications for bacterial evolution. Science 320 237-239.
[23] Spratt, B. G., Hanage, W. P. and Feil, E. J. (2001). The relative contributions of recombination and point mutation to the diversification of bacterial clones. Current Opinion in Microbiology 4 602-606.
[24] Varin, C. (2008). On composite marginal likelihoods. AStA Adv. Stat. Anal. 92 1-28. · Zbl 1171.62315 · doi:10.1007/s10182-008-0060-7
[25] Varin, C., Reid, N. and Firth, D. (2011). An overview of composite likelihood methods. Statist. Sinica 21 5-42. · Zbl 1534.62022
[26] Vos, M. (2009). Why do bacteria engage in homologous recombination? Trends Microbiol. 17 226-232.
[27] Vos, M. and Didelot, X. (2009). A comparison of homologous recombination rates in bacteria and archaea. ISME J. 3 199-208.
[28] Wakeley, J. (2007). Coalescent Theory : An Introduction . Roberts and Company, Denver, CO. · Zbl 1366.92001
[29] Yu, S., Fearnhead, P., Holland, B. R., Biggs, P., Maiden, M. and French, N. (2012). Estimating the relative roles of recombination and point mutation in the generation of single locus variants in Campylobacter jejuni and Campylobacter coli. J. Mol. Evol. 74 273-280.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.