A posterior probability approach for gene regulatory network inference in genetic perturbation data.

*(English)*Zbl 1388.62338Summary: Inferring gene regulatory networks is an important problem in systems biology. However, these networks can be hard to infer from experimental data because of the inherent variability in biological data as well as the large number of genes involved. We propose a fast, simple method for inferring regulatory relationships between genes from knockdown experiments in the NIH LINCS dataset by calculating posterior probabilities, incorporating prior information. We show that the method is able to find previously identified edges from TRANSFAC and JASPAR and discuss the merits and limitations of this approach.

##### MSC:

62P10 | Applications of statistics to biology and medical sciences; meta analysis |

62F15 | Bayesian inference |

92D10 | Genetics and epigenetics |

92C42 | Systems biology, networks |

PDF
BibTeX
XML
Cite

\textit{W. C. Young} et al., Math. Biosci. Eng. 13, No. 6, 1241--1251 (2016; Zbl 1388.62338)

Full Text:
DOI

##### References:

[1] | M. Bansal, Inference of gene regulatory networks and compound mode of action from time course gene expression profiles,, Bioinformatics, 22, 815, (2006) |

[2] | K. Basso, Reverse engineering of regulatory networks in human B cells,, Nature Genetics, 37, 382, (2005) |

[3] | P. Bühlmann, High-dimensional statistics with a view towards applications in biology,, Annual Review of Statistics and Its Application, 1, 255, (2014) |

[4] | E. Y. Chen, Enrichr: Interactive and collaborative HTML5 gene list enrichment analysis tool,, BMC Bioinformatics, 14, 128, (2013) |

[5] | S. Christley, Incorporating existing network information into gene network inference,, PLoS One, 4, (2009) |

[6] | M. Clyde, Model uncertainty,, Statistical Science, 19, 81, (2004) · Zbl 1062.62044 |

[7] | A. P. Dempster, Maximum likelihood from incomplete data via the EM algorithm,, Journal of the Royal Statistical Society. Series B (Methodological), 39, 1, (1977) · Zbl 0364.62022 |

[8] | P. D’haeseleer, Linear modeling of mRNA expression levels during CNS development and injury,, Pacific Symposium on Biocomputing, 4, 41, (1999) |

[9] | C. Ding, Minimum redundancy feature selection from microarray gene expression data,, Bioinformatics Conference, 523, (2003) |

[10] | DREAM4 In Silico Network Challenge, website,, <a href= |

[11] | Q. Duan, LINCS Canvas Browser: Interactive web app to query, browse and interrogate LINCS L1000 gene expression signatures,, Nucleic Acids Research, 42, (2014) |

[12] | S. A. Dunbar, Applications of Luminex® \(xMAP^{TM}\) technology for rapid, high-throughput multiplexed nucleic acid detection,, Clinica Chimica Acta, 363, 71, (2006) |

[13] | J. J. Faith, Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles,, PLoS Biol, 5, (2007) |

[14] | N. Friedman, Using bayesian networks to analyze expression data,, RECOMB ’00 Proceedings of the fourth annual international conference on Computational molecular biology, 127, (2000) |

[15] | H. Fröhlich, Large scale statistical inference of signaling pathways from RNAi and microarray data,, BMC Bioinformatics, 8, (2007) |

[16] | N. Guelzim, Topological and causal structure of the yeast transcriptional regulatory network,, Nature Genetics, 31, 60, (2002) |

[17] | M. Gustafsson, Reverse engineering of gene networks with LASSO and nonlinear basis functions,, Annals of the New York Academy of Sciences, 1158, 265, (2009) |

[18] | M. Hecker, Gene regulatory network inference: Data integration in dynamic models, A review,, Biosystems, 96, 86, (2009) |

[19] | J. A. Hoeting, Bayesian model averaging: A tutorial,, Statistical Science, 14, 382, (1999) · Zbl 1059.62525 |

[20] | R. E. Kass, Bayes factors,, Journal of the American Statistical Association, 90, 773, (1995) · Zbl 0846.62028 |

[21] | S. Y. Kim, Inferring gene networks from time series microarray data using dynamic Bayesian networks,, Briefings in Bioinformatics, 4, 228, (2003) |

[22] | S. Y. Kim, Dynamic Bayesian network and nonparametric regression for nonlinear modeling of gene networks from time series gene expression data,, Computational Methods in Systems Biology, 2602, 104, (2003) · Zbl 1112.92322 |

[23] | S. Klamt, TRANSWESD: inferring cellular networks with transitive reduction,, Bioinformatics, 26, 2160, (2010) |

[24] | S. Lèbre, Statistical inference of the time-varying structure of gene-regulation networks,, BMC Systems Biology, 4, (2010) |

[25] | W. Lee, Computational methods for discovering gene networks from expression data,, Briefings in Bioinformatics, 10, 408, (2009) |

[26] | J. Li, Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data,, Statistical Methods in Medical Research, 22, 519, (2013) |

[27] | Z. Li, Large-scale dynamic gene regulatory network inference combining differential equation models with local dynamic Bayesian network analysis,, Bioinformatics, 27, 2686, (2011) |

[28] | Library of Integrated Network-based Cellular Signatures (LINCS), website,, <a href= |

[29] | K. Lo, Integrating external biological knowledge in the construction of regulatory networks from time-series expression data,, BMC Systems Biology, 6, (2012) |

[30] | F. M. Lopes, Inference of gene regulatory networks from time series by Tsallis entropy,, BMC Systems Biology, 5, (2011) |

[31] | M. J. McGeachie, CGBayesNets: Conditional Gaussian Bayesian network learning and inference with mixed discrete and continuous data,, PLoS Computational Biology, 10, (2014) |

[32] | D. Marbach, Generating realistic in silico gene networks for performance assessment of reverse engineering methods,, Journal of Computational Biology, 16, 229, (2009) |

[33] | D. Marbach, Revealing strengths and weaknesses of methods for gene network inference,, Proceedings of the National Academy of Sciences, 107, 6286, (2010) |

[34] | A. A. Margolin, ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context,, BMC Bioinformatics, 7, (2006) |

[35] | F. Markowetz, Inferring cellular networks: A review,, BMC Bioinformatics, 8, (2007) |

[36] | P. Menéndez, Gene regulatory networks from multifactorial perturbations using Graphical Lasso: Application to the DREAM4 challenge,, PLoS One, 5, (2010) |

[37] | P. E. Meyer, Information-theoretic inference of large transcriptional regulatory networks,, EURASIP Journal on Bioinformatics and Systems Biology, 2007, (2007) |

[38] | P. E. Meyer, minet: A R/Bioconductor package for inferring large transcriptional networks using mutual information,, BMC Bioinformatics, 9, (2008) |

[39] | G. Michailidis, Autoregressive models for gene regulatory network inference: Sparsity, stability and causality issues,, Mathematical Biosciences, 246, 326, (2013) · Zbl 1308.92032 |

[40] | K. Murphy, <em>Modelling Gene Expression Data Using Dynamic Bayesian Networks</em>,, Vol. 104. Technical report, (1999) |

[41] | A. Pinna, From knockouts to networks: Establishing direct cause-effect relationships through graph analysis,, PLoS One, 5, (2010) |

[42] | A. E. Raftery, Bayesian model averaging for linear regression models,, Journal of the American Statistical Association, 92, 179, (1997) · Zbl 0888.62026 |

[43] | A. E. Raftery, Bayes factors and BIC,, Sociological Methods & Research, 27, 411, (1999) |

[44] | S. Rogers, A Bayesian regression approach to the inference of regulatory networks from gene expression data,, Bioinformatics, 21, 3131, (2005) |

[45] | F. H. M. Salleh, Reconstructing gene regulatory networks from knockout data using Gaussian Noise Model and Pearson Correlation Coefficient,, Computational Biology and Chemistry, 59, 3, (2015) |

[46] | M. Sanchez-Castillo, Bayesian sparse factor model for transcriptional regulatory networks inference,, Signal Processing Conference (EUSIPCO), 1, (2013) |

[47] | A. Sandelin, JASPAR: an open-access database for eukaryotic transcription factor binding profiles,, Nucleic Acids Research, 32, (2004) |

[48] | M. Scutari, Learning Bayesian Networks with the bnlearn R Package,, Journal of Statistical Software, 35, 1, (2010) |

[49] | A. Shojaie, Analysis of gene sets based on the underlying regulatory network,, Journal of Computational Biology, 16, 407, (2009) |

[50] | A. Shojaie, Discovering graphical Granger causality using the truncating lasso penalty,, Bioinformatics, 26, (2010) |

[51] | A. Shojaie, Inferring regulatory networks by combining perturbation screens and steady state gene expression profiles,, PLoS One, 9, (2014) |

[52] | R. Tibshirani, Regression shrinkage and selection via the lasso,, Journal of the Royal Statistical Society. Series B (Methodological), 58, 267, (1996) · Zbl 0850.62538 |

[53] | V. G. Tusher, Significance analysis of microarrays applied to the ionizing radiation response,, Proceedings of the National Academy of Sciences, 98, 5116, (2001) · Zbl 1012.92014 |

[54] | N. Verzelen, Minimax risks for sparse regressions: Ultra-high dimensional phenomenons,, Electronic Journal of Statistics, 6, 38, (2012) · Zbl 1334.62120 |

[55] | M. J. Wainwright, Sharp thresholds for high-dimensional and noisy sparsity recovery using-constrained quadratic programming (Lasso),, IEEE Transactions on Information Theory, 55, 2183, (2009) · Zbl 1367.62220 |

[56] | E. Wingender, TRANSFAC: an integrated system for gene expression regulation,, Nucleic Acids Research, 28, 316, (2000) |

[57] | K. Y. Yeung, Construction of regulatory networks using expression time-series data of a genotyped population,, Proceedings of the National Academy of Sciences, 108, 19436, (2011) |

[58] | C. Yoo, Discovery of causal relationships in a gene regulation pathway from a mixture of experimental and observational DNA microarray data,, Pacific Symposium on Biocomputing, 7, 498, (2002) |

[59] | W. C. Young, Fast Bayesian inference for gene regulatory networks using ScanBMA,, BMC Systems Biology, 8, 47, (2014) |

[60] | A. Zellner, On assessing prior distributions and Bayesian regression analysis with g-prior distributions,, Bayesian Inference and Decision Techniques: Essays in Honor of Bruno De Finetti, 6, 233, (1986) · Zbl 0655.62071 |

[61] | P. Zoppoli, TimeDelay-ARACNE: Reverse engineering of gene networks from time-course data by an information theoretic approach,, BMC Bioinformatics, 11, (2010) |

[62] | M. Zou, A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data,, Bioinformatics, 21, 71, (2005) |

This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.