Loss of conservation of graph centralities in reverse-engineered transcriptional regulatory networks. (English) Zbl 1379.05104

Summary: Graph centralities are commonly used to identify and prioritize disease genes in transcriptional regulatory networks. Studies on small networks of experimentally validated protein-protein interactions underpin the general validity of this approach and extensions of such findings have recently been proposed for networks inferred from gene expression data. However, it is largely unknown how well gene centralities are preserved between the underlying biological interactions and the networks inferred from gene expression data. Specifically, while previous studies have evaluated the performance of inference methods on synthetic gene expression, it has not been established how the choice of inference method affects individual centralities in the network. Here, we compare two gene centrality measures between reference networks and networks inferred from corresponding simulated gene expression data, using a number of commonly used network inference methods. The results indicate that the centrality of genes is only moderately conserved for all of the inference methods used. In conclusion, caution should be exercised when inspecting centralities in reverse-engineered networks and further work will be required to establish the use of such networks for prioritizing disease genes.


05C82 Small world graphs, complex networks (graph-theoretic aspects)
05C90 Applications of graph theory
92C42 Systems biology, networks
Full Text: DOI


[1] Aittokallio, T; Schwikowski, B, Graph-based methods for analysing networks in cell biology, Brief Bioinforma, 7, 243-255, (2006)
[2] Albert, R, Scale-free networks in cell biology, J Cell Sci, 118, 4947-4957, (2005)
[3] Altay, G; Emmert-Streib, F, Inferring the conservative causal core of gene regulatory networks, BMC Syst Biol, 4, 132, (2010)
[4] Altay, G; Emmert-Streib, F, Revealing differences in gene network inference algorithms on the network level by ensemble methods, Bioinformatics, 26, 1738-1744, (2010)
[5] Basso, K; Margolin, AA; Stolovitzky, G; Klein, U; Dalla-Favera, R; Califano, A, Reverse engineering of regulatory networks in human b cells, Nat Genet, 37, 382-390, (2005)
[6] Brandes, U, A faster algorithm for betweenness centrality*, J Math Sociol, 25, 163-177, (2001) · Zbl 1051.91088
[7] Cerami, EG; Gross, BE; Demir, E; Rodchenkov, I; Babur, Ö; Anwar, N; Schultz, N; Bader, GD; Sander, C, Pathway commons, a web resource for biological pathway data, Nucleic Acids Res, 39, d685-d690, (2011)
[8] Chalancon, G; Ravarani, CN; Balaji, S; Martinez-Arias, A; Aravind, L; Jothi, R; Babu, MM, Interplay between gene expression noise and regulatory network architecture, Trends Genet, 28, 221-232, (2012)
[9] Cordero, D; Solé, X; Crous-Bou, M; Sanz-Pamplona, R; Paré, L; Guinó, E; Olivares, D; Berenguer, A; Santos, C; Salazar, R; etal., Large differences in global transcriptional regulatory programs of normal and tumor colon cells, BMC Cancer, 14, 708, (2014)
[10] Dancelli L, Manisera M, Vezzoli M (2013) On two classes of weighted rank correlation measures deriving from the spearman’s \(ρ\) Statistical Models for Data Analysis. Springer, pp 107-114 · Zbl 1306.62129
[11] De Matos Simoes R, Dehmer M, Emmert-Streib F (2013) B-cell lymphoma gene regulatory networks: biological consistency among inference methods. Frontiers in genetics, 4
[12] De Matos Simoes, R; Emmert-Streib, F, Bagging statistical network inference from large-scale gene expression data, PLoS One, 7, e33624, (2012) · Zbl 1417.92042
[13] Emmert-Streib F, De Matos Simoes R, Mullan P, Haibe-Kains B, Dehmer M (2014) The gene regulatory network for breast cancer: Integrated regulatory landscape of cancer hallmarks. Frontiers in genetics, 5
[14] Erdős P, Rényi A (1959) On random graphs, i. Publ Math Debr 6:290-297
[15] Estrada, E, Protein bipartivity and essentiality in the yeast protein- protein interaction network, J Proteome Res, 5, 2177-2184, (2006)
[16] Estrada, E, Virtual identification of essential proteins within the protein interaction network of yeast, Proteomics, 6, 35-40, (2006)
[17] Faith, JJ; Hayete, B; Thaden, JT; Mogno, I; Wierzbowski, J; Cottarel, G; Kasif, S; Collins, JJ; Gardner, TS, Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles, PLoS Biol, 5, e8, (2007)
[18] Freeman LC (1977) A set of measures of centrality based on betweenness. Sociometry, pp 35-41
[19] Gleich DF, Saunders M (2009) Models and algorithms for pagerank sensitivity. Stanford University
[20] Goh, K-I; Cusick, ME; Valle, D; Childs, B; Vidal, M; Barabási, A-L, The human disease network, Proc Nat Acad Sci, 104, 8685-8690, (2007)
[21] Hahn, MW; Kern, AD, Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks, Mol Biol Evol, 22, 803-806, (2005)
[22] Huynh-Thu, VA; Irrthum, A; Wehenkel, L; Geurts, P, Inferring regulatory networks from expression data using tree-based methods, PloS one, 5, e12776, (2010)
[23] Izudheen, S; Mathew, S, Cancer gene identification using graph centrality, Curr Sci, 105, 1143, (2013)
[24] Jeong, H; Mason, SP; Barabási, A-L; Oltvai, ZN, Lethality and centrality in protein networks, Nature, 411, 41-42, (2001)
[25] Jonsson, PF; Bates, PA, Global topological features of cancer proteins in the human interactome, Bioinformatics, 22, 2291-2297, (2006)
[26] Jörnsten, R; Abenius, T; Kling, T; Schmidt, L; Johansson, E; Nordling, TE; Nordlander, B; Sander, C; Gennemark, P; Funa, K; etal., Network modeling of the transcriptional effects of copy number aberrations in glioblastoma, Mol Syst Biol, 7, 486, (2011)
[27] Joy, MP; Brock, A; Ingber, DE; Huang, S, High-betweenness proteins in the yeast protein interaction network, BioMed Res Int, 2005, 96-103, (2005)
[28] Knaack, SA; Siahpirani, AF; Roy, S, A pan-cancer modular regulatory network analysis to identify common and cancer-specific network components, Cancer Inf, 13, 69, (2014)
[29] Knüpfer C, Dittrich P, Beckstein C (2004) Artificial gene regulation: A data source for validation of reverse bioengineering. In: Proceedings of the 6th German Workshop on Artificial Life (GWAL6), pp 66-75
[30] Koschützki, D; Schreiber, F, Centrality analysis methods for biological networks and their application to gene regulatory networks, Gene Regul Syst Biol, 2, 193, (2008)
[31] Langfelder, P; Horvath, S, Wgcna: an r package for weighted correlation network analysis, BMC bioinformatics, 9, 559, (2008)
[32] Langfelder, P; Mischel, PS; Horvath, S, When is hub gene selection better than standard meta-analysis, PloS one, 8, e61505, (2013)
[33] Liu, Z-P, Reverse engineering of genome-wide gene regulatory networks from gene expression data, Curr Genomics, 16, 3-22, (2015)
[34] Liu, Z-P; Wu, H; Zhu, J; Miao, H, Systematic identification of transcriptional and post-transcriptional regulations in human respiratory epithelial cells during influenza a virus infection, BMC Bioinforma, 15, 336, (2014)
[35] Ma, X; Gao, L, Biological network analysis: insights into structure and functions, Brief Funct Genomics, 11, 434-442, (2012)
[36] Marbach, D; Costello, JC; Küffner, R; Vega, NM; Prill, RJ; Camacho, DM; Allison, KR; Kellis, M; Collins, JJ; Stolovitzky, G; etal., Wisdom of crowds for robust gene network inference, Nat Methods, 9, 796-804, (2012)
[37] Marbach, D; Prill, RJ; Schaffter, T; Mattiussi, C; Floreano, D; Stolovitzky, G, Revealing strengths and weaknesses of methods for gene network inference, Proc Nat Acad Sci, 107, 6286-6291, (2010)
[38] Margolin, AA; Califano, A, Theory and limitations of genetic network inference from microarray data, Ann New York Acad Sci, 1115, 51-72, (2007)
[39] Margolin, AA; Nemenman, I; Basso, K; Wiggins, C; Stolovitzky, G; Favera, RD; Califano, A, Aracne: an algorithm for the reconstruction of gene regulatory networks in a Mammalian cellular context, BMC Bioinform, 7, s7, (2006)
[40] MATLAB (2015) version 7.10.0 (R2015a), The MathWorks Inc., Natick, Massachusetts
[41] Mendes, P; Sha, W; Ye, K, Artificial gene networks for objective comparison of analysis algorithms, Bioinformatics, 19, ii122-ii129, (2003)
[42] Meyer, PE; Kontos, K; Lafitte, F; Bontempi, G, Information-theoretic inference of large transcriptional regulatory networks, EURASIP J Bioinforms Syst Biol, 2007, 8-8, (2007)
[43] Meyer, PE; Lafitte, F; Bontempi, G, Minet: ar/bioconductor package for inferring large transcriptional networks using mutual information, BMC Bioinform, 9, 461, (2008)
[44] Ortutay, C; Vihinen, M, Identification of candidate disease genes by integrating gene ontologies and protein-interaction networks: case study of primary immunodeficiencies, Nucleic Acids Res, 37, 622-628, (2009)
[45] Özgür, A; Vu, T; Erkan, G; Radev, DR, Identifying gene-disease associations using centrality on a literature mined gene-interaction network, Bioinformatics, 24, i277-i285, (2008)
[46] Painter, MW; Davis, S; Hardy, R; Mathis, D; Benoist, C; Zhou, Y; Shinton, S; Hardy, R; Asinovski, N; Ergun, A; etal., Transcriptomes of the b and t lineages compared by multiplatform microarray profiling, J Immunol, 186, 3047-3057, (2011)
[47] Pozzi, F; Di Matteo, T; Aste, T, Exponential smoothing weighted correlations, Eur Phys J B, 85, 1-21, (2012)
[48] Schaffter, T; Marbach, D; Floreano, D, Genenetweaver: in silico benchmark generation and performance profiling of network inference methods, Bioinformatics, 27, 2263-2270, (2011)
[49] Siddani, BR; Pochineni, LP; Palanisamy, M, Candidate gene identification for systemic lupus erythematosus using network centrality measures and gene ontology, PloS one, 8, e81766, (2013)
[50] Tarsitano A (2009) Comparing the effectiveness of rank correlation statistics. P: Dip. di Economia e Statistica, University of della Calabria
[51] Tuna S, Niranjan M (2009) Cross-platform analysis with binarized gene expression data Pattern Recognition in Bioinformatics. Springer, pp 439-449
[52] Bulcke, T; Leemput, K; Naudts, B; Remortel, P; Ma, H; Verschoren, A; De Moor, B; Marchal, K, Syntren: a generator of synthetic gene expression data for design and analysis of structure learning algorithms, BMC Bioinform, 7, 43, (2006)
[53] Wachi, S; Yoneda, K; Wu, R, Interactome-transcriptome analysis reveals the high centrality of genes differentially expressed in lung cancer tissues, Bioinformatics, 21, 4205-4208, (2005)
[54] Weishaupt H, Johansson P, Engström C, Nelander S, Silvestrov S, Swartling FJ (2016) Prediction of high centrality nodes from reverseengineered transcriptional regulator networks. In: SMTDA 2016 Proceedings: / 4th Stochastic Modeling Techniques and Data Analysis International Conference / [ed] H. Skiadas (Ed), ISAST: International Society for the Advancement of Science and Technology, pp 517?-531
[55] Welch BL (1947) The generalization ofstudent’s’ problem when several different population variances are involved. Biometrika, pp 28-35 · Zbl 0029.40802
[56] Wieczorek, G; Steinhoff, C; Schulz, R; Scheller, M; Vingron, M; Ropers, H-H; Nuber, UA, Gene expression profile of mouse bone marrow stromal cells determined by cdna microarray analysis, Cell Tissue Res, 311, 227-237, (2003)
[57] Wu, S; Liu, Z-P; Qiu, X; Wu, H, Modeling genome-wide dynamic regulatory network in mouse lungs with influenza infection using high-dimensional ordinary differential equations, PloS one, 9, e95276, (2014)
[58] Xu, J; Li, Y, Discovering disease-genes by topological features in human protein-protein interaction network, Bioinformatics, 22, 2800-2805, (2006)
[59] Yip, KY; Alexander, RP; Yan, K-K; Gerstein, M, Improved reconstruction of in silico gene regulatory networks by integrating knockout and perturbation data, PloS one, 5, e8121-e8121, (2010)
[60] Zhang M, Deng J, Fang CV, Zhang X, Lu L (2010) Molecular network analysis and applications. Knowledge-Based Bioinformatics, pp 253
[61] Zhang, X; Zhao, X-M; He, K; Lu, L; Cao, Y; Liu, J; Hao, J-K; Liu, Z-P; Chen, L, Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information, Bioinformatics, 28, 98-104, (2012)
[62] Zhu, X; Gerstein, M; Snyder, M, Getting connected: analysis and principles of biological networks, Genes Dev, 21, 1010-1024, (2007)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.