Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6487097 | Computational Biology and Chemistry | 2015 | 28 Pages |
Abstract
Random walk on heterogeneous networks is a recently emerging approach to effective disease gene prioritization. Laplacian normalization is a technique capable of normalizing the weight of edges in a network. We use this technique to normalize the gene matrix and the phenotype matrix before the construction of the heterogeneous network, and also use this idea to define the transition matrices of the heterogeneous network. Our method has remarkably better performance than the existing methods for recovering known gene-phenotype relationships. The Shannon information entropy of the distribution of the transition probabilities in our networks is found to be smaller than the networks constructed by the existing methods, implying that a higher number of top-ranked genes can be verified as disease genes. In fact, the most probable gene-phenotype relationships ranked within top 3 or top 5 in our gene lists can be confirmed by the OMIM database for many cases. Our algorithms have shown remarkably superior performance over the state-of-the-art algorithms for recovering gene-phenotype relationships. All Matlab codes can be available upon email request.
Keywords
Related Topics
Physical Sciences and Engineering
Chemical Engineering
Bioengineering
Authors
Zhi-Qin Zhao, Guo-Sheng Han, Zu-Guo Yu, Jinyan Li,