کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
2815090 1159849 2016 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Exploring information from the topology beneath the Gene Ontology terms to improve semantic similarity measures
ترجمه فارسی عنوان
بررسی اطلاعات از توپولوژی زیر اصطلاحات هسته شناسی ژنی برای بهبود معیارهای شباهت معنایی
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی ژنتیک
چکیده انگلیسی


• A framework to explore semantic similarity measurement beneath two GO terms is proposed.
• The similarity measurement takes into account both shared information and the probability that two terms co-occur with their common descendant.
• A mirror model is proposed to quantify the information two terms share in regard with their common descendant.
• The probability of co-occurrence is computed in an effective way.
• The descending similarity contributes to better semantic similarity measure.

Measuring the similarity between pairs of biological entities is important in molecular biology. The introduction of Gene Ontology (GO) provides us with a promising approach to quantifying the semantic similarity between two genes or gene products. This kind of similarity measure is closely associated with the GO terms annotated to biological entities under consideration and the structure of the GO graph. However, previous works in this field mainly focused on the upper part of the graph, and seldom concerned about the lower part. In this study, we aim to explore information from the lower part of the GO graph for better semantic similarity. We proposed a framework to quantify the similarity measure beneath a term pair, which takes into account both the information two ancestral terms share and the probability that they co-occur with their common descendants. The effectiveness of our approach was evaluated against seven typical measurements on public platform CESSM, protein–protein interaction and gene expression datasets. Experimental results consistently show that the similarity derived from the lower part contributes to better semantic similarity measure. The promising features of our approach are the following: (1) it provides a mirror model to characterize the information two ancestral terms share with respect to their common descendant; (2) it quantifies the probability that two terms co-occur with their common descendant in an efficient way; and (3) our framework can effectively capture the similarity measure beneath two terms, which can serve as an add-on to improve traditional semantic similarity measure between two GO terms. The algorithm was implemented in Matlab and is freely available from http://ejl.org.cn/bio/GOBeneath/.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Gene - Volume 586, Issue 1, 15 July 2016, Pages 148–157
نویسندگان
, ,