کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
2815924 1159901 2015 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Semantic similarity measurement between gene ontology terms based on exclusively inherited shared information
ترجمه فارسی عنوان
اندازه گیری شباهت معنایی بین اصطلاحات آنتولوژی ژنی براساس اطلاعات به اشتراک گذاشته شده به طور منحصر به فرد به ارث رسیده است
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی ژنتیک
چکیده انگلیسی


• A semantic similarity measurement between two GO terms is proposed.
• The similarity value is quantified by the information shared by two terms.
• The measurement takes into account multiple common ancestors that are exoteric inherited by either term exclusively.
• The algorithm is effective with time complexity of O(n).
• The results on real dataset support the prior knowledge of biological pathway.

Quantifying the semantic similarities between pairs of terms in the Gene Ontology (GO) structure can help to explore the functional relationships between biological entities. A common approach to this problem is to measure the information they have in common based on the information content of their common ancestors. However, many studies have their limitations in measuring the information two GO terms share. This study presented a new measurement, exclusively inherited shared information (EISI) that captured the information shared by two terms based on an intuitive observation on the multiple inheritance relationships among the terms in the GO graph. EISI was derived from the information content of the exclusively inherited common ancestors (EICAs), which were screened from the common ancestors according to the attribute of their direct children. The effectiveness of EISI was evaluated against some state-of-the-art measurements on both artificial and real datasets, it produced more relevant results with experts' scores on the artificial dataset, and supported the prior knowledge of gene function in pathways on the Saccharomyces genome database (SGD). The promising features of EISI are the following: (1) it provides a more effective way to characterize the semantic relationship between two GO terms by taking into account multiple common ancestors related, and (2) can quickly detect all EICAs with time complexity of O(n), which is much more efficient than other methods based on disjunctive common ancestors. It is a promising alternative to multiple inheritance based methods for practical applications on large-scale dataset. The algorithm EISI was implemented in Matlab and is freely available from http://treaton.evai.pl/EISI/.

Figure optionsDownload high-quality image (280 K)Download as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Gene - Volume 558, Issue 1, 1 March 2015, Pages 108–117
نویسندگان
, ,