کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
974356 1480115 2016 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Gene-based and semantic structure of the Gene Ontology as a complex network
ترجمه فارسی عنوان
ساختار مبتنی بر ژن و معنایی از آنتولوژی ژنی به عنوان یک شبکه پیچیده
کلمات کلیدی
سیستم های پیچیده؛ شبکه های؛ سیستم دو طرفه؛ تشخیص جامعه؛ آنتولوژی ؛ ژن ها
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات فیزیک ریاضی
چکیده انگلیسی


• We study the projected network of terms starting from a bipartite terms/genes network.
• GO terms distinct from a semantic point of view might be linked in the above network.
• Such GO terms are in the same community when considering their gene content.
• This is important from a biomedical point of view, as it reveals relations amongst biological functions.

The last decade has seen the advent and consolidation of ontology based tools for the identification and biological interpretation of classes of genes, such as the Gene Ontology. The Gene Ontology (GO) is constantly evolving over time. The information accumulated time-by-time and included in the GO is encoded in the definition of terms and in the setting up of semantic relations amongst terms. Here we investigate the Gene Ontology from a complex network perspective. We consider the semantic network of terms naturally associated with the semantic relationships provided by the Gene Ontology consortium. Moreover, the GO is a natural example of bipartite network of terms and genes. Here we are interested in studying the properties of the projected network of terms, i.e. a gene-based weighted network of GO terms, in which a link between any two terms is set if at least one gene is annotated in both terms. One aim of the present paper is to compare the structural properties of the semantic and the gene-based network. The relative importance of terms is very similar in the two networks, but the community structure changes. We show that in some cases GO terms that appear to be distinct from a semantic point of view are instead connected, and appear in the same community when considering their gene content. The identification of such gene-based communities of terms might therefore be the basis of a simple protocol aiming at improving the semantic structure of GO. Information about terms that share large gene content might also be important from a biomedical point of view, as it might reveal how genes over-expressed in a certain term also affect other biological processes, molecular functions and cellular components not directly linked according to GO semantics.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Physica A: Statistical Mechanics and its Applications - Volume 458, 15 September 2016, Pages 313–328
نویسندگان
, , ,