کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
391923 664567 2016 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Toward a multilevel representation of protein molecules: Comparative approaches to the aggregation/folding propensity problem
ترجمه فارسی عنوان
به نمایندگی چند سطحی از مولکول های پروتئین: رویکردهای مقایسه ای برای مسئله گرایش تجمع / انقباض
کلمات کلیدی
تجمع پروتئین، پروتئین تاشو، رابطه ساختار پیوندی، طبقه بندی داده های ساخت یافته
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Computational Intelligence methods are typically designed according to the assumption that the input space is essentially a vector space. When departing from vector-based pattern representations many theoretical and practical problems arise, which are mostly due to the absence of an intuitive geometric interpretation of the data. However, since such representations could offer additional insights when used in real-world applications of data-driven inference systems, their exploitation is also a practical and convenient choice. Here we apply several state-of-the-art classification methods for non-geometric data, with the aim to compare different representations of the proteins gathered from Niwa et al. (2009) [35]. Such representations include sequences of objects and labeled (contact) graphs enriched with chemico-physical attributes. The experiment performed by Niwa et al. provides the unique possibility to analyze the relative aggregation/folding propensity of the elements of the entire Escherichia coli (E. coli) proteome in a cell-free, standardized microenvironment. By this comparison, we are able to identify also some interesting general properties of proteins. Notably, (i) we suggest a threshold around 250 residues discriminating “easily foldable” from “hardly foldable” molecules consistent with other independent experiments, and (ii) we highlight the relevance of contact graph spectra for folding behavior discrimination and characterization of the E. coli solubility data. The soundness of the experimental results presented in this paper is proved by the statistically relevant relationships discovered among the chemico-physical description of proteins and the developed cost matrix of substitution that we used in the various discrimination systems.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 326, 1 January 2016, Pages 134–145
نویسندگان
, , ,