کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4495926 1623819 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words
ترجمه فارسی عنوان
دنباله های طبیعی و پروتئینی تصادفی: کشف خواص ترکیبی در کلمات اسید آمینه
کلمات کلیدی
دنباله پروتئین، توالی تصادفی، ترکیبی از کلمات، انجمن آمینو اسید
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
چکیده انگلیسی


• We developed various measures of associativity between pairs of amino acids in sequences.
• We study amino acid associativity distributions for natural and random sequences.
• We adopt a machine learning approach based on the association values between couples of amino acids to separate natural sequences from random ones.

Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambiguous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one.In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Theoretical Biology - Volume 391, 21 February 2016, Pages 13–20
نویسندگان
, , ,