کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1953130 1057252 2008 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Exploring an alignment free approach for protein classification and structural class prediction
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی زیست شیمی
پیش نمایش صفحه اول مقاله
Exploring an alignment free approach for protein classification and structural class prediction
چکیده انگلیسی

Alignment free methods based on Chaos Game Representation (CGR), also known as sequence signature approaches, have proven of great interest for DNA sequence analysis. Indeed, they have been successfully applied for sequence comparison, phylogeny, detection of horizontal transfers or extraction of representative motifs in regulation sequences. Transposing such methods to proteins poses several fundamental questions related to representation space dimensionality. Several studies have tackled these points, but none has, so far, brought the application of CGRs to proteins to their fully expected potential. Yet, several studies have shown that techniques based on n-peptide frequencies can be relevant for proteins. Here, we investigate the effectiveness of a strategy based on the CGR approach using a fixed reverse encoding of amino acids into nucleic sequences. We first explore its relevance to protein classification into functional families. We then attempt to apply it to the prediction of protein structural classes. Our results suggest that the reverse encoding approach could be relevant in both cases. We show that it is able to classify functional families of proteins by extracting signatures close to the ProSite patterns. Applied to structural classification, the approach reaches scores of correct classification close to 84%, i.e. close to the scores of related methods in the field. Various optimizations of the approach are still possible, which open the door for future applications.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Biochimie - Volume 90, Issue 4, April 2008, Pages 615–625
نویسندگان
, ,