کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
8408174 1545069 2018 43 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Ordering Protein Contact Matrices
ترجمه فارسی عنوان
سفارش ماتریس تماس با پروتئین
کلمات کلیدی
ماتریس تماس با پروتئین، پیش بینی بسته شدن نظریه گراف، برنامه نویسی دینامیک، نقشه خودمراقبتی،
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی بیوتکنولوژی یا زیست‌فناوری
چکیده انگلیسی
Numerous biophysical approaches provide information about residues spatial proximity in proteins. However, correct assignment of the protein fold from this proximity information is not straightforward if the spatially close protein residues are not assigned to residues in the primary sequence. Here, we propose an algorithm to assign such residue numbers by ordering the columns and lines of the raw protein contact matrix directly obtained from proximity information between unassigned amino acids. The ordering problem is formatted as the search of a trail within a graph connecting protein residues through the nonzero contact values. The algorithm performs in two steps: (i) finding the longest trail of the graph using an original dynamic programming algorithm, (ii) clustering the individual ordered matrices using a self-organizing map (SOM) approach. The combination of the dynamic programming and self-organizing map approaches constitutes a quite innovative point of the present work. The algorithm was validated on a set of about 900 proteins, representative of the sizes and proportions of secondary structures observed in the Protein Data Bank. The algorithm was revealed to be efficient for noise levels up to 40%, obtaining average gaps of about 20% at maximum between ordered and initial matrices. The proposed approach paves the ways toward a method of fold prediction from noisy proximity information, as TM scores larger than 0.5 have been obtained for ten randomly chosen proteins, in the case of a noise level of 10%. The methods has been also validated on two experimental cases, on which it performed satisfactorily.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational and Structural Biotechnology Journal - Volume 16, 2018, Pages 140-156
نویسندگان
, , , , , ,