Computational Protein Design and Large-Scale Assessment by I-TASSER Structure Assembly Simulations

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
2185220	1095966	2011	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

sequence clustering - خوشه توالی Protein structure prediction - پیش بینی ساختار پروتئین

موضوعات مرتبط

علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی بیولوژی سلول

پیش نمایش صفحه اول مقاله

Computational Protein Design and Large-Scale Assessment by I-TASSER Structure Assembly Simulations

چکیده انگلیسی

Protein design aims at designing new protein molecules of desired structure and functionality. One of the major obstacles to large-scale protein design are the extensive time and manpower requirements for experimental validation of designed sequences. Recent advances in protein structure prediction have provided potentials for an automated assessment of the designed sequences via folding simulations. We present a new protocol for protein design and validation. The sequence space is initially searched by Monte Carlo sampling guided by a public atomic potential, with candidate sequences selected by the clustering of sequence decoys. The designed sequences are then assessed by I-TASSER folding simulations, which generate full-length atomic structural models by the iterative assembly of threading fragments. The protocol is tested on 52 nonhomologous single-domain proteins, with an average sequence identity of 24% between the designed sequences and the native sequences. Despite this low sequence identity, three-dimensional models predicted for the first designed sequence have an RMSD of < 2 Å to the target structure in 62% of cases. This percentage increases to 77% if we consider the three-dimensional models from the top 10 designed sequences. Such a striking consistency between the target structure and the structural prediction from nonhomologous sequences, despite the fact that the design and folding algorithms adopt completely different force fields, indicates that the design algorithm captures the features essential to the global fold of the target. On average, the designed sequences have a free energy that is 0.39 kcal/(mol residue) lower than in the native sequences, potentially affording a greater stability to synthesized target folds.

Graphical AbstractFigure optionsDownload high-quality image (96 K)Download as PowerPoint slideResearch Highlights
► We developed a new protein design algorithm based on Monte Carlo free-energy minimization.
► We propose a new validation protocol of protein design by structure prediction.
► I-TASSER predicts the target fold for most designed sequences.
► Sequence clustering is more robust than energy in design selection.
► The sequence design promises to increase the stability of target folds.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Molecular Biology - Volume 407, Issue 5, 15 April 2011, Pages 764–776

نویسندگان

Andrea Bazzoli, Andrea G.B. Tettamanzi, Yang Zhang,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Computational Protein Design and Large-Scale Assessment by I-TASSER Structure Assembly Simulations

دسترسی سریع

ارتباط

English Website