دانلود رایگان مقاله: نقوش هندسی در پروتئین ها جستجو می شود: یک رویکرد موازی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
523889	868518	2015	15 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Geometrical motifs search in proteins: A parallel approach

ترجمه فارسی عنوان

نقوش هندسی در پروتئین ها جستجو می شود: یک رویکرد موازی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

OpenMP Hough transform - تحریک هات Secondary structure - مدل بیماری کبد مرحله نهایی Data parallelism - موازی داده ها Proteins - پروتئین ها

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش مقاله

نقوش هندسی در پروتئین ها جستجو می شود: یک رویکرد موازی

چکیده انگلیسی

• We present CMS, an algorithm used to search for geometric motifs in proteins.
• Complete cross-proteins analysis calls for parallel processing.
• Data parallel problem decomposition favors a shared-memory implementation.
• OpenMP implementation meets expectations, but scales only up to 8 threads.
• Hybrid OpenMP/MPI approach required for further analysis.

The analysis of the 3D structures of proteins is a very important problem in life sciences, since the geometric set-up of proteins has a deep relevance in many biological processes. The complexity of the analysis and the continuous increase in the number of proteins whose 3D structure is known, call for efficient and quick algorithms. Parallel processing is becoming an enabling tool for such research. A key component in the geometric description of a protein is the structural motif, a 3D element which appears in a variety of molecules and is usually made of just a few simpler structures, the secondary structures elements (SSEs).This paper is an extended version of Ferretti and Musci (2013), and presents the Cross Motif Search (CMS) and the Complete CMS (CCMS) algorithms, two highly optimized and efficient parallel methods to detect the presence and location of all common motifs of secondary structures in a given protein pair (CMS) or across an arbitrary large dataset of proteins (CCMS). The analysis builds on existing approaches, such as Secondary Structure Co-Occurrences (SSC), based on the General Hough Transform (GHT) technique. The main difference between our proposal and the state of the art is the innovative focus that CMS puts on the geometric description of the structural motifs, which could be simply viewed as vectors in a 3D space, rather than on the topological/biological description employed by competing algorithms, such as ProSMoS, PROMOTIF or MASS. The advantage of a geometrical approach is that it enables to retrieve the exact location of the common substructures in a protein pair.The paper analyzes all possible forms of serial and parallelism optimization of the proposed algorithms, both shared memory and message passing. It introduces a complete parallel implementation of CMS, based on OpenMP, and discusses its scalability on shared-memory architectures. Both small-scale and medium-scale testing shows that the methods produces very interesting results in real applications, and scales nicely up to the eight-processor limit. More in-depth testing also shows that the scalability limit is due to the inner structure of the problem, and that the similarities among proteins and the chosen tolerance for the analysis highly affect the overall performance.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 42, February 2015, Pages 60–74

نویسندگان

Marco Ferretti, Mirto Musci,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : نقوش هندسی در پروتئین ها جستجو می شود: یک رویکرد موازی

دسترسی سریع

ارتباط

English Website