کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
523886 868518 2015 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
High Performance computing improvements on bioinformatics consistency-based multiple sequence alignment tools
ترجمه فارسی عنوان
بهبود عملکرد محاسباتی با کارایی بالا در ابزارهای هماهنگی توالی چندگانه مبتنی بر سازگاری بیوانفورماتیک
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• BGT method improves the execution time of progressive alignment by 62%.
• OLM decreases the memory requirements by 75%, aligning up twice more sequences than previous method.
• MTA improves the alignments accuracy, letting it to regain the quality lost due to the performance improvements.

Multiple Sequence Alignment (MSA) is essential for a wide range of applications in Bioinformatics. Traditionally, the alignment accuracy was the main metric used to evaluate the goodness of MSA tools. However, with the growth of sequencing data, other features, such as performance and the capacity to align larger datasets, are gaining strength. To achieve these new requirements, without affecting accuracy, the use of high-performance computing (HPC) resources and techniques is crucial. In this paper, we apply HPC techniques in T-Coffee, one of the more accurate but less scalable MSA tools. We integrate three innovative solutions into T-Coffee: the Balanced Guide Tree to increase the parallelism/performance, the Optimized Library Method with the aim of enhancing the scalability and the Multiple Tree Alignment, which explores different alignments in parallel to improve the accuracy. The results obtained show that the resulting tool, MTA-TCoffee, is able to improve the scalability in both the execution time and also the number of sequences to be aligned. Furthermore, not only is the alignment accuracy not affected by these improvements, as would be expected, but it improves significantly. Finally, we emphasize that the presented methods are not just restricted to T-Coffee, but may be implemented in any other alignment tools that use similar algorithms (progressive alignment, consistency or guide trees).

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 42, February 2015, Pages 18–34
نویسندگان
, , , , ,