Article code  Journal code  Publication year  English article  Full-text version
431923  688658  2012  11 pages (PDF)  Free download
English title of the ISI article
A high performance multiple sequence alignment system for pyrosequencing reads from multiple reference genomes
Related topics
Engineering and Basic Sciences, Computer Engineering, Computational Theory and Mathematics
English abstract

Genome resequencing with short reads generated from pyrosequencing generally relies on mapping the short reads against a single reference genome. However, mapping of reads from multiple reference genomes is not possible using a pairwise mapping algorithm. Existing multiple sequence alignment (MSA) methods cannot be used to align the reads with respect to each other and the reference genomes, because they do not take into account the position of these short reads with respect to the genome and are highly inefficient for a large number of sequences. In this paper, we develop a highly scalable parallel algorithm based on domain decomposition, referred to as P-Pyro-Align, to align such a large number of reads from single or multiple reference genomes. The proposed alignment algorithm accurately aligns the erroneous reads and has been implemented on a cluster of workstations using the MPI library. Experimental results for different problem sizes are analyzed in terms of execution time, quality of the alignments, and the ability of the algorithm to handle reads from multiple haplotypes. We report high-quality multiple alignment of up to 0.5 million reads. The algorithm is shown to be highly scalable and exhibits super-linear speedups with an increasing number of processors.
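
The abstract does not spell out how the domain decomposition is performed. As a rough illustration only, the sketch below assumes that each read already carries an approximate start position on the reference genome and splits the genome into contiguous, near-equal intervals, one per worker. The function name `decompose_by_position`, the `(read_id, start)` tuples, and the equal-width binning are all assumptions made for this example, not the P-Pyro-Align method itself.

```python
from collections import defaultdict


def decompose_by_position(reads, genome_length, num_workers):
    """Group (read_id, start) pairs into contiguous genome intervals.

    Hypothetical helper: equal-width binning is an assumption made for
    illustration, not the decomposition used in the paper.
    """
    # Ceiling division so the intervals cover the whole genome.
    width = -(-genome_length // num_workers)
    domains = defaultdict(list)
    for read_id, start in reads:
        # Clamp so a read starting at the very end stays in the last domain.
        worker = min(start // width, num_workers - 1)
        domains[worker].append((read_id, start))
    # Sort each domain by position so neighbouring reads sit together.
    return {w: sorted(domains[w], key=lambda r: r[1]) for w in range(num_workers)}


# Toy input: four reads on a genome of length 100, split across 4 workers.
example_reads = [("r1", 5), ("r2", 95), ("r3", 40), ("r4", 45)]
example_domains = decompose_by_position(example_reads, 100, 4)
```

Each worker could then align only the reads in its own interval, which is the general idea behind position-based decomposition; how reads that straddle interval boundaries are handled is left out of this sketch.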


► A domain decomposition strategy for multiple alignments of pyrosequencing short reads.
► Domain decomposition & characteristics from pyroreads exploited to enhance parallelism.
► Experiments involve a large number of reads (up to 0.5 million) from multiple genomes.
► Algorithm (using MPI) exhibits superlinear speedups for a large number of reads.
► Rigorous quality assessment comparing multiple alignments and mapping.
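
For readers unfamiliar with the term, speedup on p processors is conventionally the serial run time divided by the parallel run time, and it is called superlinear when it exceeds p. The short sketch below illustrates that definition; the function names and the timings used in the checks are hypothetical and are not measurements from the paper.

```python
def speedup(t_serial, t_parallel):
    """Conventional speedup: serial run time over parallel run time."""
    return t_serial / t_parallel


def is_superlinear(t_serial, t_parallel, num_procs):
    """True when the speedup exceeds the number of processors used."""
    return speedup(t_serial, t_parallel) > num_procs
```

With hypothetical timings of 100 s serially and 10 s on 8 processors, the speedup is 10, which is greater than 8 and therefore superlinear.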

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Parallel and Distributed Computing - Volume 72, Issue 1, January 2012, Pages 83–93