Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10330121 | Future Generation Computer Systems | 2005 | 5 Pages |
Abstract
One of the fundamental components of large-scale gene discovery projects is that of clustering of expressed sequence tags (ESTs) from complementary DNA (cDNA) clone libraries. Clustering is used to create non-redundant catalogs and indices of these sequences. In particular, clustering of ESTs is frequently used to estimate the number of genes derived from cDNA-based gene discovery efforts. This paper presents a novel parallel extension to an EST clustering program, UIcluster4, that incorporates alternative splicing information and a new parallelization strategy. The results are compared to other parallelized EST clustering systems in terms of overall processing time and in accuracy of the resulting clustering.
Related Topics
Physical Sciences and Engineering
Computer Science
Computational Theory and Mathematics
Authors
Todd E. Scheetz, Nishank Trivedi, Kevin T. Pedretti, Terry A. Braun, Thomas L. Casavant,