کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
469351 698310 2010 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Detection of over-represented motifs corresponding to known TFBSs via motif clustering and matching
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
Detection of over-represented motifs corresponding to known TFBSs via motif clustering and matching
چکیده انگلیسی

Detection of over-represented motifs corresponding to known TFBSs (Transcription Factor Binding Sites) is an important problem in biological sequences analysis. In this paper, a novel motif discovery method based on motif clustering and matching is proposed. Against a precompiled library of motifs described as position weight matrices (PWMs), eachL  -mer in the data set is matched to a motif base on the match score’s pp-value, and then the PWMs are updated and clustered according to their similarity. Motif features are ranked in terms of statistical significance (pp-value). We present an implementation of this approach, named MotifCM, which is capable of discovering multiple distinct motifs present in a single data set. We apply our method to the benchmark which has 56 data sets, and demonstrate that the performance of MotifCM on this data set compares well to, and in many cases exceeds, the performance of existing tools.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Mathematics with Applications - Volume 59, Issue 2, January 2010, Pages 779–786
نویسندگان
, ,