کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
10351476 864471 2013 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Frequent patterns mining in multiple biological sequences
ترجمه فارسی عنوان
استخراج الگوهای مکرر در توالی های بیولوژیکی چندگانه
کلمات کلیدی
دنباله زیستی، الگوی اولیه، معدن الگوی مکرر، درخت پیشوند،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی
Existing algorithms for mining frequent patterns in multiple biosequences may generate multiple projected databases and short candidate patterns, which can increase computation time and memory requirement. In order to overcome such shortcomings, we propose a fast and efficient algorithm for mining frequent patterns in multiple biological sequences (MSPM). We first present the concept of a primary pattern, which can be extended to form larger patterns in the sequence. To detect frequent primary patterns, a prefix tree is constructed. Based on this prefix tree, a pattern-extending approach is also presented to mine frequent patterns without producing a large number of irrelevant candidate patterns. The experimental results show that the MSPM algorithm can achieve not only faster speed, but also higher quality results as compared with other methods.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers in Biology and Medicine - Volume 43, Issue 10, 1 October 2013, Pages 1444-1452
نویسندگان
, ,