کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
393047 665564 2013 20 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Frequency-based similarity for parameterized sequences: Formal framework, algorithms, and applications
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Frequency-based similarity for parameterized sequences: Formal framework, algorithms, and applications
چکیده انگلیسی

Computing sequence similarity is a problem emerging in several areas of research. Current solution algorithms are often based on alignment methods under the assumption that matching symbols, or at least a substitution schema among them, are known in advance. This is natural for sequences defined over the same alphabet of symbols. However, for sequences defined over different alphabets and in absence of an appropriate background knowledge, sequence similarity can be conveniently reconsidered from a different perspective where determining the best substitution schema is also part of the computation problem. The basic idea is that any symbol of a sequence can be correlated with many symbols of another, provided each correlation frequently occurs over the various positions of the alignment. This novel setting is formalized and relevant application domains fitting its peculiarities are illustrated. Moreover, the computational complexity of the alignment problems arising therein is analyzed, and practical solution approaches are proposed and validated over synthetic and real datasets.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 237, 10 July 2013, Pages 176–195
نویسندگان
, ,