کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
385496 660867 2007 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An HMM for detecting spam mail
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
An HMM for detecting spam mail
چکیده انگلیسی

Hidden Markov Models, or HMMs for short, have been recently used in Bioinformatics for the classification of DNA or protein chains, giving rise to what is known as Profile Hidden Markov Models. In this paper, we show that these models can also be adapted to the problem of classifying misspelled words by identifying its primary structure through statistical tools. This process leads to a new learning algorithm which is based in the parametrization of the set of recognizable words in order to detect any misspelled form of these words. As an application, a method to classify spam mails by means of the detection of the adulterated words, from a blacklist of words frequently used by spammers, is described.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 33, Issue 3, October 2007, Pages 667–682
نویسندگان
, ,