Article code: 403165
Journal code: 677060
Publication year: 2008
English article: 8 pages, PDF
Full-text version: free download
English title of the ISI article
A comparative study for content-based dynamic spam classification using four machine learning algorithms
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract

The growth in the number of email users has led to a dramatic increase in spam over the past few years. In this paper, four machine learning algorithms, namely Naïve Bayesian (NB), neural network (NN), support vector machine (SVM) and relevance vector machine (RVM), are proposed for spam classification. An empirical evaluation of them on benchmark spam filtering corpora is presented. The experiments are performed with different training set sizes and extracted feature sizes. Experimental results show that the NN classifier is unsuitable for use on its own as a spam rejection tool. In general, the SVM and RVM classifiers clearly outperform the NB classifier. Compared with SVM, RVM is shown to provide similar classification results with fewer relevance vectors and a much faster testing time. Despite its slower learning procedure, RVM is more suitable than SVM for spam classification in applications that require low complexity.

Publisher
Database: Elsevier - ScienceDirect
Journal: Knowledge-Based Systems - Volume 21, Issue 4, May 2008, Pages 355–362