Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4960851 | Procedia Computer Science | 2017 | 6 Pages |
Abstract
The increasing volume of email has led to growing problems caused by unsolicited email, commonly referred to as Spam. One of the most commonly used representations in Spam filters is BoW (Bag-of-Words). However, this approach has a number of weaknesses: word order is lost, so different emails that use the same words can have the same representation, and the relationships between words are ignored, which can lead to poor performance. This paper proposes a new Spam filter based on PV-DM (Paragraph Vector-Distributed Memory) in order to overcome the limitations of the BoW representation.
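The word-order weakness described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the `bag_of_words` helper and the two sample messages are hypothetical, and tokenization is a plain whitespace split chosen for brevity.

```python
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Hypothetical helper: count lowercase whitespace-separated tokens.

    Only the token counts are kept, so word order is discarded.
    """
    return Counter(text.lower().split())


# Two made-up messages that use exactly the same words in a different
# order and therefore ask for different things.
msg_a = "send me the invoice not the password"
msg_b = "send me the password not the invoice"

bow_a = bag_of_words(msg_a)
bow_b = bag_of_words(msg_b)

# The texts differ, yet their BoW representations are identical.
print(msg_a == msg_b)   # False
print(bow_a == bow_b)   # True
```

Any classifier that sees only these counts cannot tell the two messages apart, which is the limitation a paragraph-vector representation such as PV-DM is intended to address.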
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)
Authors
Samira Douzi, Meryem Amar, Bouabid El Ouahidi, Hicham Laanaya