Article ID Journal Published Year Pages File Type
535998 Pattern Recognition Letters 2011 11 Pages PDF
Abstract

In their arms race against developers of spam filters, spammers have recently introduced the image spam trick to make the analysis of emails’ body text ineffective. It consists in embedding the spam message into an attached image, which is often randomly modified to evade signature-based detection, and obfuscated to prevent text recognition by OCR tools. Detecting image spam turns out to be an interesting instance of the problem of content-based filtering of multimedia data in adversarial environments, which is gaining increasing relevance in several applications and media. In this paper we give a comprehensive survey and categorisation of computer vision and pattern recognition techniques proposed so far against image spam, and make an experimental analysis and comparison of some of them on real, publicly available data sets.

► Survey and categorization of state of the art image spam filtering techniques. ► Experimental comparison of the main image spam filtering techniques. ► Fusion of image spam filtering techniques based on OCR and image classification. ► Discussion on the vulnerability of image spam filtering techniques.

Keywords
Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, , , ,