کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
533205 870077 2016 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
One class proximal support vector machines
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
One class proximal support vector machines
چکیده انگلیسی


• We study the extraction of a target population from a dataset contaminated by outliers.
• To this end, we propose a new Fisher type contrast measure.
• We reconsider this problem from the formalism of proximal support vector machines.
• An approximation of the contrast measure is done using a conjugate gradient method.
• No matrix inversion is needed which lowers the computational complexity.

Recently in Dufrenois [1], a new Fisher type contrast measure has been proposed to extract a target population in a dataset contaminated by outliers. Although mathematically sound, this work presents some further shortcomings in both the formalism and the field of use. First, we propose to re-express this problem from the formalism of proximal support vector machines as introduced in Mangasarian and Wild [2]. This change is far from harmless since it introduces a suited writing for solving the problem. Another limiting factor of the method is that its performance relies on the assumption that the density between the target and outliers are different. This consideration can easily prove to be over-optimistic for real world datasets making the method unreliable, at least directly. The computation of the decision boundary is a time consuming part of the algorithm since it is based on solving a generalized eigenvalue problem (GEP). This method is therefore limited to medium sized data sets. In this paper, we propose appropriate strategies to unlock all these shortcomings and fully benefit from the interest of the approach. Firstly, we show under some conditions that generating appropriate artificial outliers allows to stay within the constraints of the method and thus enlarges the conditions of use. Secondly, we show that the GEP can be advantageously replaced by a conjugate gradient solution (CG) significantly decreasing the computational cost. Lastly, the proposed algorithm is compared with recent novelty detectors on synthetic and real datasets.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 52, April 2016, Pages 96–112
نویسندگان
, ,