کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
416832 681404 2013 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Analysis of presence-only data via semi-supervised learning approaches
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Analysis of presence-only data via semi-supervised learning approaches
چکیده انگلیسی

Presence-only data occur in a classification, which consist of a sample of observations from the presence class and a large number of background observations with unknown presence/absence. Since absence data are generally unavailable, conventional semi-supervised learning approaches are no longer appropriate as they tend to degenerate and assign all observations to the presence class. In this article, we propose a generalized class balance constraint, which can be equipped with semi-supervised learning approaches to prevent them from degeneration. Furthermore, to circumvent the difficulty of model tuning with presence-only data, a selection criterion based on classification stability is developed, which measures the robustness of any given classification algorithm against the sampling randomness. The effectiveness of the proposed approach is demonstrated through a variety of simulated examples, along with an application to gene function prediction.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational Statistics & Data Analysis - Volume 59, March 2013, Pages 134–143
نویسندگان
, ,