دانلود رایگان مقاله: دو پرنده با یک سنگ: طبقه بندی نمونه های مثبت و بدون برچسب در جریان داده های نامشخص

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6864739	1439550	2018	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Two birds with one stone: Classifying positive and unlabeled examples on uncertain data streams

ترجمه فارسی عنوان

دو پرنده با یک سنگ: طبقه بندی نمونه های مثبت و بدون برچسب در جریان داده های نامشخص

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

PU learning ELM - الم Cluster - خوشه Ensemble classifier - طبقه بندی را تنظیم کنید Concept drift - مفهوم رانش

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

دو پرنده با یک سنگ: طبقه بندی نمونه های مثبت و بدون برچسب در جریان داده های نامشخص

چکیده انگلیسی

An important feature characteristic of the data streams in many of today's big data applications is the intrinsic uncertainty, which could happen for both item occurrence and attribute value. While this has already posed great challenges for fundamental data mining tasks such as classification, things are made even more complicated by the fact that completely-labeled examples are usually unavailable in such settings, leaving researchers the only option to learn classifiers on partially-labeled examples on uncertain data streams. Furthermore, there will be concept drift on evolving data streams. To address these challenges, this paper therefore focuses on the study of learning from positive and unlabeled examples (PU) on uncertain data streams. To the best of our knowledge, this paper is the first work to address the uncertainty issue in both item occurrence (occurrence level) and attribute value (attribute level) for the problem of PU learning over streaming data. Firstly, we propose an algorithm to classify positive and unlabeled examples with both uncertainties (PUU). The algorithm extracts reliable positive and negative examples by clustering-based method, and then trains the classifier with Weighted Extreme Learning Machine (Weighted ELM). Secondly, we propose an algorithm of PU learning over uncertain data streams (PUUS). It adopts ensemble model and trains base classification by PUU. In order to detect concept drift, PUUS uses cluster set similarities between the current data block and history data block. We propose different update strategies for different concept drift to adapt to evolving uncertain data streams. Experimental results show that PUUS can effectively classify uncertain data streams with just positive and unlabeled data, while achieving in the meantime good performance in detecting and handling concept drifts.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 277, 14 February 2018, Pages 149-160

نویسندگان

Han Donghong, Li Shuoru, Wei Fulin, Tang Yuying, Zhu Feida, Wang Guoren,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : دو پرنده با یک سنگ: طبقه بندی نمونه های مثبت و بدون برچسب در جریان داده های نامشخص

دسترسی سریع

ارتباط

English Website