دانلود رایگان مقاله: طبقه بندی داده های متنی شبکه با نمونه های مثبت و بدون برچسب

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
535096	870320	2016	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Classifying networked text data with positive and unlabeled examples

ترجمه فارسی عنوان

طبقه بندی داده های متنی شبکه با نمونه های مثبت و بدون برچسب

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

یادگیری PU؛ داده های متن شبکه؛ تقسیم ماتریس؛ یادگیری نیمه نظارتی؛ خوشه بندی گراف

PU learning Matrix factorization - تقسیم ماتریس Graph clustering - خوشه بندی گراف Semi-supervised learning - یاگیری نیمه‌نظارتی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش مقاله

طبقه بندی داده های متنی شبکه با نمونه های مثبت و بدون برچسب

چکیده انگلیسی

• We present a NMF–based method for PU Learning of networked text data.
• Our algorithm integrates feature and network information via a consensus principle.
• Our method deals with networked data with extremely limited positive examples.
• We demonstrate the effectiveness of our algorithm.

The rapid growth in the number of networked applications that naturally generate complex text data, which contains not only inner features but also inter-dependent relations, has created the demand of efficiently classifying such data. Many classification algorithms have been proposed, but they usually require as input fully labeled text examples. In many networked applications, however, the cost to label a text data may be expensive and hence a large amount of text may be unlabeled. In this paper we study the problem of classifying networked text data with only positive and unlabeled examples available. We present a non-negative matrix factorization-based approach to networked text classification by factorizing content matrix of the nodes and topological network structures, and by incorporating supervised information into the learning of objective function via a consensus principle. We propose a novel learning algorithm, namely puNet (positive and unlabeled learning algorithm for Networked text data), for efficiently classifying networked text, even if training datasets contain only a small amount of positive examples and a large amount of unlabeled ones. We conduct a series of experiments on benchmark networked datasets and illustrate the effectiveness of our algorithm.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 77, 1 July 2016, Pages 1–7

نویسندگان

Mei Li, Shirui Pan, Yang Zhang, Xiaoyan Cai,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : طبقه بندی داده های متنی شبکه با نمونه های مثبت و بدون برچسب

دسترسی سریع

ارتباط

English Website