دانلود رایگان مقاله: با استفاده از شبکه های ناهمگن دو طرفه برای سرعت بخشیدن به یادگیری نیمه نظارت مبتنی بر و بهبود طبقه بندی خودکار متن

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4946127	1439269	2017	37 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Using bipartite heterogeneous networks to speed up inductive semi-supervised learning and improve automatic text categorization

ترجمه فارسی عنوان

با استفاده از شبکه های ناهمگن دو طرفه برای سرعت بخشیدن به یادگیری نیمه نظارت مبتنی بر و بهبود طبقه بندی خودکار متن

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

طبقه بندی متن، یادگیری انتقالی یادگیری مبتنی بر گراف پخش برچسب، شبکه ناهمگن دو طرفه،

Text classification - طبقه بندی متن Label propagation - پخش برچسب Transductive learning - یادگیری انتقالی Graph-based learning - یادگیری مبتنی بر گراف

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

با استفاده از شبکه های ناهمگن دو طرفه برای سرعت بخشیدن به یادگیری نیمه نظارت مبتنی بر و بهبود طبقه بندی خودکار متن

چکیده انگلیسی

Due to the volume of texts available in digital form, the organization, management and knowledge extraction are laborious and frequently impossible to be handled. To automatically cope with these tasks, usually classification models are generated through supervised learning techniques. Unfortunately, this type of learning usually demands a huge human effort to label large volume of texts to build accurate classification models. Since collecting unlabeled texts is easy and inexpensive in several domains, the generation of classification models through inductive semi-supervised learning has been highlighted in recent years. Inductive semi-supervised learning allows to build a classification model using labeled and unlabeled texts. In this scenario, the goal is to augment the set of labeled documents with unlabeled documents to better discriminate class patterns. Hence, fewer texts must be previously labeled. However, semi-supervised learning algorithms that consider texts represented in a vector space model usually obtain unsatisfactory classification performances and are surpassed by semi-supervised learning algorithms that consider texts represented in a network. Nevertheless, despite the classification performances, effective approaches based on networks are generated through the similarities among documents and the classification of a new document are also based on the computation of similarities. This implies to set parameters and compute similarities to both generation the networks and classification of new documents. This approach is not feasible to generate fast responses and consequently to classify a huge volume of texts. In this article, we propose an approach to induce a classification model through semi-supervised learning considering text collections represented by bipartite heterogeneous networks. Bipartite networks are easily and quickly generated, leading to classification performance equivalent or better than other approaches based on network or vector space model and allows a fast classification of new documents. The results presented in this article demonstrate that the proposed approach is able to (i) speed up semi-supervised learning, (ii) speed up the classification of new documents and (iii) surpass classification performance of other existing inductive semi-supervised learning techniques.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 132, 15 September 2017, Pages 94-118

نویسندگان

Rafael Geraldeli Rossi, Alneu de Andrade Lopes, Solange Oliveira Rezende,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : با استفاده از شبکه های ناهمگن دو طرفه برای سرعت بخشیدن به یادگیری نیمه نظارت مبتنی بر و بهبود طبقه بندی خودکار متن

دسترسی سریع

ارتباط

English Website