دانلود رایگان مقاله: طبقه بندی متن با استفاده از چند نمونه نشان داده شده

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
350724	618455	2014	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Text classification using a few labeled examples

ترجمه فارسی عنوان

طبقه بندی متن با استفاده از چند نمونه نشان داده شده

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

استخراج متن، طبقه بندی متن، استخراج مدت، موضوع احتمالی، مدل، داده کاوی

Term extraction - استخراج مدت Text classification - طبقه بندی متن Text mining - متن‌کاوی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش مقاله

طبقه بندی متن با استفاده از چند نمونه نشان داده شده

چکیده انگلیسی

• A graph of terms can be effectively used for text classification.
• Such a graph is extracted from documents thanks to a LDA based methodology.
• Proposed method achieves good performances on standard datasets.
• The approach can discover solutions matching user information needs.

Supervised text classifiers need to learn from many labeled examples to achieve a high accuracy. However, in a real context, sufficient labeled examples are not always available because human labeling is enormously time-consuming. For this reason, there has been recent interest in methods that are capable of obtaining a high accuracy when the size of the training set is small.In this paper we introduce a new single label text classification method that performs better than baseline methods when the number of labeled examples is small. Differently from most of the existing methods that usually make use of a vector of features composed of weighted words, the proposed approach uses a structured vector of features, composed of weighted pairs of words.The proposed vector of features is automatically learned, given a set of documents, using a global method for term extraction based on the Latent Dirichlet Allocation implemented as the Probabilistic Topic Model. Experiments performed using a small percentage of the original training set (about 1%) confirmed our theories.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers in Human Behavior - Volume 30, January 2014, Pages 689–697

نویسندگان

Francesco Colace, Massimo De Santo, Luca Greco, Paolo Napoletano,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : طبقه بندی متن با استفاده از چند نمونه نشان داده شده

دسترسی سریع

ارتباط

English Website