دانلود رایگان مقاله: خوشه های برچسب گذاری از دیدگاه زبانی و آماری: رویکرد ترکیبی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
403536	677260	2015	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Labeling clusters from both linguistic and statistical perspectives: A hybrid approach

ترجمه فارسی عنوان

خوشه های برچسب گذاری از دیدگاه زبانی و آماری: رویکرد ترکیبی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

برچسب زدن خوشه، تجزیه وابستگی، نمره حساس به متن، یادگیری قانون استخراج عبارت

Cluster labeling - برچسب زدن خوشه Dependency parsing - تجزیه وابستگی Rule learning - یادگیری قانون

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

خوشه های برچسب گذاری از دیدگاه زبانی و آماری: رویکرد ترکیبی

چکیده انگلیسی

Document clustering refers to grouping similar documents together automatically. Labels of the clusters, usually edited manually, are helpful for users to quickly grasp the major meaning of the grouped documents. Therefore, high quality labels are desired in many user-facing applications. However, assigning the labels manually is time consuming and tedious. In this paper a hybrid approach is proposed to automate the labeling process. First, linguistic knowledge are used to ensure candidate labels’ readability and information quantity by exploring the dependencies between words. Second, a statistical generative model is proposed to select representative labels. It scores a label w.r.t. a cluster by estimating how likely the cluster is generated by the label. The proposed approach is evaluated on two data sets in both English and Chinese. Experimental results show that the proposed approach produces high quality labels and outperforms existing state-of-art methods on both manual and automatic evaluations.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 76, March 2015, Pages 219–227

نویسندگان

Zhixing Li, Juanzi Li, Yi Liao, Siqiang Wen, Jie Tang,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : خوشه های برچسب گذاری از دیدگاه زبانی و آماری: رویکرد ترکیبی

دسترسی سریع

ارتباط

English Website