دانلود رایگان مقاله: الگوریتم خوشه بندی تئوری کاربردی برای تشخیص عبارات از بالا به پایین

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
392610	665139	2014	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

A top-down information theoretic word clustering algorithm for phrase recognition

ترجمه فارسی عنوان

الگوریتم خوشه بندی تئوری کاربردی برای تشخیص عبارات از بالا به پایین

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Support vector machine - ماشین بردار پشتیبانی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

الگوریتم خوشه بندی تئوری کاربردی برای تشخیص عبارات از بالا به پایین

چکیده انگلیسی

• We propose an efficient KL-based word clustering algorithm for large-scale text collection.
• The word clusters were adopted as features to enhance the sequential taggers.
• The use of word clusters improves the prediction power (using statistical significant tests).

Semi-supervised machine learning methods have the features of both, integrating labeled and unlabeled training data. In most structural problems, such as natural language processing and image processing, developing labeled data for a specific domain requires considerable amount of human resources. In this paper, we present a cluster-based method to fuse labeled training and unlabeled raw data. We design a top-down divisive clustering algorithm that ensures maximal information gain in the use of unlabeled data via clustering similar words. To implement this idea, we design a top-down iterative K-means clustering algorithm to merge word clusters. Differently, the derived term groups are then encoded as new features for the supervised learners in order to improve the coverage of lexical information. Without additional training data or external materials, this approach yields state-of-the-art performance on the shallow parsing and base-chunking benchmark datasets (94.50 and 93.12 in F(β) rates).

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 275, 10 August 2014, Pages 213–225

نویسندگان

Yu-Chieh Wu,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : الگوریتم خوشه بندی تئوری کاربردی برای تشخیص عبارات از بالا به پایین

دسترسی سریع

ارتباط

English Website