Domain taxonomy learning from text: The subsumption method versus hierarchical clustering

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
378814	659221	2013	16 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Clustering - خوشه بندی Classification - طبقه بندی association rules - قوانین وابستگی Text mining - متن‌کاوی Ontologies - هستی شناسی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Domain taxonomy learning from text: The subsumption method versus hierarchical clustering

چکیده انگلیسی

This paper proposes a framework to automatically construct taxonomies from a corpus of text documents. This framework first extracts terms from documents using a part-of-speech parser. These terms are then filtered using domain pertinence, domain consensus, lexical cohesion, and structural relevance. The remaining terms represent concepts in the taxonomy. These concepts are arranged in a hierarchy with either the extended subsumption method that accounts for concept ancestors in determining the parent of a concept or a hierarchical clustering algorithm that uses various text-based window and document scopes for concept co-occurrences. Our evaluation in the field of management and economics indicates that a trade-off between taxonomy quality and depth must be made when choosing one of these methods. The subsumption method is preferable for shallow taxonomies, whereas the hierarchical clustering algorithm is recommended for deep taxonomies.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Data & Knowledge Engineering - Volume 83, January 2013, Pages 54–69

نویسندگان

Jeroen de Knijff, Flavius Frasincar, Frederik Hogenboom,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Domain taxonomy learning from text: The subsumption method versus hierarchical clustering

دسترسی سریع

ارتباط

English Website