کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4962179 1446526 2016 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Unsupervised Concept Hierarchy Learning: A Topic Modeling Guided Approach
ترجمه فارسی عنوان
یادگیری سلسله مراتبی مفهوم بی نظیر: رویکرد هدایت مدل سازی موضوع
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی
This paper proposes an efficient and scalable method for concept extraction and concept hierarchy learning from large unstructured text corpus which is guided by a topic modeling process. The method leverages “concepts” from statistically discovered “topics” and then learns a hierarchy of those concepts by exploiting a subsumption relation between them. Advantage of the proposed method is that the entire process falls under the unsupervised learning paradigm thus the use of a domain specific training corpus can be eliminated. Given a massive collection of text documents, the method maps topics to concepts by some lightweight statistical and linguistic processes and then probabilistically learns the subsumption hierarchy. Extensive experiments with large text corpora such as BBC News dataset and Reuters News corpus shows that our proposed method outperforms some of the existing methods for concept extraction and efficient concept hierarchy learning is possible if the overall task is guided by a topic modeling process.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 89, 2016, Pages 386-394
نویسندگان
, , ,