An automatic approach for ontology-based feature extraction from heterogeneous textual resources

Article code	Journal code	Publication year	English article	Full-text version
380778	1437459	2013	15-page PDF	Free download


Keywords

IE, Information extraction; Feature extraction; Ontologies; Wikipedia

Related topics

Engineering and Basic Sciences; Computer Engineering; Artificial Intelligence


Abstract

Data mining algorithms such as data classification or clustering methods exploit features of entities to characterise, group or classify them according to their resemblance. In the past, many feature extraction methods focused on the analysis of numerical or categorical properties. In recent years, motivated by the success of the Information Society and the WWW, which has made available enormous amounts of textual electronic resources, researchers have proposed semantic data classification and clustering methods that exploit textual data at a conceptual level. To do so, these methods rely on pre-annotated inputs in which text has been mapped to its formal semantics according to one or several knowledge structures (e.g. ontologies, taxonomies). Hence, they are hampered by the bottleneck introduced by the manual semantic mapping process. To tackle this problem, this paper presents a domain-independent, automatic and unsupervised method to detect relevant features from heterogeneous textual resources, associating them with concepts modelled in a background ontology. The method has been applied to raw text resources and also to semi-structured ones (Wikipedia articles). It has been tested in the tourism domain, showing promising results.

► A general method to extract features from textual descriptions according to an ontology.
► A method to automatically detect, extract and filter relevant named entities.
► A method to associate named entities to ontological classes in an unsupervised manner.
► The methods have been adapted to raw texts and also to semi-structured Wikipedia articles.
► Evaluations performed over sources with touristic information show promising results.
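
The pipeline summarised above (detect candidate named entities, filter them, then associate the survivors with ontology classes) can be illustrated with a deliberately small Python sketch. It is not the authors' method: the abstract does not say how entities are detected or scored, so the capitalisation heuristic, the cue-word overlap score, the `TOY_ONTOLOGY` dictionary and all function names below are assumptions made purely for illustration, and the sample text is invented.

```python
import re

# Hypothetical toy ontology (class name -> lexical cues); not taken from the paper.
TOY_ONTOLOGY = {
    "Museum": {"museum", "gallery"},
    "ReligiousBuilding": {"cathedral", "church", "basilica"},
    "Beach": {"beach", "coast"},
}

# Capitalised function words that may open a sentence but are not part of a name.
STOPWORDS = {"The", "A", "An", "It", "This", "In"}

# Runs of capitalised words: a crude stand-in for real named-entity detection.
CANDIDATE = re.compile(r"\b[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*")


def extract_candidates(sentence):
    """Return capitalised word runs in one sentence, after a simple filter."""
    candidates = []
    for match in CANDIDATE.finditer(sentence):
        words = match.group().split()
        # Strip leading function words such as "The".
        while words and words[0] in STOPWORDS:
            words.pop(0)
        # Filter: a lone capitalised word opening the sentence is usually not a name.
        if not words or (match.start() == 0 and len(words) == 1):
            continue
        candidates.append(" ".join(words))
    return candidates


def associate(text, ontology):
    """Map each candidate entity to the class whose cues overlap its sentence most."""
    features = {}
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        tokens = set(re.findall(r"[a-z]+", sentence.lower()))
        for entity in extract_candidates(sentence):
            scores = {cls: len(cues & tokens) for cls, cues in ontology.items()}
            best = max(scores, key=scores.get)
            # Entities with no supporting cue are discarded as irrelevant.
            if scores[best] > 0:
                features[entity] = best
    return features


if __name__ == "__main__":
    sample = (
        "The Tarragona Cathedral is an old church in the city centre. "
        "Visitors often walk to Miracle Beach nearby. "
        "Later they met Anna."
    )
    print(associate(sample, TOY_ONTOLOGY))
```

No labelled training data is involved, which mirrors the unsupervised setting the abstract describes, but a cue-word overlap is far weaker evidence than a real system would need.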

Publisher

Database: Elsevier - ScienceDirect
Journal: Engineering Applications of Artificial Intelligence - Volume 26, Issue 3, March 2013, Pages 1092–1106

Authors

Carlos Vicient, David Sánchez, Antonio Moreno
