کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
381995 660717 2016 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Microblog semantic context retrieval system based on linked open data and graph-based theory
ترجمه فارسی عنوان
سیستم معنایی بازیابی زمینه میکروبلاگ بر اساس داده های باز در ارتباط و نظریه مبتنی بر گراف
کلمات کلیدی
بازیابی اطلاعات؛ شباهت معنایی؛ داده های باز شده مرتبط است DB pedia؛ نام نهاد پیوند؛ مركز نمودار
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• We present a novel information retrieval system for context similarity retrieval in microblogging platforms.
• We present a method for extracting and linking entities to DBpedia concepts.
• We contextualize all matched concepts using graph centrality property by defining a new weighting factor.
• We present two algorithms which perform the semantic similarity by considering the weight of concepts and their related concepts.
• We use a real Twitter dataset to show the effectiveness of our system.

Microblogging platforms have emerged as large collections of short documents. In fact, the provision of an effective way to retrieve short text presents a significant research challenge owing to several factors: creative language usage, high contextualization, the informal nature of micro blog posts and the limited length of this form of communication. Thus, micro blogging retrieval systems suffer from the problems of data sparseness and the semantic gap. This makes it inadequate to accurately meet users’ information needs because users compose tweets using few terms and without query terms inside; thus, many relevant tweets will not be retrieved. To overcome the problems of data sparseness and the semantic gap, recent studies on content-based microblog searching have focused on adding semantics to micro posts by linking short text to knowledge bases resources. Moreover, previous studies use bag-of-concepts representation by linking named entities to their corresponding knowledge base concepts. However, bag-of-concepts representation considers only concepts that match named entities and supposes that all concepts are equivalent and independent. Thus, in this paper, we present a graph-of-concepts method that considers the relationships among concepts that match named entities in short text and their related concepts and contextualizes each concept in the graph by leveraging the linked nature of DBpedia as a Linked Open Data knowledge base and graph-based centrality theory. Furthermore, we propose a similarity measure that computes the similarity between two graphs (query-tweet) by considering the relationships between the contextualized concepts. Finally, we introduce some experiment results, using a real Twitter dataset, to expose the effectiveness of our system.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 53, 1 July 2016, Pages 138–148
نویسندگان
, , ,