کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6926133 1448891 2018 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Knowledge based collection selection for distributed information retrieval
ترجمه فارسی عنوان
انتخاب مجموعه دانش برای بازیابی اطلاعات توزیع شده
کلمات کلیدی
انتخاب مجموعه، بازیابی اطلاعات توزیع شده، دانش محور، گسترش پرس و جو،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی
Recent years have seen a great deal of work on collection selection. Most collection selection methods use central sample index (CSI) that consists of some documents sampled from each collection as collection description. The limitations of these methods are the usage of 'flat' meaning representations that ignore structure and relationships among words in CSI, and the calculation of query-collection similarity metric that ignore semantic distance between query words and indexed words. In this paper, we propose a knowledge based collection selection method (KBCS) to improve collection representation and query-collection similarity metric. KBCS models a collection as a weighted entity set and applies a novel query-collection similarity metric to select highly scored collections. Specifically, in the part of collection representation, context- and structure-based measures are employed to weight the semantic distance between two entities extracted from the sampled documents of a collection. In addition, the novel query-collection similarity metric takes the entity weight, collection size, and other factors into account. To enrich concepts contained in a query, DBpedia based query expansion is integrated. Finally, extensive experiments were conducted on a large webpage dataset, and DBpedia was chosen as the graph knowledge base. Experimental results demonstrate the effectiveness of KBCS.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 54, Issue 1, January 2018, Pages 116-128
نویسندگان
, , ,