کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
509079 865479 2014 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Enhancing passage retrieval in log files by query expansion based on explicit and pseudo relevance feedback
ترجمه فارسی عنوان
افزایش بازیابی پاساژ در فایل های ورود به سیستم با گسترش پرس و جو بر اساس بازخورد صریح و پراکنده
کلمات کلیدی
بازیابی اطلاعات، بازیابی گذرگاه، پرسش پاسخ دادن، غنی سازی پرس و جو، یادگیری متن، فایل های ورودی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• In this paper, we present a new approach for enhancing the relevancy of queries during passage retrieval in log files.
• We determine the explicit relevance feedback by identifying the context of the requested information within a learning process.
• We assign a weight to terms according to their relatedness to queries.
• Experiments conducted on real data from logs and documents show that our query expansion protocol enables retrieval of relevant passages.

Passage retrieval is usually defined as the task of searching for passages which may contain the answer for a given query. While these approaches are very efficient when dealing with texts, applied to log files (i.e. semi-structured data containing both numerical and symbolic information) they usually provide irrelevant or useless results. Nevertheless one appealing way for improving the results could be to consider query expansions that aim at adding automatically or semi-automatically additional information in the query to improve the reliability and accuracy of the returned results. In this paper, we present a new approach for enhancing the relevancy of queries during a passage retrieval in log files. It is based on two relevance feedback steps. In the first one, we determine the explicit relevance feedback by identifying the context of the requested information within a learning process. The second step is a new kind of pseudo relevance feedback. Based on a novel term weighting measure it aims at assigning a weight to terms according to their relatedness to queries. This measure, called TRQ (Term Relatedness to Query), is used to identify the most relevant expansion terms.The main advantage of our approach is that is can be applied both on log files and documents from general domains. Experiments conducted on real data from logs and documents show that our query expansion protocol enables retrieval of relevant passages.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers in Industry - Volume 65, Issue 6, August 2014, Pages 937–951
نویسندگان
, , , ,