کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
508792 865442 2015 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Exploring effective features for recognizing the user intent behind web queries
ترجمه فارسی عنوان
بررسی ویژگی های مؤثر برای شناختن نظر کاربر در پاسخ به درخواست های وب
کلمات کلیدی
جستجوی پرس و جو درک، طبقه بندی پرس و جو، تجزیه و تحلیل پرس و جو، قصد کاربر، تجربه ی کاربر، تجزیه و تحلیل ویژگی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Effective features for web query classification.
• Detecting user intents behind search queries.
• Natural language processing for search queries.
• Multi-class classification for web queries.

Automatically identifying the user intent behind web queries has started to catch the attention of the research community, since it allows search engines to enhance user experience by adapting results to that goal. It is broadly agreed that there are three archetypal intentions behind search queries: navigational, resource/transactional and informational.Thus, as a natural consequence, this task has been interpreted as a multi-class classification problem. At large, recent works have focused on comparing several machine learning methods built with words as features. Conversely, this paper examines the influence of assorted properties on three classification approaches. In particular, it focuses its attention on the contribution of linguistic-based attributes. However, most of natural language processing tools are designed for documents, not web queries. Therefore, as a means of bridging this linguistic gap, we benefited from caseless models, which are trained with traditionally labeled data, but all terms are converted to lowercase before their generation.Overall, tested attributes proved to be effective by improving on word-based classifiers by up to 8.347% (accuracy), and outperforming a baseline by up to 6.17%. Most notably, linguistic-oriented features, from caseless models, are shown to be instrumental in narrowing the linguistic gap between queries and documents.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers in Industry - Volume 68, April 2015, Pages 162–169
نویسندگان
,