Article code: 515560
Journal code: 867045
Publication year: 2013
English article: 15 pages, PDF
Full-text version: Free download
English title of the ISI article
Assessing user-specific difficulty of documents
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science Software
English abstract

On the web, a huge variety of text collections contain knowledge in different expertise domains, such as technology or medicine. The texts are written for different uses and thus for people having different levels of expertise on the domain. Texts intended for professionals may not be understandable at all by a lay person, and texts for lay people may not contain all the detailed information needed by a professional. Many information retrieval applications, such as search engines, would offer better user experience if they were able to select the text sources that best fit the expertise level of the user. In this article, we propose a novel approach for assessing the difficulty level of a document: our method assesses difficulty for each user separately. The method enables, for instance, offering information in a personalised manner based on the user’s knowledge of different domains. The method is based on the comparison of terms appearing in a document and terms known by the user. We present two ways to collect information about the terminology the user knows: by directly asking the users the difficulty of terms or, as a novel automatic approach, indirectly by analysing texts written by the users. We examine the applicability of the methodology with text documents in the medical domain. The results show that the method is able to distinguish between documents written for lay people and documents written for experts.
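
The abstract describes the method only at a high level: compare the terms in a document with the terms the user knows. The following is a minimal sketch of that idea, not the authors' actual scoring formula. It assumes that difficulty can be approximated as the share of distinct document terms missing from the user's vocabulary, and that the indirect user model is simply the set of words found in texts the user has written. The function names `tokenize`, `known_terms_from_writing`, and `difficulty` are hypothetical and chosen for illustration.

```python
import re


def tokenize(text):
    """Lowercase a text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def known_terms_from_writing(user_texts):
    """Indirect user model (assumption): every term the user has written counts as known."""
    known = set()
    for text in user_texts:
        known.update(tokenize(text))
    return known


def difficulty(document, known_terms):
    """Share of distinct document terms absent from the user's known terms.

    This ratio is an illustrative assumption, not the measure used in the paper.
    Returns 0.0 for a document with no terms.
    """
    terms = set(tokenize(document))
    if not terms:
        return 0.0
    return len(terms - known_terms) / len(terms)


# Hypothetical lay user, modelled from one sentence they wrote.
lay_user = known_terms_from_writing(["my heart beats fast and my head hurts"])
lay_score = difficulty("my heart beats fast", lay_user)
expert_score = difficulty("tachycardia and cephalalgia", lay_user)
```

Under these assumptions, the text written in lay vocabulary scores lower for this user than the text written in clinical vocabulary. A direct user model could be plugged in the same way by building `known_terms` from survey answers instead of from the user's own writing.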


► We propose a method for user-specific difficulty assessment.
► We present two approaches for user modelling: direct and indirect.
► Direct user modelling uses a survey to rate term difficulty.
► Indirect user modelling analyses text written by the users.
► Texts targeted at professionals are assessed more difficult than texts for lay people.

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Processing & Management - Volume 49, Issue 1, January 2013, Pages 198–212