کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
517837 867521 2010 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Semantic Space models for classification of consumer webpages on metadata attributes
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Semantic Space models for classification of consumer webpages on metadata attributes
چکیده انگلیسی

To deal with the quantity and quality issues with online healthcare resources, creating web portals centred on particular health topics and/or communities of users is a strategy to provide access to a reduced corpus of information resources that meet quality and relevance criteria. In this paper we use hyperspace analogue to language (HAL) to model the language use patterns of webpages as Semantic Spaces. We have applied machine learning methods, including support vector machine (SVM), decision forest, and a novel summed similarity measure (SSM) to automatically classify online webpages on their Semantic Space models. We find classification accuracy on metadata attributes to be over 93% for ‘medical’ versus ‘supportive’ perspective, over 92% for disease stage of ‘early’ versus ‘advanced’, and over 90% for author credentials of ‘lay’ versus ‘clinician’ based on webpages of the Breast Cancer Knowledge Online portal. These results indicate that language use patterns can be used to automate such classification with useful levels of accuracy.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Biomedical Informatics - Volume 43, Issue 5, October 2010, Pages 725–735
نویسندگان
, , ,