کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
515562 867045 2013 26 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Assessing the quality of textual features in social media
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Assessing the quality of textual features in social media
چکیده انگلیسی

Social media is increasingly becoming a significant fraction of the content retrieved daily by Web users. However, the potential lack of quality of user generated content poses a challenge to information retrieval services, which rely mostly on textual features generated by users (particularly tags) commonly associated with the multimedia objects. This paper presents what, to the best of our knowledge, is currently the most comprehensive study of the relative quality of textual features in social media. We analyze four different features, namely, title, tags, description and comments posted by users, in four popular applications, namely, YouTube, Yahoo! Video, LastFM and CiteULike. Our study is based on an extensive characterization of data crawled from the four applications with respect to usage, amount and semantics of content, descriptive and discriminative power as well as content and information diversity across features. It also includes a series of object classification and tag recommendation experiments as case studies of two important information retrieval tasks, aiming at analyzing how these tasks are affected by the quality of the textual features. Classification and recommendation effectiveness is analyzed in light of our characterization results. Our findings provide valuable insights for future research and design of Web 2.0 applications and services.


► We present a thorough study of the quality of textual features for Web 2.0 services.
► We analyze four features (title, tags, description, comments) in four applications.
► We characterize content usage, amount, semantics, descriptive/discriminative powers.
► We also characterize content and information diversities across features.
► We further analyze feature quality for supporting classification and tag recommendation services.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 49, Issue 1, January 2013, Pages 222–247
نویسندگان
, , , , , , ,