کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
514941 866917 2016 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Unsupervised adaptive microblog filtering for broad dynamic topics
ترجمه فارسی عنوان
فیلترینگ میکروبلاگ تطبیقی بدون نظارت در مورد موضوعات عمومی پویا
کلمات کلیدی
توییتر؛ فیلتر میکروبلاگ. فیلتر تطبیقی بدون نظارت؛ موضوعات گسترده پویا ؛ توییت عربی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Broad topics on Twitter are highly dynamic.
• Boolean filtering retrieve high precision but limited number of tweets.
• A proposed adaptive filtering achieved 84% gain in recall with slight drop in prec.
• Proposed method showed robustness over time, across domains, and query formulations.
• Our method is currently adopted in a live service that follows news from Twitter.

Information filtering has been a major task of study in the field of information retrieval (IR) for a long time, focusing on filtering well-formed documents such as news articles. Recently, more interest was directed towards applying filtering tasks to user-generated content such as microblogs. Several earlier studies investigated microblog filtering for focused topics. Another vital filtering scenario in microblogs targets the detection of posts that are relevant to long-standing broad and dynamic topics, i.e., topics spanning several subtopics that change over time. This type of filtering in microblogs is essential for many applications such as social studies on large events and news tracking of temporal topics. In this paper, we introduce an adaptive microblog filtering task that focuses on tracking topics of broad and dynamic nature. We propose an entirely-unsupervised approach that adapts to new aspects of the topic to retrieve relevant microblogs. We evaluated our filtering approach using 6 broad topics, each tested on 4 different time periods over 4 months. Experimental results showed that, on average, our approach achieved 84% increase in recall relative to the baseline approach, while maintaining an acceptable precision that showed a drop of about 8%. Our filtering method is currently implemented on TweetMogaz, a news portal generated from tweets. The website compiles the stream of Arabic tweets and detects the relevant tweets to different regions in the Middle East to be presented in the form of comprehensive reports that include top stories and news in each region.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 52, Issue 4, July 2016, Pages 513–528
نویسندگان
, ,