کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
528203 869535 2016 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Web news mining in an evolving framework
ترجمه فارسی عنوان
استخراج اخبار وب در یک چارچوب در حال تکامل
کلمات کلیدی
اطلاعات بزرگ؛ معدن اخبار وب؛ سیستم های فازی در حال تکامل
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• Approach for classifying in real-time web news articles into various topics.
• The approach solves a big data problem since it can cope with huge amounts of news
• The presented approach is evolving (on-line, one-pass and recursive).
• The model that describes a specific topic area changes over time.
• The approach has been successfully tested using real on-line news.

Online news has become one of the major channels for Internet users to get news. News websites are daily overwhelmed with plenty of news articles. Huge amounts of online news articles are generated and updated everyday, and the processing and analysis of this large corpus of data is an important challenge. This challenge needs to be tackled by using big data techniques which process large volume of data within limited run times. Also, since we are heading into a social-media data explosion, techniques such as text mining or social network analysis need to be seriously taken into consideration.In this work we focus on one of the most common daily activities: web news reading. News websites produce thousands of articles covering a wide spectrum of topics or categories which can be considered as a big data problem. In order to extract useful information, these news articles need to be processed by using big data techniques. In this context, we present an approach for classifying huge amounts of different news articles into various categories (topic areas) based on the text content of the articles. Since these categories are constantly updated with new articles, our approach is based on Evolving Fuzzy Systems (EFS). The EFS can update in real time the model that describes a category according to the changes in the content of the corresponding articles. The novelty of the proposed system relies in the treatment of the web news articles to be used by these systems and the implementation and adjustment of them for this task. Our proposal not only classifies news articles, but it also creates human interpretable models of the different categories. This approach has been successfully tested using real on-line news.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Fusion - Volume 28, March 2016, Pages 90–98
نویسندگان
, , , ,