Article code: 383023
Journal code: 660800
Publication year: 2013
English article: 19-page PDF
English title of the ISI article
RetriBlog: An architecture-centered framework for developing blog crawlers
Related topics
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
English abstract

Blogs have become an important social tool. They allow users to share their tastes, express their opinions, report news, and form groups around a subject, among other things. The information obtained from the blogosphere may be used to create applications in various fields. However, due to the growing number of blogs posted every day, as well as the dynamicity of the blogosphere, the task of extracting relevant information from blogs has become difficult and time-consuming. In this paper, we use information retrieval and extraction techniques to deal with this problem. Furthermore, because blogs have many variation points, applications that process them must be easy to adapt. Faced with this scenario, the work proposes RetriBlog, an architecture-centered framework for the development of blog crawlers. Finally, it presents an evaluation of the proposed algorithms and three case studies.


► We developed a blog crawler using software engineering techniques.
► We created applications related to the social web.
► We proposed a quantitative and qualitative evaluation.

Publisher
Database: Elsevier - ScienceDirect
Journal: Expert Systems with Applications - Volume 40, Issue 4, March 2013, Pages 1177–1195