کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4944162 1437981 2017 30 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Explicitly and implicitly exploiting the hierarchical structure for mining website interests on news events
ترجمه فارسی عنوان
به صراحت و به طور ضمنی از ساختار سلسله مراتبی برای منافع وب سایت معدن در رویدادهای خبری استفاده می کند
کلمات کلیدی
استخراج متن، معدن وب رویداد خبر علاقه وب سایت،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
After a news event, many different websites publish coverage of that event, each expressing their own unique commentary, perspectives, and viewpoints. Websites form around a specific set of interests to cater to different audiences, and discovering these interests can help audiences C especially people and organizations that are interested in news C select the most appropriate websites to use as their sources of information. This paper presents three methods for formally defining and mining a websites interests, each of which is explicitly or implicitly based on a hierarchial structure: website-webpage-keyword. The first, and most straightforward, method explicitly uses keyword-layer network communities and the mapping relations between websites and keywords. The second method expands upon the first method with an iterative algorithm that combines both the mapping relations and the network relations from the website-webpage-keyword structure to further refine the keyword-layer network communities. In the third method, a website topic model implicitly captures the mapping relations among the websites, webpages, and keywords. The performance of three proposed methods in website interest mining is compared using a bespoke evaluation metric. The experimental results show that the iterative procedure designed in the second method is able to improve website interest mining performance, and the website topic model in the third method achieves the best performance among the three methods.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 420, December 2017, Pages 263-277
نویسندگان
, , , ,