کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6900709 1446490 2018 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Keyword query based focused Web crawler
ترجمه فارسی عنوان
کاوشگر وب متمرکز بر جستجوی کلمات کلیدی است
کلمات کلیدی
خزنده وب، بازیابی اطلاعات، خزنده وب متمرکز، کاوشگر مبتنی بر پرس و جو،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی
Finding information on Web is a difficult and challenging task because of the extremely large volume of data. Search engine can be used to facilitate this task, but it is still difficult to cover all the webpages present on Web. This paper proposes a query based crawler where a set of keywords relevant to the topic of interest of the user is used to shoot queries on search interface. These search interfaces are found on webpage of the website corresponding to seed URL. This helps crawler to get most relevant links from the domain without actually going in depth of that domain. No existing focused crawling approach uses query based approach to find webpages of interest. In the proposed crawler, list of keywords is passed to the search query interfaces found on the websites. The proposed work will give the most relevant information based on the keywords in a particular domain without actually crawling through many irrelevant links in between them.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 125, 2018, Pages 584-590
نویسندگان
, , , ,