کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
452968 694678 2012 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Reprint of: Efficient crawling through URL ordering
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات
پیش نمایش صفحه اول مقاله
Reprint of: Efficient crawling through URL ordering
چکیده انگلیسی

In this paper we study in what order a crawler should visit the URLs it has seen, in order to obtain more “important” pages first. Obtaining important pages rapidly can be very useful when a crawler cannot visit the entire Web in a reasonable amount of time. We define several importance metrics, ordering schemes, and performance evaluation measures for this problem. We also experimentally evaluate the ordering schemes on the Stanford University Web. Our results show that a crawler with a good ordering scheme can obtain important pages significantly faster than one without.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Networks - Volume 56, Issue 18, 17 December 2012, Pages 3849–3858
نویسندگان
, , ,