Article ID Journal Published Year Pages File Type
493289 Procedia Technology 2012 6 Pages PDF
Abstract

General crawlers use a breath first search to download as many pages as possible. Focused crawler can help the search engine to index all documents present on the Web related to a specific domain which in turn provides the search engine's users complete and up-to-date contents. In this paper we present a focused crawler capable of learning. Crawling results for four consecutive crawls are shown. Results shows significant improvement in the precision value for the crawler with respect to the number of crawling attempts made.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)