Article ID: 384727
Journal: Expert Systems with Applications
Published Year: 2009
Pages: 5
File Type: PDF
Abstract

Online mining of path traversal patterns from Web click-streams is one of the most important problems in Web usage mining. In this paper, we propose a sliding-window-based Web data mining algorithm, called Top-SW (Top-k path traversal patterns of Stream sliding Window), to discover the set of top-k path traversal patterns from streaming maximal forward references, where k is the desired number of path traversal patterns to be mined. A new summary data structure, called Top-list (a list of Top-k path traversal patterns), is developed to maintain the essential information about the top-k path traversal patterns in the current stream of maximal forward references. Experimental studies show that the proposed Top-SW algorithm is an efficient, single-pass algorithm for mining the set of top-k path traversal patterns from a continuous stream of maximal forward references.
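
To make the problem setting concrete, the following is a minimal Python sketch of top-k path mining over a sliding window of maximal forward references. It is not the Top-SW algorithm and does not reproduce the Top-list structure, neither of which is specified in this abstract. It is a naive baseline that assumes a count-based window, treats a path traversal pattern as any contiguous subpath of a reference, and keeps exact counts for all subpaths. The names `SlidingWindowTopK`, `subpaths`, `window_size`, and `top_k` are hypothetical and chosen only for illustration.

```python
from collections import Counter, deque


def subpaths(reference):
    """Yield each distinct contiguous subpath of one reference exactly once."""
    seen = set()
    n = len(reference)
    for i in range(n):
        for j in range(i + 1, n + 1):
            path = tuple(reference[i:j])
            if path not in seen:
                seen.add(path)
                yield path


class SlidingWindowTopK:
    """Naive illustration (assumption): exact subpath counts over the
    last `window_size` maximal forward references."""

    def __init__(self, window_size, k):
        self.window_size = window_size
        self.k = k
        self.window = deque()     # references currently inside the window
        self.counts = Counter()   # subpath -> number of references containing it

    def add(self, reference):
        # Expire the oldest reference once the window is full.
        if len(self.window) == self.window_size:
            expired = self.window.popleft()
            for path in subpaths(expired):
                self.counts[path] -= 1
                if self.counts[path] == 0:
                    del self.counts[path]
        # Insert the new reference and count its subpaths.
        self.window.append(tuple(reference))
        for path in subpaths(reference):
            self.counts[path] += 1

    def top_k(self):
        # Highest count first; ties broken by tuple order for determinism.
        ranked = sorted(self.counts.items(), key=lambda kv: (-kv[1], kv[0]))
        return ranked[: self.k]


miner = SlidingWindowTopK(window_size=2, k=2)
for ref in [["A", "B", "C"], ["A", "B"], ["B", "C"]]:
    miner.add(ref)
print(miner.top_k())
```

Each reference is processed once on arrival and once on expiry, so the sketch is single-pass over the stream, but it stores every subpath in the window. A summary structure such as the Top-list described above would presumably avoid that cost by retaining only the information needed for the top-k patterns.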

Related Topics
Physical Sciences and Engineering > Computer Science > Artificial Intelligence