Article ID Journal Published Year Pages File Type
4966421 Information Processing & Management 2017 16 Pages PDF
Abstract
We experimentally evaluate the properties of our algorithm by processing 2400 web pages. On this set of web pages, we prove that our algorithm is almost 90% faster than the reference algorithm. We also show that our algorithm accuracy is between 47% and 133% of the reference algorithm accuracy with indirect correlation of our algorithm's accuracy to the depth of inspected page structure. In our experiments, we also demonstrate the advantages of producing a flat segmentation structure instead of an hierarchy.
Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
, , ,