Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4966421 | Information Processing & Management | 2017 | 16 Pages |
Abstract
We experimentally evaluate the properties of our algorithm by processing 2400 web pages. On this set of web pages, we show that our algorithm is almost 90% faster than the reference algorithm. We also show that our algorithm's accuracy is between 47% and 133% of the reference algorithm's accuracy, with an indirect correlation between our algorithm's accuracy and the depth of the inspected page structure. In our experiments, we also demonstrate the advantages of producing a flat segmentation structure instead of a hierarchy.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Jan Zeleny, Radek Burget, Jaroslav Zendulka