Article ID: 490409
Journal: Procedia Computer Science
Published Year: 2013
Pages: 10 Pages
File Type: PDF
Abstract

The number of semi-structured documents being produced is steadily increasing, so discovering new knowledge from them will be essential. In this survey paper, we review popular approaches to mining semi-structured documents, covering both those that use structure alone and those that use structure together with content. For each technique, we provide a brief description and efficient algorithms for implementing it, and we compare the techniques using several comparison criteria.

Related Topics
Physical Sciences and Engineering > Computer Science > Computer Science (General)