Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
490409 | Procedia Computer Science | 2013 | 10 Pages |
Abstract
The number of semi-structured documents that is produced is steadily increasing. Thus, it will be essential for discovering new knowledge from them. In this survey paper, we review popular semi-structured documents mining approaches (structure alone and both structure and content). We provide a brief description of each technique as well as efficient algorithms for implementing the technique and comparing them using different comparison criteria.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)