Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10321274 | Data & Knowledge Engineering | 2005 | 25 Pages |
Abstract
This paper proposes some solutions for the bootstrapping problem, that implicitly or explicitly use taxonomy definition: a baseline approach that classifies documents according to the class terms, and two clustering approaches, whose training is constrained by the a priori knowledge encoded in the taxonomy structure, which consists of both terminological and relational aspects. In particular, we propose the TaxSOM model, that clusters a set of documents in a predefined hierarchy of classes, directly exploiting the knowledge of both their topological organization and their lexical description. Experimental evaluation was performed on a set of taxonomies taken from the Google⢠and LookSmart⢠web directories, obtaining good results.
Keywords
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Giordano Adami, Paolo Avesani, Diego Sona,