Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
409071 | Neurocomputing | 2008 | 11 Pages |
Abstract
We propose a method for inferring semantic information from textual data in content-based multimedia retrieval. Training examples of images and videos belonging to a specific semantic class are associated with their low-level visual and aural descriptors augmented with textual features such as frequencies of significant words. A fuzzy mapping of a semantic class in the training set to a class of similar objects in the test set is created by using Self-Organizing Maps (SOMs) trained from the low-level descriptors. Experiments with two databases and different textual features show promising results, indicating the usefulness of the approach in bridging the gap from low-level visual features to semantic concepts.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Mats Sjöberg, Jorma Laaksonen, Timo Honkela, Matti Pöllä,