Article ID Journal Published Year Pages File Type
552533 Decision Support Systems 2014 16 Pages PDF
Abstract

•FLOPPIES enable (semi-)automatic ontology population of online product information.•We use a product ontology that is compatible with the GoodRelations ontology.•The average Information Gain is used to determine the correct product class.•For the evaluation we have used a training and test set, consisting of 1718 products.•Our approach outperforms the baseline approach at all stages of the population process.

With the vast amount of information available on the Web, there is an urgent need to structure Web data in order to make it available to both users and machines. E-commerce is one of the areas in which growing data congestion on the Web impedes data accessibility. This paper proposes FLOPPIES, a framework capable of semi-automatic ontology population of tabular product information from Web stores. By formalizing product information in an ontology, better product comparison or parametric search applications can be built, using the semantics of product attributes and their corresponding values. The framework employs both lexical and pattern matching for classifying products, mapping properties, and instantiating values. It is shown that the performance on instantiating TVs and MP3 players from Best Buy and Newegg.com looks promising, achieving an F1-measure of approximately 77%.

Related Topics
Physical Sciences and Engineering Computer Science Information Systems
Authors
, , , ,