Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4942482 | Electronic Commerce Research and Applications | 2017 | 34 Pages |
Abstract
In recent decades, analyzing the sentiments in online customer reviews has become important to many businesses and researchers. However, insufficient amount of labeled training corpus is a bottleneck for machine learning approaches. Self-training is one of the promising semi-supervised techniques which does not require large amounts of labeled data. However, self-training also suffers from an incorrect labeling problem along with insufficient amount of labeled data. This study proposed a semi-supervised learning framework that adds only confidently predicted data to the training corpus in order to enrich the initial classifier in self-training. The experimental results indicate that the proposed method performed better than self-training.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Sangheon Lee, Wooju Kim,