Article ID Journal Published Year Pages File Type
534492 Pattern Recognition Letters 2015 5 Pages PDF
Abstract

•New feature weighting schemes for speech-act classification.•Entropy of probability distributions over all categories.•Log-odds ratio of positive and negative category distributions.

Speech-act classification is essential to generation and understanding of utterances within a natural language dialogue system since the speech-act of an utterance is closely tied to a user intention. The binary feature weighting scheme has mainly been used for speech-act classification because traditional feature weighting schemes such as tf.idf are not effective in speech-act classification due to the short length of utterances. This paper studies two effective feature weighting schemes using the category distributions of features: (1) the first one exploits the entropy of whole category distributions and (2) the second one the log-odds ratio of positive and negative category distributions. As a result, the proposed schemes show significant improvement on SVM and k-NN classifiers in our experiments.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
,