Article ID Journal Published Year Pages File Type
725337 The Journal of China Universities of Posts and Telecommunications 2011 5 Pages PDF
Abstract

Text classification has gained booming interest over the past few years. The traditional approaches of text classification commonly extract features from a signal test criterion, resulting in the problem of “over fitting”. This paper takes test criterions such as frequency, dispersion and concentration indices into account and proposes an improved dimension reduction method and feature weighting method, making the selection more representative and the weighting of characteristic features more reasonable. Experimental results show that the new method has high precision and recall rates.

Related Topics
Physical Sciences and Engineering Engineering Electrical and Electronic Engineering