Article ID Journal Published Year Pages File Type
488818 Procedia Computer Science 2014 10 Pages PDF
Abstract

Young people commonly use slang in the texts for weblogs or Social Networking Sites. How to treat such slang words properly is one of the problems in the field of text mining. In this paper, we examined several methods to extract Japanese slang called “Wakamono Kotoba,” which is particularly used by young people, by focusing on its script type and stroke count. In the evaluation experiment, a high precision was obtained when we adopted script type for extraction.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)