Article ID Journal Published Year Pages File Type
383506 Expert Systems with Applications 2015 10 Pages PDF
Abstract

•Language independent Named Entity Recognition system.•Novel features based on latent semantics.•Experiments on multiple languages – English, Spanish, Dutch, Czech.•State-of-the-art results.

In this paper, we propose new features for Named Entity Recognition (NER) based on latent semantics. Furthermore, we explore the effect of unsupervised morphological information on these methods and on the NER system in general. The newly created NER system is fully language-independent thanks to the unsupervised nature of the proposed features. We evaluate the system on English, Spanish, Dutch and Czech corpora and study the difference between weakly and highly inflectional languages. Our system achieves the same or even better results than state-of-the-art language dependent systems. The proposed features proved to be very useful and are the main reason of our promising results.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , ,