Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
506007 | Computers in Biology and Medicine | 2007 | 7 Pages |
Abstract
Biological named entity recognition is a critical task for automatically mining knowledge from biological literature. In this paper, the task is cast as a sequential labeling problem, and a Conditional Random Fields (CRF) model is introduced to solve it. Within the CRF framework, rich features are used, including literal, contextual, and semantic features. Among these, shallow syntactic features are introduced for the first time, and they effectively improve the model's performance. Experiments show that our method achieves an F-measure of 71.2% on an open evaluation data set, which is better than most state-of-the-art systems.
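To make the sequential labeling framing concrete, the following is a minimal sketch of how per-token features of the kinds named in the abstract (literal, context, and shallow syntactic) might be assembled before being passed to a CRF trainer. It is not the authors' feature set: the function names `word_shape`, `token_features`, and `sentence_features`, the feature keys, the BIO-style labels, the chunk tags, and the example sentence are all assumptions made for illustration. Semantic features and the CRF training step itself are left out.

```python
from typing import Dict, List, Union

FeatureValue = Union[str, bool]


def word_shape(token: str) -> str:
    """Collapse a token into a coarse shape (hypothetical literal feature)."""
    shape = []
    for ch in token:
        if ch.isupper():
            shape.append("A")
        elif ch.islower():
            shape.append("a")
        elif ch.isdigit():
            shape.append("0")
        else:
            shape.append(ch)  # keep punctuation such as '-' unchanged
    return "".join(shape)


def token_features(tokens: List[str], chunk_tags: List[str], i: int,
                   window: int = 1) -> Dict[str, FeatureValue]:
    """Build a feature dict for the token at position i."""
    token = tokens[i]
    feats: Dict[str, FeatureValue] = {
        # Literal features: properties of the surface string itself.
        "word.lower": token.lower(),
        "word.shape": word_shape(token),
        "word.has_digit": any(c.isdigit() for c in token),
        "word.has_hyphen": "-" in token,
        # Shallow syntactic feature: chunk tag, assumed to come from an
        # external chunker (not implemented here).
        "chunk": chunk_tags[i],
    }
    # Context features: neighbouring words and their chunk tags.
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        j = i + offset
        if 0 <= j < len(tokens):
            feats[f"word[{offset:+d}].lower"] = tokens[j].lower()
            feats[f"chunk[{offset:+d}]"] = chunk_tags[j]
        else:
            feats[f"word[{offset:+d}].lower"] = "<PAD>"
    return feats


def sentence_features(tokens: List[str],
                      chunk_tags: List[str]) -> List[Dict[str, FeatureValue]]:
    """Build one feature dict per token in the sentence."""
    if len(tokens) != len(chunk_tags):
        raise ValueError("tokens and chunk_tags must have the same length")
    return [token_features(tokens, chunk_tags, i) for i in range(len(tokens))]


# Illustrative sentence; the chunk tags and entity labels are made up.
tokens = ["IL-2", "gene", "expression", "requires", "NF-kappa", "B"]
chunks = ["B-NP", "I-NP", "I-NP", "B-VP", "B-NP", "I-NP"]
labels = ["B-DNA", "I-DNA", "O", "O", "B-protein", "I-protein"]

features = sentence_features(tokens, chunks)
```

Each sentence then becomes a pair of equal-length sequences (`features`, `labels`), which is the input shape a linear-chain CRF toolkit typically expects. Which toolkit and which exact features the paper used is not stated in the abstract.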
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Chengjie Sun, Yi Guan, Xiaolong Wang, Lei Lin