کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
515909 867139 2011 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Automatic word spacing of erroneous sentences in mobile devices with limited hardware resources
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Automatic word spacing of erroneous sentences in mobile devices with limited hardware resources
چکیده انگلیسی

With the rapid evolution of the mobile environment, the demand for natural language applications on mobile devices is increasing. This paper proposes an automatic word spacing system, the first step module of natural language processing (NLP) for many languages with their own word spacing rules, that is designed for mobile devices with limited hardware resources. The proposed system uses two stages. In the first stage, it preliminarily corrects word spacing errors by using a modified hidden Markov model based on character unigrams. In the second stage, the proposed system re-corrects the miscorrected word spaces by using lexical rules based on character bigrams or longer combinations. By using this hybrid method, the proposed system improves the robustness against unknown word patterns, reduces memory usage, and increases accuracy. To evaluate the proposed system in a realistic mobile environment, we constructed a mobile-style colloquial corpus using a simple simulation method. In experiments with a commercial mobile phone, the proposed system showed good performances (a response time of 0.20 s per sentence, a memory usage of 2.04 MB, and an accuracy of 92–95%) in the various evaluation measures.

Research highlights
► An automatic word spacing system for mobile devices with limited hardware resources.
► A hybrid model using statistical information and lexical rules.
► Good performances; a memory usage of 2.04 MB and an accuracy of 92–95%.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 47, Issue 4, July 2011, Pages 518–527
نویسندگان
, ,