Article ID Journal Published Year Pages File Type
487042 Procedia Computer Science 2016 6 Pages PDF
Abstract

Detecting the sentence boundary forms the basic step for many natural language applications. A lot of work has been done in this direction for English and other foreign languages. But not much work has been done for Indian languages. This paper proposes a rule based system for correctly identifying the boundary of the sentence written in Marathi. The task of identifying a sentence end in Marathi is made complex by the fact that Marathi language do not have indication of sentence start like the English has capital letters for indicating the start of new sentences. The system uses certain rules to correctly determine the end of sentence.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)