Article code: 385134
Journal code: 660860
Year of publication: 2011
English article: 10 pages, PDF
English title of the ISI article
A TENGRAM method based part-of-speech tagging of multi-category words in Hindi language
Related topics
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
Abstract

In this paper, we address the problem of part-of-speech tagging of multi-category words that appear within sentences of the Hindi language. First, a Hindi tagger is proposed that provides part-of-speech tags developed using the grammar of the Hindi language. For this purpose, Hindi Devanagari alphabets are used, and their Hindi transliteration is done within the proposed tagger. Thereafter, a rule-based TENGRAM method is described with an illustrative example; it guides the disambiguation of multi-category words within sentences of a Hindi corpus. The rules generated in TENGRAM result from the computation of discernibility matrices, discernibility functions and reducts. These computations are derived from decision tables, which are based on the theory of rough sets. In essence, a discernibility matrix helps eliminate indiscernible condition attributes; a discernibility function has rows corresponding to each column in the discernibility matrix and is used to develop reducts; and the reducts provide a minimal subset of attributes that preserves the indiscernibility relation of the decision tables and hence generate the decision rules.


► Development of Hindi tagger and transliteration.
► Generation of rule-based TENGRAM method for Hindi.
► TENGRAM disambiguates multi-category words (MCWs) within Hindi sentences.
► TENGRAM employs decision tables, discernibility matrices, discernibility functions and reducts.
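
The rough-set pipeline named in the abstract (decision table, discernibility matrix, reducts, decision rules) can be illustrated with a minimal Python sketch. This is not the paper's TENGRAM implementation: the condition attributes (`prev_tag`, `next_tag`, `has_postposition`), the four table rows and all function names are hypothetical, chosen only to show the mechanics on a toy table, and the reducts are found by brute force instead of by simplifying the discernibility function symbolically.

```python
from itertools import combinations

# Hypothetical toy decision table: each row is one occurrence of an ambiguous
# word. Attribute names and values are invented for illustration only.
CONDITIONS = ("prev_tag", "next_tag", "has_postposition")
TABLE = [
    ({"prev_tag": "ADJ", "next_tag": "VERB", "has_postposition": "no"}, "NOUN"),
    ({"prev_tag": "NOUN", "next_tag": "AUX", "has_postposition": "no"}, "VERB"),
    ({"prev_tag": "ADJ", "next_tag": "AUX", "has_postposition": "yes"}, "NOUN"),
    ({"prev_tag": "NOUN", "next_tag": "VERB", "has_postposition": "no"}, "VERB"),
]


def discernibility_matrix(table, conditions):
    """For each pair of rows with different decisions, record the set of
    condition attributes on which the two rows differ."""
    matrix = {}
    for (i, (ci, di)), (j, (cj, dj)) in combinations(enumerate(table), 2):
        if di != dj:
            matrix[(i, j)] = frozenset(a for a in conditions if ci[a] != cj[a])
    return matrix


def reducts(matrix, conditions):
    """Brute-force search for minimal attribute subsets that intersect every
    non-empty matrix entry (equivalently, that satisfy the discernibility
    function, a conjunction of one disjunction per entry)."""
    entries = [e for e in matrix.values() if e]
    found = []
    for size in range(1, len(conditions) + 1):
        for subset in combinations(conditions, size):
            chosen = set(subset)
            if any(set(r) <= chosen for r in found):
                continue  # contains a smaller reduct, so it is not minimal
            if all(chosen & e for e in entries):
                found.append(subset)
    return found


def decision_rules(table, reduct):
    """Project the table onto the reduct attributes and collect the decisions
    observed for each combination of values."""
    rules = {}
    for cond, decision in table:
        key = tuple(cond[a] for a in reduct)
        rules.setdefault(key, set()).add(decision)
    return rules


if __name__ == "__main__":
    m = discernibility_matrix(TABLE, CONDITIONS)
    for r in reducts(m, CONDITIONS):
        print(r, decision_rules(TABLE, r))
```

On this toy table, rows 0 and 3 carry different tags and differ only in `prev_tag`, so every reduct must contain that attribute; here it is also sufficient on its own, which yields two rules keyed on the previous tag.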

Publisher
Database: Elsevier - ScienceDirect
Journal: Expert Systems with Applications - Volume 38, Issue 12, November–December 2011, Pages 15084–15093