کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4977865 1452015 2016 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Indonesian syllabification using a pseudo nearest neighbour rule and phonotactic knowledge
ترجمه فارسی عنوان
سیلاب کردن اندونزیایی با استفاده از یک قانون شبه نزدیک ترین همسایه و دانش فنوتیکیک
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی
This paper discusses phonemic syllabification using a pseudo nearest neighbour rule (PNNR) and phonotactic knowledge for Indonesian language. The proposed data-driven model uses a four-feature phoneme encoding and a phonotactic-based pre-syllabification. Evaluating on 50 k words dataset using 5-fold cross-validation shows that the proposed encoding significantly reduces the average syllable error rate (SER) by 13.90% relatively to the commonly used orthogonal binary encoding and the pre-syllabification also reduces the average SER up to 17.17% relatively to the PNNR without pre-syllabification. Five-fold cross-validating proves that the proposed PNNR-based syllabification is stable by producing an average SER of 0.64%. Most errors come from derivatives with the prefixes 'ber', 'per', and 'ter' as well as from compound words. This result is also significantly lower than a Look-Up-based syllabification that gives an average SER of 2.60%.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 85, December 2016, Pages 109-118
نویسندگان
, , , ,