کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4960371 1364896 2017 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Enhancing Arabic stemming process using resources and benchmarking tools
ترجمه فارسی عنوان
افزایش روند فرآیند عربی با استفاده از منابع و ابزارهای معیار سنجش
کلمات کلیدی
عربی، ارزیابی، معیار، ارزیابی کپسول،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی

Many approaches and solutions have been proposed for developing Arabic light stemmers. These stemmers are often used in the context of application-oriented projects, especially when it comes to developing information retrieval (IR) systems. However, Arabic light stemming, as the process of stripping off a set of prefixes and/or suffixes, is a blinded task suffering from problems such as incorrect removal, vocalization ambiguity, single solution, etc. Moreover, each researcher claims that his/her stemmer reached a level of strength and accuracy quite high. However, in most cases, these stemmers are black boxes and it is not possible to access neither their source codes to verify their validity, nor the evaluation corpora that were used to claim such accuracy. Since these stemmers are very important for researchers, their comparison and evaluation is then essential to facilitate the choice of the stemmer to use in a given project. In this paper, we propose a new Arabic stemmer that gives solutions to the above mentioned drawbacks. In addition, we propose an automatic approach for the evaluation and comparison of Arabic stemmers that takes into account metrics related to the accuracy of results as well as the execution time of stemmers.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of King Saud University - Computer and Information Sciences - Volume 29, Issue 2, April 2017, Pages 164-170
نویسندگان
, , , ,