دانلود رایگان مقاله: رویکرد اقتصادی به داده های بزرگ در یک زبان کوچک

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
488814	703943	2014	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

An Economic Approach to Big Data in a Minority Language

ترجمه فارسی عنوان

رویکرد اقتصادی به داده های بزرگ در یک زبان کوچک

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش مقاله

رویکرد اقتصادی به داده های بزرگ در یک زبان کوچک

چکیده انگلیسی

Google's n-gram project brought recently big data benefits to several main world languages, like English, Chinese etc. Any attempt to derive such systems, aimed to accelerate the development of NLP applications for world minority languages, in the manner in which it has been done in the project, encounters many obstacles. This paper presents an innovative and economic approach to large-scale n-gram system creation applied to the Croatian language case. Instead of using the Web as the world's biggest text repository, our process of n-gram collection relies on the Croatian academic online spellchecker Hascheck, a language service publicly available since 1993 and popular worldwide. The service has already processed a corpus whose size exceeds the size of the Croatian web-corpus created in recent years. Contrary to the Google n-gram systems, where cutoff criteria were applied, our n-gram filtering is based on dictionary criteria. This resulted in a system comparable in size to the largest n-gram systems of today. Because of the reliance on a service in constant use, the Croatian n-gram system is a dynamic one, unique among the systems compared. The importance of having an n-gram infrastructure for rapid breakthroughs in new application areas is also exemplified in the paper.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 35, 2014, Pages 427-436

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : رویکرد اقتصادی به داده های بزرگ در یک زبان کوچک

دسترسی سریع

ارتباط

English Website