کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4962184 1446526 2016 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Exploiting Parallel Sentences and Cosine Similarity for Identifying Target Language Translation
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
Exploiting Parallel Sentences and Cosine Similarity for Identifying Target Language Translation
چکیده انگلیسی

In recent times, The Internet has become a huge information resource which contains information in multiple languages. Users are not acquainted with all languages and this language diversity becomes a great barrier for world communication. Cross-Language Information Retrieval (CLIR) provides a solution for this language barrier where a user can search the required information in his regional language. In this paper, a CLIR system is proposed based on Parallel Corpus (PC). A set of parallel sentences are extracted from PC which are based on query words. Term frequency matrix and cosine similarity measure are used for identifying target language translation. The proposed Term Frequency Method (TFM) approach is compared with Probabilistic Lexicon Method (PLM) approach and result analysis shows that proposed TFM approach performs better than the PLM approach.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 89, 2016, Pages 428-433
نویسندگان
, ,