Article code | Journal code | Publication year | English article | Full-text version
4960912 | 1446504 | 2017 | 6-page PDF | Free download
English title of the ISI article
Plagiarism detection using document similarity based on distributed representation
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science (General)
English abstract

Plagiarism detection in documents requires accurate methods. It is generally implemented on the basis of similarity between documents. This paper evaluates the validity of using distributed representations of words to define document similarity. It proposes a plagiarism detection method based on the local maximal value of the length of the longest common subsequence (LCS), weighted by a distributed representation. The proposed method and two other straightforward methods, one based on the simple LCS length and one on the local maximal value of the LCS with no weight, are applied to the dataset of a plagiarism detection competition. The experimental results show that the proposed method is useful in applications that need strict detection of complex plagiarism.
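
The abstract does not spell out how the weight is defined, so the following is only a minimal Python sketch of the general idea, not the authors' method. It contrasts the plain LCS length with a weighted variant in which two different words may still be aligned if the cosine similarity of their vectors reaches a threshold. The names `weighted_lcs`, `vectors`, and `threshold`, the toy two-dimensional vectors, and the choice of cosine similarity as the weight are all assumptions made for illustration; the "local maximal value" step mentioned in the abstract is not implemented here.

```python
from math import sqrt


def lcs_length(a, b):
    """Classic LCS length between two token sequences (dynamic programming)."""
    m, n = len(a), len(b)
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if a[i - 1] == b[j - 1]:
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])
    return dp[m][n]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = sqrt(sum(x * x for x in u))
    norm_v = sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def weighted_lcs(a, b, vectors, threshold=0.8):
    """LCS-style score with soft matches (an illustrative assumption).

    Identical words contribute 1.0. Different words contribute their cosine
    similarity, but only if both have vectors and the similarity is at least
    `threshold`. Everything else contributes nothing.
    """
    m, n = len(a), len(b)
    dp = [[0.0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            wa, wb = a[i - 1], b[j - 1]
            if wa == wb:
                weight = 1.0
            elif wa in vectors and wb in vectors:
                weight = cosine(vectors[wa], vectors[wb])
            else:
                weight = 0.0
            best = max(dp[i - 1][j], dp[i][j - 1])
            if weight >= threshold:
                best = max(best, dp[i - 1][j - 1] + weight)
            dp[i][j] = best
    return dp[m][n]


# Toy word vectors, invented for this example only.
vectors = {"cat": (1.0, 0.0), "feline": (0.9, 0.1)}
source = ["the", "cat", "sat"]
suspect = ["the", "feline", "sat"]

plain_score = lcs_length(source, suspect)
soft_score = weighted_lcs(source, suspect, vectors)
```

In this toy case the plain LCS skips the substituted word, while the weighted variant gives partial credit for the near-synonym, which is the kind of paraphrased copying that exact matching misses. In practice the vectors would come from a trained word-embedding model rather than a hand-written dictionary.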

Publisher
Database: Elsevier - ScienceDirect
Journal: Procedia Computer Science - Volume 111, 2017, Pages 382-387