کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
455622 695522 2014 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An efficient and scalable plagiarism checking system using Bloom filters
ترجمه فارسی عنوان
یک سیستم چک کردن دزدی اداری کارآمد و مقیاس پذیر با استفاده از بلوم فیلتر کردن یک ؟؟
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات
چکیده انگلیسی


• Using Bloom filters to enhance the efficiency of the plagiarism detection system.
• Enhancing the speed of plagiarism detection and reducing the memory required to store the documents.
• Considering the privacy of content while running similarity estimation process.
• Adjustable according to the requirements of the relevant application.

With the easy access to the huge volume of articles available on the Internet, plagiarism is getting worse and worse. Most recent approaches proposed to address this problem usually focus on achieving better accuracy of similarity detection process. However, there are some real applications where plagiarized contents should be detected without revealing any information. Moreover, in such web-based applications, running time, memory consumption, communication and computational complexity should be also taken into account. In this paper, we propose a similar document detection system based on matrix Bloom filter, a new extension of standard Bloom filter. The experimental results on a real dataset show that the system can achieve 98% of accuracy. We also compare our approach with a method recently proposed for the same purpose. The results of the comparison show that the Bloom filter-based approach achieves much better performance than other in terms of the aforementioned factors.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Electrical Engineering - Volume 40, Issue 6, August 2014, Pages 1789–1800
نویسندگان
, ,