کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
535757 870374 2013 5 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Source code author identification with unsupervised feature learning
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Source code author identification with unsupervised feature learning
چکیده انگلیسی

Automatic identification of source code authors has many applications in different fields such as source code plagiarism detection, and law suit prosecution. This paper presents a new source code author identification system based on an unsupervised feature learning technique. As a method of extracting features from high dimensional data, unsupervised feature learning has obtained a great success in many fields such as character recognition and image classification. However, according to our knowledge it has not been applied for source code author identification systems. Therefore, we investigated an unsupervised feature learning technique called sparse auto-encoder as a method of extracting features from source code files. Our system was evaluated with several datasets and results have shown that performance is very close to the state of art techniques in the source code identification field.


► We designed and built a new source code author identification system.
► It is based on an unsupervised feature learning technique.
► Our system is evaluated with several datasets.
► Results shown that our system outperforms some existing systems.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 34, Issue 3, 1 February 2013, Pages 330–334
نویسندگان
, ,