کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4950212 1364281 2018 18 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Privacy-preserved big data analysis based on asymmetric imputation kernels and multiside similarities
ترجمه فارسی عنوان
تجزیه و تحلیل داده های بزرگ حفظ شده با حفظ حریم خصوصی بر اساس هسته تقلبی نامتقارن و شباهت های چندگانه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی
This study presents an efficient approach for incomplete data classification, where the entries of samples are missing or masked due to privacy preservation. To deal with these incomplete data, a new kernel function with asymmetric intrinsic mappings is proposed in this study. Such a new kernel uses three-side similarities for kernel matrix formation. The similarity between a testing instance and a training sample relies not only on their distance but also on the relation between the testing sample and the centroid of the class, where the training sample belongs. This reduces biased estimation compared with typical methods when only one training sample is used for kernel matrix formation. Furthermore, centroid generation does not involve any clustering algorithms. The proposed kernel is capable of performing data imputation by using class-dependent averages. This enhances Fisher Discriminant Ratios and data discriminability. Experiments on two open databases were carried out for evaluating the proposed method. The result indicated that the accuracy of the proposed method was higher than that of the baseline. These findings thereby demonstrated the effectiveness of the proposed idea.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Future Generation Computer Systems - Volume 78, Part 2, January 2018, Pages 859-866
نویسندگان
, , , ,