کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
10321190 659208 2015 22 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Hiding outliers into crowd: Privacy-preserving data publishing with outliers
ترجمه فارسی عنوان
پنهان کردن ناقلین به جمعیت: انتشار اطلاعات با حفظ حریم خصوصی با نادیده گرفتن
کلمات کلیدی
امنیت، صداقت و محافظت، به اشتراک گذاری داده ها، شناسایی داده ها، ناپایدارها،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
In recent years, many organizations publish their data in non-aggregated format for research purpose. However, publishing non-aggregated data raises serious concerns in data privacy. One of the concerns is that when outliers exist in the dataset, they are easier to be distinguished from the crowd and their privacy is prone to be compromised. In this paper, we study the problem of privacy-preserving publishing datasets that contain outliers. We define the distinguishability-based attack by which the adversary can identify outliers and reveal their private information from an anonymized dataset. We show that the existing syntactic privacy models (e.g., k-anonymity and ℓ-diversity) cannot defend against the distinguishability-based attack. We define the plain ℓ-diversity to provide privacy guarantee to outliers against the distinguishability-based attack, and design efficient algorithms to anonymize the dataset to achieve plain ℓ-diversity with low information loss. We extend our anonymization approach to deal with continuous release of a series of datasets that contain outliers. Our experiments demonstrate the efficiency and effectiveness of our approaches.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Data & Knowledge Engineering - Volume 100, Part A, November 2015, Pages 94-115
نویسندگان
, ,