دانلود رایگان مقاله: بهبود دقت مدل های طبقه بندی ناشی از مجموعه داده های ناشناس

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
392914	665198	2014	24 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Improving accuracy of classification models induced from anonymized datasets

ترجمه فارسی عنوان

بهبود دقت مدل های طبقه بندی ناشی از مجموعه داده های ناشناس

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Privacy Preserving Data Publishing k-anonymity - K نامعلوم Privacy preserving data mining - حفظ حریم خصوصی داده کاوی Classification - طبقه بندی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

بهبود دقت مدل های طبقه بندی ناشی از مجموعه داده های ناشناس

چکیده انگلیسی

• We present a new anonymization algorithm for privacy-preserving data publishing.
• The non-homogeneous generalization is coupled with sensitive value distributions.
• The predictive utility of the algorithm is measured on eight anonymized datasets.
• The algorithm outperforms other methods with four classification techniques.

The performance of classifiers and other data mining models can be significantly enhanced using the large repositories of digital data collected nowadays by public and private organizations. However, the original records stored in those repositories cannot be released to the data miners as they frequently contain sensitive information. The emerging field of Privacy Preserving Data Publishing (PPDP) deals with this important challenge. In this paper, we present NSVDist (Non-homogeneous generalization with Sensitive Value Distributions)—a new anonymization algorithm that, given minimal anonymity and diversity parameters along with an information loss measure, issues corresponding non-homogeneous anonymizations where the sensitive attribute is published as frequency distributions over the sensitive domain rather than in the usual form of exact sensitive values. In our experiments with eight datasets and four different classification algorithms, we show that classifiers induced from data generalized by NSVDist tend to be more accurate than classifiers induced using state-of-the-art anonymization algorithms.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 256, 20 January 2014, Pages 138–161

نویسندگان

Mark Last, Tamir Tassa, Alexandra Zhmudyak, Erez Shmueli,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : بهبود دقت مدل های طبقه بندی ناشی از مجموعه داده های ناشناس

دسترسی سریع

ارتباط

English Website