Article code: 528301
Journal code: 869553
Publication year: 2012
English article: 15-page PDF
Full-text version: Free download
English title of the ISI article
Quantifying the correctness, computational complexity, and security of privacy-preserving string comparators for record linkage
Related topics
Engineering and Basic Sciences > Computer Engineering > Computer Vision and Pattern Recognition
English abstract

Record linkage is the task of identifying records from disparate data sources that refer to the same entity. It is an integral component of data processing in distributed settings, where the integration of information from multiple sources can prevent duplication and enrich overall data quality, thus enabling more detailed and correct analysis. Privacy-preserving record linkage (PPRL) is a variant of the task in which data owners wish to perform linkage without revealing identifiers associated with the records. This task is desirable in various domains, including healthcare, where it may not be possible to reveal patient identity due to confidentiality requirements, and in business, where it could be disadvantageous to divulge customers’ identities. To perform PPRL, it is necessary to apply string comparators that function in the privacy-preserving space. A number of privacy-preserving string comparators (PPSCs) have been proposed, but little research has compared them in the context of a real record linkage application. This paper performs a principled and comprehensive evaluation of six PPSCs in terms of three key properties: (1) correctness of record linkage predictions, (2) computational complexity, and (3) security. We utilize a real publicly-available dataset, derived from the North Carolina voter registration database, to evaluate the tradeoffs between the aforementioned properties. Among our results, we find that PPSCs that partition, encode, and compare strings yield highly accurate record linkage results. However, as a tradeoff, we observe that such PPSCs are less secure than those that map and compare strings in a reduced dimensional space.


► This work provides a detailed survey and evaluation of privacy-preserving string comparators.
► The accuracy, computational complexity, and security of each comparator are analyzed.
► Some comparators are sufficiently accurate, secure, and fast for real-world use.
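
To make the "partition, encode, and compare" idea from the abstract concrete, here is a minimal sketch of one well-known comparator of that kind: each string is split into bigrams, the bigrams are hashed into a Bloom filter, and two filters are compared with the Dice coefficient. The abstract does not name the six comparators evaluated, so treat this as an illustration only, not as the paper's method. The function names, the filter size of 1000 bits, the 30 hash functions, and the shared secret are all assumptions chosen for the example.

```python
import hashlib


def bigrams(s):
    """Partition: split a string into padded, lowercased 2-grams."""
    padded = f"_{s.lower()}_"
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


def bloom_encode(s, size=1000, num_hashes=30, secret=b"shared-key"):
    """Encode: map each bigram to num_hashes bit positions.

    Returns the set of positions that are set to 1. The secret is a
    hypothetical key shared by the data owners (an assumption here).
    Positions are derived by double hashing: (h1 + i * h2) mod size.
    """
    bits = set()
    for gram in bigrams(s):
        data = secret + gram.encode("utf-8")
        h1 = int.from_bytes(hashlib.sha1(data).digest(), "big")
        h2 = int.from_bytes(hashlib.sha256(data).digest(), "big")
        for i in range(num_hashes):
            bits.add((h1 + i * h2) % size)
    return bits


def dice(a, b):
    """Compare: Dice coefficient of two sets of bit positions."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


if __name__ == "__main__":
    smith = bloom_encode("Smith")
    for other in ("Smith", "Smyth", "Jones"):
        print(other, round(dice(smith, bloom_encode(other)), 3))
```

Only the encoded filters would be exchanged between parties, never the names. Similar strings share bigrams and therefore many bit positions, which supports accurate matching, but that same overlap is a likely source of the weaker security the abstract reports for this family.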

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Fusion - Volume 13, Issue 4, October 2012, Pages 245–259