کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
523158 868274 2007 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Measuring quality of similarity functions in approximate data matching
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Measuring quality of similarity functions in approximate data matching
چکیده انگلیسی

This paper presents a method for assessing the quality of similarity functions. The scenario taken into account is that of approximate data matching, in which it is necessary to determine whether two data instances represent the same real world object. Our method is based on the semi-automatic estimation of optimal threshold values. We propose two methods for performing such estimation. The first method is an algorithm based on a reward function, and the second is a statistical method. Experiments were carried out to validate the techniques proposed. The results show that both methods for threshold estimation produce similar results. The output of such methods was used to design a grading function for similarity functions. This grading function, called discernability, was used to compare a number of similarity functions applied to an experimental data set.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Informetrics - Volume 1, Issue 1, January 2007, Pages 35–46
نویسندگان
, , , ,