Article code: 553717
Journal code: 873528
Publication year: 2011
English article: 11-page PDF
Full-text version: free download
English title of the ISI article
A hierarchical Naïve Bayes model for approximate identity matching
Related topics
Engineering and Basic Sciences > Computer Engineering > Information Systems
English abstract

Organizations often manage identity information for their customers, vendors, and employees. Identity management is critical to various organizational practices ranging from customer relationship management to crime investigation. The task of searching for a specific identity is difficult because disparate identity information may exist due to the issues related to unintentional errors and intentional deception. In this paper we propose a hierarchical Naïve Bayes model that improves existing identity matching techniques in terms of searching effectiveness. Experiments show that our proposed model performs significantly better than the exact-match based matching technique. With 50% training instances labeled, the proposed semi-supervised learning achieves a performance comparable to the fully supervised record comparison algorithm. The semi-supervised learning greatly reduces the efforts of manually labeling training instances without significant performance degradation.

Publisher
Database: Elsevier - ScienceDirect
Journal: Decision Support Systems - Volume 51, Issue 3, June 2011, Pages 413–423