کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
436175 689975 2014 26 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A theoretical framework for knowledge-based entity resolution
ترجمه فارسی عنوان
یک چارچوب نظری برای تصمیم گیری بر اساس موجودیت دانش
کلمات کلیدی
رزومه شخصیت، تطبیق داده ها، پیوند ضبط، ارزیابی درخواست، محدوده پرس و جو، نمایندگی دانش، بهینه سازی دانش
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی

Entity resolution is the process of determining whether a collection of entity representations refer to the same entity in the real world. In this paper we introduce a theoretical framework that supports knowledge-based entity resolution. From a logical point of view, the expressive power of the framework is equivalent to a decidable fragment of first-order logic including conjunction, disjunction and a certain form of negation. Although the framework is expressive for representing knowledge about entity resolution in a collective way, the questions that arise are: (1) how efficiently can knowledge patterns be processed; (2) how effectively can redundancy among knowledge patterns be eliminated. In answering these questions, we first study the evaluation problem for knowledge patterns. Our results show that this problem is NP-complete w.r.t. combined complexity but in ptime w.r.t. data complexity. This nice property leads us to investigate the containment problem for knowledge patterns, which turns out to be NP-complete. We further develop a notion of optimality for knowledge patterns and a mechanism of optimizing a knowledge model (i.e. a finite set of knowledge patterns). We prove that the optimality decision problem for knowledge patterns is still NP-complete.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 549, 11 September 2014, Pages 101–126
نویسندگان
, ,