کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
393884 665704 2014 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Toward detection of aliases without string similarity
ترجمه فارسی عنوان
به سوی شناسایی نامهای مستعار بدون تشابه رشته
کلمات کلیدی
تشخیص نام مستعار، زیر مجموعه یادگیری فعال، تحت نظارت طبقه بندی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Entity aliases commonly exist. Accurately detecting these aliases plays a vital role in various applications. In particular, it is critical to detect the aliases that are intentionally hidden from the real identities, such as those of terrorists and frauds. Most existing work does not pay close attention to the aliases that have low/no string similarity to the given entities. In this paper, we propose a classifier that is based on active learning for detecting this type of aliasing. To minimize the cost of pair-wise comparison, a subset-based method is designed to restrict the selection within entity subsets. An active learning classifier is then employed in each entity subset to find the probability of whether a candidate is the alias of a given entity within the subset. After all of the results from the classifier are integrated, a list of aliases is returned for each given entity. For evaluation, we implemented four state-of-the-art methods and compared them with our proposed approach on three datasets. The results clearly demonstrate that this new active learning classifier is superior to those existing methods.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 261, 10 March 2014, Pages 89–100
نویسندگان
, , , , , ,