کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6856198 1437948 2018 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Person name disambiguation on the web in a multilingual context
ترجمه فارسی عنوان
اظهارنظر شخصی در وب در یک متن چند زبانه است
کلمات کلیدی
مردم وب جستجو می کنند خوشه چند زبانه، ابهام نام ترجمه ماشین
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Person Name Disambiguation on the Web is the problem of grouping web pages retrieved by a search engine when looking for a person name according to the individual they refer to. This problem has been addressed in a monolingual scenario where all the search results are written in the same language. However, search engines can also return links to web pages written in different languages. We study how to address multilingualism for this problem using the MC4WePS data set, a recent gold standard that includes real search results written in different languages. For this purpose, we first analyze the suitability of using a translation tool to treat multilingualism with two state-of-the-art clustering algorithms. Since the use of this kind of tools increases the processing time of the disambiguation process, we propose an approach to deal with multilingualism that generalizes the monolingual scenario and does not require any translation resources. Our approach obtains better results than the translation approaches with the gold standard, making it a competitive choice in a real scenario.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 465, October 2018, Pages 373-387
نویسندگان
, , , ,