کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
411944 679598 2015 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Chinese–Tibetan bilingual clustering based on random walk
ترجمه فارسی عنوان
چینی تایلندی خواندن دو زبانه بر اساس پیاده روی تصادفی
کلمات کلیدی
خوشه بندی چند منبع، خوشه دو زبانه، چینی-تبت، نمودار دو زبانه، پیاده روی تصادفی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

In recent years, multi-source clustering has received a significant amount of attention. Several multi-source clustering methods have been developed from different perspectives. In this paper, aiming at addressing the problem of Chinese–Tibetan bilingual document clustering, a novel bilingual clustering scheme is proposed, which can well capture both the intralingua document structures and interlingua document relations. The proposed scheme consists of three major phases. Firstly, to properly combine the feature structures of documents in different languages, a bilingual graph is constructed. In the second phase, two bilingual similarity matrices are computed based on the random walk performed in the bilingual graph. Finally, the similarity based clustering methods are performed on the two bilingual similarity matrices so as to generate cluster structures for documents in each language respectively, which lead to the corresponding bilingual clustering methods. Extensive experiments conducted on two Chinese–Tibetan bilingual document sets have confirmed the effectiveness of the proposed methods.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 158, 22 June 2015, Pages 32–41
نویسندگان
, , ,