کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6862209 677221 2016 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Cross-lingual sentiment classification: Similarity discovery plus training data adjustment
ترجمه فارسی عنوان
طبقه بندی احساسات متقابل زبان: کشف شباهت به همراه تنظیم داده های آموزشی
کلمات کلیدی
مدل موضوع، طبقه بندی احساسات متقابل زبانی، یادگیری نیمه نظارتی،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
The performance of cross-lingual sentiment classification is sharply limited by the language gap, which means that each language has its own ways to express sentiments. Many methods have been designed to transmit sentiment information across languages by making use of machine translation, parallel corpora, auxiliary unlabeled samples and other resources. In this paper, a new approach is proposed based on the selection of training data, where labeled samples highly similar to the target language are put into the training set. The refined training samples are used to build up an effective cross-lingual sentiment classifier focusing on the target language. The proposed approach contains two major strategies: the aligned-translation topic model and the semi-supervised training data adjustment. The aligned-translation topic model provides a cross-language representation space in which the semi-supervised training data adjustment procedure attempts to select effective training samples to eliminate the negative influence of the semantic distribution differences between the original and target languages. The experiments show that the proposed approach is feasible for cross-language sentiment classification tasks and provides insight into the semantic relationship between two different languages.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 107, 1 September 2016, Pages 129-141
نویسندگان
, , ,