Article code: 391981
Journal code: 664584
Publication year: 2015
English article: 14-page PDF
Full-text version: Free download
English title of the ISI article
Towards non-monotonic sentence alignment
Related topics
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
English abstract

All previous works on sentence alignment were founded on the monotonicity assumption that coupled sentences occur in a similar sequential order on the two sides of bilingual parallel corpora (i.e., bitexts), leaving out the non-monotonicity in naturally-occurring bitexts. This paper presents the very first attempt to specifically address this practical issue in sentence alignment, by taking advantage of two observations: (1) an initial (or seed) alignment can be made available using accessible lexical resources and (2) sentences with high affinity in one language tend to have their counterparts with similar affinity in the other. They are incorporated as two constraints into semisupervised learning to formulate a novel and generalized solution for both monotonic and non-monotonic sentence alignment. Our evaluation on real-world data from two remote domains and an end-to-end MT evaluation show that while representative monotonic aligners suffer more severely from a higher degree of non-monotonicity, our approach is able to maintain a stable and competitive performance across the full spectrum of non-monotonicity.
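
The two observations in the abstract can be illustrated with a small sketch. This is not the authors' formulation: it assumes a simple propagation scheme in which seed alignment scores are repeatedly spread to pairs of sentences whose neighbours (by within-language affinity) are already aligned. All names (`propagate`, `align`, `alpha`, the toy matrices) are hypothetical and chosen only for illustration.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(r) for r in zip(*m)]


def row_normalize(m):
    """Scale each row to sum to 1 (all-zero rows are left as zeros)."""
    out = []
    for row in m:
        total = sum(row)
        out.append([v / total if total else 0.0 for v in row])
    return out


def propagate(seed, src_affinity, tgt_affinity, alpha=0.5, iterations=20):
    """Assumed update rule: F <- alpha * Ws F Wt^T + (1 - alpha) * F0.

    seed         -- F0, seed alignment scores (source x target)
    src_affinity -- affinity between source sentences (zero diagonal)
    tgt_affinity -- affinity between target sentences (zero diagonal)
    """
    ws = row_normalize(src_affinity)
    wt_t = transpose(row_normalize(tgt_affinity))
    scores = [row[:] for row in seed]
    for _ in range(iterations):
        spread = matmul(matmul(ws, scores), wt_t)
        scores = [
            [alpha * s + (1 - alpha) * f0 for s, f0 in zip(s_row, f0_row)]
            for s_row, f0_row in zip(spread, seed)
        ]
    return scores


def align(scores):
    """Pick the best target for each source sentence, with no ordering constraint."""
    return [max(range(len(row)), key=row.__getitem__) for row in scores]


# Toy bitext: the target side has its two halves swapped (non-monotonic).
# The seed only knows source 0 -> target 2 and source 2 -> target 0.
seed = [
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
# Sentences 0 and 1 are closely related, as are 2 and 3 (same on both sides).
affinity = [
    [0.0, 0.9, 0.1, 0.0],
    [0.9, 0.0, 0.0, 0.1],
    [0.1, 0.0, 0.0, 0.9],
    [0.0, 0.1, 0.9, 0.0],
]

scores = propagate(seed, affinity, affinity)
print(align(scores))  # → [2, 3, 0, 1]
```

In this toy case the unseeded sentences 1 and 3 pick up their counterparts through their high-affinity neighbours, and the resulting alignment crosses, which a strictly monotonic aligner could not produce.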

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Sciences - Volume 323, 1 December 2015, Pages 34–47