کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
10368598 874919 2015 18 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Topic segmentation of TV-streams by watershed transform and vectorization
ترجمه فارسی عنوان
تقسیم بندی موضوعی جریان های تلویزیونی با تغییر آبریزش و بردار سازی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی
A fine-grained segmentation of radio or TV broadcasts is an essential step for most multimedia processing tasks. Applying segmentation algorithms to the speech transcripts seems straightforward. Yet, most of these algorithms are not suited when dealing with short segments or noisy data. In this paper, we present a new segmentation technique inspired from the image analysis field and relying on a new way to compute similarities between candidate segments called vectorization. Vectorization makes it possible to match text segments that do not share common words; this property is shown to be particularly useful when dealing with transcripts in which transcription errors and short segments makes the segmentation difficult. This new topic segmentation technique is evaluated on two corpora of transcripts from French TV broadcasts on which it largely outperforms other existing approaches from the state-of-the-art.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 29, Issue 1, January 2015, Pages 63-80
نویسندگان
, ,