دانلود رایگان مقاله: آموزش تفسیر جملات توزیع شده برای تقسیم بندی داستان

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4977392	1451925	2018	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Learning distributed sentence representations for story segmentation

ترجمه فارسی عنوان

آموزش تفسیر جملات توزیع شده برای تقسیم بندی داستان

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

نمایندگی توزیع شده، شبکه عصبی عمیق نمایش موضعی موضعی، بردار ورد، بردار جمله

Deep neural network - شبکه عصبی عمیق Distributed representation - نمایندگی توزیع شده

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

آموزش تفسیر جملات توزیع شده برای تقسیم بندی داستان

چکیده انگلیسی

- A topical sentence representation is proposed for story segmentation.
- Multiple time resolution (MTR) BOW is used as neural network input to capture both slow and fast changing topics.
- Bottleneck features extracted from DNN-MTR model perform best.

Traditional sentence representations such as bag-of-words (BOW) and term frequency-inverse document frequency (tf-idf) face the problem of data sparsity and may not generalize well. Neural network based representations such as word/sentence vectors are usually trained in an unsupervised way and lack the topic information which is important for story segmentation. In this paper, we propose to learn sentence representation by using deep neural network (DNN) to directly predict the topic class of the input sentence. By using supervised training, the learned vector representation of sentences contains more topic information and is more suitable for the story segmentation task. The input of the DNN is BOW vector computed from a context window. Multiple time resolution BOW and bottleneck features (BNF) are also introduced to enhance the performance of story segmentation. As text data labeled with topic information is limited, we cluster stories into classes and use the class ID as the topic label of the stories for DNN training. We evaluated the proposed sentence representation with the TextTiling and normalized cuts (NCuts) based story segmentation methods on the topic detection and tracking (TDT2) task. Experimental results show that the proposed topical sentence representation outperforms both the BOW baseline and the recently proposed neural network based representations, i.e., word and sentence vectors.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Signal Processing - Volume 142, January 2018, Pages 403-411

نویسندگان

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : آموزش تفسیر جملات توزیع شده برای تقسیم بندی داستان

دسترسی سریع

ارتباط

English Website