First-Feed LSTM model for video description

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
725709	1461216	2016	5 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Deep neural network (DNN)long short-term memory (LSTM)

موضوعات مرتبط

مهندسی و علوم پایه سایر رشته های مهندسی مهندسی برق و الکترونیک

پیش نمایش صفحه اول مقاله

First-Feed LSTM model for video description

چکیده انگلیسی

Video description (VD) aims to automatically generate descriptive natural language for videos. With its successful implementations and a broad range of applications, lots of work based on deep neural network (DNN) models have been put forward by researchers. This paper takes inspiration from an image caption model and develops an end-to-end VD model using long short-term memory (LSTM). Single video feature is fed to the first unit of LSTM decoder, and subsequent words of sentence are generated on previous predicted words. Experimental results on two publicly available datasets demonstrate that the performance of the proposed model outperforms that of baseline.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: The Journal of China Universities of Posts and Telecommunications - Volume 23, Issue 3, June 2016, Pages 89–93

نویسندگان

Wang Yue, Wang Xiaojie, Mao Yuzhao,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

First-Feed LSTM model for video description

دسترسی سریع

ارتباط

English Website