Squeezing bottlenecks: Exploring the limits of autoencoder semantic representation capabilities

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
407130	678129	2016	8 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Text representation - نمایش متن

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Squeezing bottlenecks: Exploring the limits of autoencoder semantic representation capabilities

چکیده انگلیسی

We present a comprehensive study on the use of autoencoders for modelling text data, in which (differently from previous studies) we focus our attention on the various issues. We explore the suitability of two different models binary deep autencoders (bDA) and replicated-softmax deep autencoders (rsDA) for constructing deep autoencoders for text data at the sentence level. We propose and evaluate two novel metrics for better assessing the text-reconstruction capabilities of autoencoders. We propose an automatic method to find the critical bottleneck dimensionality for text representations (below which structural information is lost); and finally we conduct a comparative evaluation across different languages, exploring the regions of critical bottleneck dimensionality and its relationship to language perplexity.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 175, Part B, 29 January 2016, Pages 1001–1008

نویسندگان

Parth Gupta, Rafael E. Banchs, Paolo Rosso,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Squeezing bottlenecks: Exploring the limits of autoencoder semantic representation capabilities

دسترسی سریع

ارتباط

English Website