کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
407130 678129 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Squeezing bottlenecks: Exploring the limits of autoencoder semantic representation capabilities
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Squeezing bottlenecks: Exploring the limits of autoencoder semantic representation capabilities
چکیده انگلیسی

We present a comprehensive study on the use of autoencoders for modelling text data, in which (differently from previous studies) we focus our attention on the various issues. We explore the suitability of two different models binary deep autencoders (bDA) and replicated-softmax deep autencoders (rsDA) for constructing deep autoencoders for text data at the sentence level. We propose and evaluate two novel metrics for better assessing the text-reconstruction capabilities of autoencoders. We propose an automatic method to find the critical bottleneck dimensionality for text representations (below which structural information is lost); and finally we conduct a comparative evaluation across different languages, exploring the regions of critical bottleneck dimensionality and its relationship to language perplexity.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 175, Part B, 29 January 2016, Pages 1001–1008
نویسندگان
, , ,