Memory-restricted latent semantic analysis to accumulate term-document co-occurrence events

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
536462	870529	2012	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Latent Semantic Analysis - تجزیه و تحلیل معنایی باقیمانده Co-occurrence - همایش Dimensionality reduction - کاهش ابعاد، فروکاهی ابعاد

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش صفحه اول مقاله

Memory-restricted latent semantic analysis to accumulate term-document co-occurrence events

چکیده انگلیسی

This paper addresses a novel adaptive problem of obtaining a new type of term-document weight. In our problem, an input is given by a long sequence of co-occurrence events between terms and documents, namely, a stream of term-document co-occurrence events. Given a stream of term-document co-occurrences, we learn unknown latent vectors of terms and documents such that their inner product adaptively approximates the target query-based term-document weights resulting from accumulating co-occurrence events. To this end, we propose a new incremental dimensionality reduction algorithm for adaptively learning a latent semantic index of terms and documents over a collection. The core of our algorithm is its partial updating style, where only a small number of latent vectors are modified for each term-document co-occurrence, while most other latent vectors remain unchanged. Experimental results on small and large standard test collections demonstrate that the proposed algorithm can stably learn the latent semantic index of terms and documents, showing an improvement in the retrieval performance over the baseline method.

► The input of our task is given by a long sequence of term-document co-occurrence events.
► The goal of our task is to learn term-documents weights give input stream.
► The weight between term and document is proportional to their co-occurrence rate.
► We propose a dimensionality reduction that approximates target term-document weights.
► Experiment results show that our algorithm learns gradually the target weight.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 33, Issue 12, 1 September 2012, Pages 1623–1631

نویسندگان

Seung-Hoon Na, Jong-Hyeok Lee,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Memory-restricted latent semantic analysis to accumulate term-document co-occurrence events

دسترسی سریع

ارتباط

English Website