کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
387558 660905 2009 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A new document representation using term frequency and vectorized graph connectionists with application to document retrieval
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
A new document representation using term frequency and vectorized graph connectionists with application to document retrieval
چکیده انگلیسی

This paper presents a new document representation with vectorized multiple features including term frequency and term-connection-frequency. A document is represented by undirected and directed graph, respectively. Then terms and vectorized graph connectionists are extracted from the graphs by employing several feature extraction methods. This hybrid document feature representation more accurately reflects the underlying semantics that are difficult to achieve from the currently used term histograms, and it facilitates the matching of complex graph. In application level, we develop a document retrieval system based on self-organizing map (SOM) to speed up the retrieval process. We perform extensive experimental verification, and the results suggest that the proposed method is computationally efficient and accurate for document retrieval.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 36, Issue 10, December 2009, Pages 12023–12035
نویسندگان
, , ,